Perfil monitorado

Lilian Weng

@lilianweng

Co-founder of Thinking Machines Lab @thinkymachines; Ex-VP, AI Safety & robotics, applied research @OpenAI; Author of Lil'Log

Posts coletados: 12 posts
Última publicação: Último · 7 de jul, 02:58
Frequência: Sync · 15 min

7 jul · 02:58·ver no X
new post on harness engineering for AI self-improvement: t.co/ZYvGfVs61k It is hard to forecast how much the future of RSI will rely on harnesses. Likely harness engineering will evolve in the direction of self-improvement and enable auto-research, and, in turn, smarter models keeps harnesses simple. Even when many harness improvement get eventually internalized into core model, the need to specify goals and context will not disappear.
25 jun · 17:06·ver no X
A super long overdue (3+ years?) post on scaling laws. Compute is expensive. Scaling laws are a way to help us reason about the optimal compute allocation between data and model size before committing to a large run. The post covers what scaling laws predict, how compute-optimal allocation works, why Kaplan et al. and Chinchilla disagree, and how data limits + fitting details make extrapolation tricky. t.co/HP26eJvjHB
25 jun · 17:17·ver no X
I wonder how many more people will just ask model to summarize the post, in comparison to a year ago. 😂
25 jun · 17:18·ver no X
Soon I should set up the model to update Lil’Log automatically but I’m not there yet.
19 mai · 14:40·ver no X
We would love to see more collaboration and research in the field of human-AI interactivity. Check it out! x.com/thinkymachines…
17 mai · 21:58·ver no X
I only recently read more about the concept of system accidents by Charles Perrow, very insightful and relatable.
11 mai · 17:50·ver no X
In the past few months, we had a lot of fun (and stress 😅) to produce 12 versions (+ many subversions) and 137 pages in our training run log book. Turns out human-human collaboration is important to improving human-AI collaboration. 😊 x.com/thinkymachines…
10 mar · 14:08·ver no X
Building technologies for better human-AI collaboration on next gen hardware at scale. Exciting. x.com/thinkymachines…
15 jan · 01:03·ver no X
I’ve been telling people this a lot today: I enjoy so much working with people who care about what they are building and craftsmanship. It is a privilege to have a chance to work on something I’m passionate about, beyond making a living. I cherish it and don’t take it for
27 out · 14:31·ver no X
On-policy distillation provides an elegant way to use the teacher model as a process reward model to provide dense reward while preventing SFT style "OOD shock" during rollout. x.com/thinkymachines…
1 out · 15:29·ver no X
GPUs are expensive and setting up the infrastructure to make GPUs work for you properly is complex, making experimentation on cutting-edge models challenging for researchers and ML practitioners. Providing high quality research tooling is one of the most effective ways to
26 set · 16:03·ver no X
Looking through those little hidden gem stories in the footnote, you will find it so inspiring that researchers with interests on the same topic are able to work together to advance a field despite their roles and locations. This is the power of open science and community. x.com/thinkymachines…