Video summary of deliberative alignment
youtu.be/1efVS4DeEOs
Links:
- Paper: arxiv.org/abs/2412.16339
- Blog: openai.com/index/delibe...
Video summary of recent work on alignment faking
www.youtube.com/watch?v=_1bz...
Had a great time at NeurIPS.
Thank you to everyone I got to talk to, especially at the poster sessions.
And thanks to the organizers for picking a beautiful location (the video is from a nearby hike with Vikrant).
www.youtube.com/watch?v=MBGI...
Clearly, I took the #runconference seriously.
How does data scale influence performance?
NeurIPS 2024 poster presentation
By @vishaalurao.bsky.social
youtu.be/YNZ23YPasXo
The GRAB benchmark
Work with Jonathan Roberts and Kai Han
youtu.be/XW3YdNATjIU
Will be at NeurIPS next week.
DM if you're interested in meeting up for a chat (or a jog).
🚀New Paper: Active Data Curation Effectively Distills Multimodal Models
arxiv.org/abs/2411.18674
Smol models are all the rage these days & knowledge distillation (KD) is key for model compression!
We show how data curation can effectively distill to yield SoTA FLOP-efficient {C/Sig}LIPs!!
🧵👇
PrimeIntellect have released their tech report on INTELLECT-1: t.co/8hnoTILaL3
The first open-source worldwide training of a 10B model. The underlying distributed ML algo is DiLoCo (arxiv.org/abs/2311.08105), but they also built tons of engineering on top of it to make it scalable.
This is a nice benchmark for AI R&D
LLMs are closing the gap to humans
Details: metr.org/AI_R_D_Evalu...
Hello world