
Eran Malach

@emalach

Research Fellow @ Kempner Institute, Harvard University. Theory of Deep Learning / Learning of Deep Theory

504 Followers
83 Following
2 Posts
Joined 19.11.2024

Latest posts by Eran Malach @emalach

In our newest work (led by the amazing @sunnytqin.bsky.social, w/ @emalach.bsky.social, Samy Jelassi), we investigate a core question for LLMs: "to backtrack or not to backtrack" in two prototypical logic-heavy puzzles: CountDown and Sudoku.

11.04.2025 16:29 👍 3 🔁 2 💬 1 📌 0

Will be presenting this work at #NeurIPS2024, today 11am, poster #2311. Come visit us!

12.12.2024 16:45 👍 10 🔁 1 💬 0 📌 0

Heading to NeurIPS tomorrow ✈️
Will be presenting a few papers during the week. Ping me if you want to chat!

09.12.2024 14:55 👍 2 🔁 0 💬 0 📌 0

I defended my PhD dissertation back in May. I didn't have time to share it widely then (newborn baby), but I think some of you might enjoy it, especially the opening chapters: benjaminedelman.com/assets/disse...

02.12.2024 00:20 👍 31 🔁 3 💬 3 📌 1

Just put together a starter pack for Deep Learning Theory. Let me know if you'd like to be included or suggest someone to add to the list!

go.bsky.app/2qnppia

22.11.2024 21:35 👍 87 🔁 31 💬 29 📌 5

How does test loss change as we change the training data? And how does this interact with scaling laws?

We propose a methodology to approach these questions by showing that we can predict the performance across datasets and losses with simple shifted power law fits.

21.11.2024 15:11 👍 19 🔁 7 💬 1 📌 2