Alexander Doria's Avatar

Alexander Doria

@dorialexander

LLM for the commons.

7,601
Followers
694
Following
1,939
Posts
02.09.2023
Joined
Posts Following

Latest posts by Alexander Doria @dorialexander

Jamais réussi à lire non plus. Et même sentiment : pas vraiment de vie là-dedans.

05.03.2026 21:17 👍 0 🔁 0 💬 1 📌 0
Post image

oh yes, obviously, i can make this now

05.03.2026 00:05 👍 18 🔁 1 💬 1 📌 0

who talk about cleanly?

04.03.2026 22:10 👍 4 🔁 0 💬 1 📌 0

Well 10 years of teaching it… Likely last time.

04.03.2026 22:09 👍 6 🔁 0 💬 1 📌 0

I guess Donald Knuth must have thought of that :)

04.03.2026 21:02 👍 0 🔁 0 💬 2 📌 0

just realized that jupyter is probably dead as a concept. it's all md+scripts now.

04.03.2026 20:35 👍 81 🔁 8 💬 9 📌 7

more seriously: i still think "computation" is also happening internally (just in a smooth/transient way, not that dissimilar to actual math search prior formal verification)

04.03.2026 00:39 👍 8 🔁 0 💬 1 📌 0

I'm afraid this is anthropomorphizing. The proof was there all along in future training data.

03.03.2026 23:47 👍 50 🔁 3 💬 2 📌 1
Post image

Nothing to see, just very powerful pattern matching. www-cs-faculty.stanford.edu/~knuth/paper...

03.03.2026 23:36 👍 214 🔁 44 💬 11 📌 20

actually, yes.

03.03.2026 07:25 👍 1 🔁 0 💬 0 📌 0

Not sure for the US, but in Europe started very early on (even q3 2023) with their positioning on safety/alignment and avoiding the mess openai got into at the same time (GDPR blocks, etc.)

03.03.2026 00:29 👍 1 🔁 0 💬 0 📌 0

(Our next release will actually be personas)

02.03.2026 22:59 👍 6 🔁 1 💬 1 📌 0

Would also open up the much more interesting question of how to design and tune personas. I’m currently switching to agentic model training and simulated personas are everywhere, one of the absolute core original seed.

02.03.2026 22:58 👍 18 🔁 1 💬 1 📌 0

Oh been part-time there for a while now. Always good to have a platform plan b.

02.03.2026 20:20 👍 3 🔁 0 💬 1 📌 0

Models should design, models should populate, models should compile.

02.03.2026 17:46 👍 14 🔁 1 💬 0 📌 0

Sorry to say that I'm slowly becoming anti-handcraft RL environments. Textbook bitter lesson.

02.03.2026 17:46 👍 32 🔁 1 💬 4 📌 1
Post image

Some example of how it works in practice: after golden gate claude, you can get red baguette.

01.03.2026 15:28 👍 9 🔁 1 💬 1 📌 0
Preview
lyraaaa/baguettotron-SAE-L48-8x-k16-774m · Hugging Face We’re on a journey to advance and democratize artificial intelligence through open source and open science.

SAE weights are now available. I think this make Baguettotron the smallest yet effective model available for merch interp research. huggingface.co/lyraaaa/bagu...

01.03.2026 15:23 👍 5 🔁 0 💬 1 📌 0

je crains que ce soit surtout des idées maintenant répandues dans l'électorat cœur. dérive collective…

28.02.2026 12:56 👍 1 🔁 0 💬 0 📌 0

By all account most used llm training technique, most critical one for synth pipelines, and, ThinkingMachine aside, very few committed research in the open.

28.02.2026 09:49 👍 7 🔁 0 💬 0 📌 0

Frankly the number of unsettled topics on SFT is insane.

28.02.2026 09:47 👍 10 🔁 0 💬 1 📌 0

He definitely but also seems to come more on the data design side: worked on early ChatGPT persona/behavior with Joanne Jang, recently on a literary model bundled inside gpt-5.

28.02.2026 06:18 👍 6 🔁 0 💬 0 📌 0

forever relieved to not have spent the last two years on prompt layer orchestration

26.02.2026 23:54 👍 17 🔁 0 💬 1 📌 0

Thanks Glyn for supporting our work and its recent global turn! So far the total amount in grant for Common Corpus is still about zero…

26.02.2026 21:08 👍 19 🔁 3 💬 0 📌 0

we had a hard time…

26.02.2026 16:31 👍 1 🔁 0 💬 0 📌 0
Post image

Last week, we presented officially our (famed) zip drive on French podcast A la French which got many people curious. www.youtube.com/watch?v=wirm...

26.02.2026 16:06 👍 13 🔁 1 💬 1 📌 0
Post image

Since Baguettotron is currently buzzing in France right now, announcing the first official demo on HuggingFace (in arena mode vs. gemma-270m). huggingface.co/spaces/PleIA...

26.02.2026 16:00 👍 34 🔁 1 💬 1 📌 0
Post image

New amazing interpretability/SAE work on Baguettotron! Almost surprised how much the high entropy section are actually connected to analytical features: lyramakesmusic.github.io/bread-slicer/

24.02.2026 12:00 👍 57 🔁 5 💬 1 📌 0

Sure, but at least in Europe, ambiguity is *very* unhelpful. If that's what we really mean, I think we need better words.

24.02.2026 10:41 👍 3 🔁 0 💬 0 📌 0

(latest Bender bizarre anti-Doctorow thread was all about reframing stochastic parrot to be only about harms + open data — except to have been in this space for years, hardly ever saw her)

23.02.2026 23:05 👍 3 🔁 0 💬 1 📌 0