Burny

@burnytech

On the quest to understand the fundamental mathematics of intelligence and of the universe with curiosity. http://burnyverse.com Upskilling @StanfordOnline

831
Followers
7,824
Following
110
Posts
13.10.2024
Joined

Latest posts by Burny @burnytech

GPT-5.2 derives a new result in theoretical physics | Discussion

13.02.2026 19:40 👍 1 🔁 1 💬 0 📌 0

I think about this often

09.02.2026 01:34 👍 1 🔁 0 💬 0 📌 0
When Models Manipulate Manifolds: The Geometry of a Counting Task We find geometric structure underlying the mechanisms of a fundamental language model behavior.

transformer-circuits.pub/2025/linebre...

09.02.2026 01:22 👍 2 🔁 0 💬 0 📌 0

My favorite research from Anthropic on reverse engineering how LLMs work internally

They "find learned representations of position and find dual interpretations:" they "can understand them as a family of discrete features or as a one-dimensional “feature manifold”/“multidimensional feature”."

09.02.2026 01:22 👍 2 🔁 0 💬 1 📌 0
A smooth galaxy from the Galaxy Zoo: Hubble project, classified by 51 volunteers.

A smooth galaxy, observed with the Hubble Space Telescope in the COSMOS survey.

It is at redshift 0.23 (lookback time 2.84 billion years) with coordinates (150.29382, 1.65700).

51 volunteers classified this galaxy in Galaxy Zoo: Hubble.

07.02.2026 19:29 👍 53 🔁 7 💬 0 📌 2

The only way we don't have AI is if you invented time travel and mercilessly slaughtered everyone involved with matrix multiplication in history, every time it's discovered, forever. Good luck banning math.

08.02.2026 22:28 👍 17 🔁 4 💬 2 📌 2

That's not a real thing that you can do. You can learn how to use it effectively and teach your students how to use it safely, which you should. There *is no stopping it* because it has already proven it's a viable product. The question is how do we live with it, not do we live with it.

08.02.2026 22:27 👍 26 🔁 1 💬 3 📌 2

In a rush, I approved a plan for Claude to train on the eval dataset 🤦

09.02.2026 00:58 👍 2 🔁 2 💬 1 📌 0

M78: Reflecting Blue in a Sea of Red apod.nasa.gov/apod/ap26012...
In the Orion Molecular Cloud complex, several bright blue nebulas are particularly apparent. Pictured here in the center are 2 of the most prominent reflection nebulas - dust clouds lit by reflecting light of bright embedded stars

28.01.2026 12:18 👍 912 🔁 118 💬 25 📌 10

Holy moly this chart: Cumulative US measles cases

28.01.2026 12:54 👍 4675 🔁 2240 💬 202 📌 640

Large-Scale Identification of Novel Protein Biomarkers and Therapeutic Targets in Heart and Brain Disease https://www.medrxiv.org/content/10.64898/2026.01.26.26344874v1

29.01.2026 05:55 👍 1 🔁 1 💬 0 📌 0

ESA/Hubble photo of a distant stellar birthplace, region of the N159 star-forming complex in the Large Magellanic Cloud, approximately 160 000 light-years away. Photo credit: ESA/Hubble & NASA, R. Indebetouw

Link for more information: esahubble.org/images/potw2...

28.01.2026 22:51 👍 2377 🔁 408 💬 42 📌 17

waiting for some experiments to run, so a quick thread about base models and pretraining contamination, with some weird & interesting base model generations i've collected over time.

or, why do open source models claim to be claude or chatgpt?

29.01.2026 01:11 👍 104 🔁 18 💬 4 📌 1

Me in 2005: Moore's Law means that computers will get super fast over a very short period of time!

Me in 2026: my computer freezes up if I leave my browser open for too long

28.01.2026 18:14 👍 888 🔁 106 💬 31 📌 13

The Pleiades star cluster (M45) captured over 25 hours. The blue glow is starlight reflecting off interstellar dust. The surrounding brown structures are the Integrated Flux Nebula, incredibly faint clouds lit by the glow of our entire galaxy.
#astrophotography #astronomy #space #m45

28.01.2026 19:03 👍 1574 🔁 234 💬 45 📌 15
GitHub - antfu/skills: Anthony Fu's curated collection of agent skills. Anthony Fu's curated collection of agent skills. Contribute to antfu/skills development by creating an account on GitHub.

late to the party. I have finally been convinced by multiple awesome developers to give agents another try.

this is my first premature contribution:
github.com/antfu/skills

28.01.2026 06:51 👍 116 🔁 6 💬 7 📌 1
GitHub - google/A2UI Contribute to google/A2UI development by creating an account on GitHub.

A2UI: Agent-to-User Interface

29.01.2026 05:36 👍 3 🔁 1 💬 0 📌 0
3-panel comic. (1) [Three small arthropods on ocean floor.] ARTHROPOD 1: Now that we’re multicellular, what are your plans? I’m gonna evolve little legs and swim around with them! ARTHROPOD 2: I’m gonna evolve sharp pincers and use them to crunch stuff! ARTHROPOD 3: I’m gonna evolve glands to make string from my butt and use it to construct elaborate geometric nets hundreds of times my size to catch other animals. (2) [Silence] (3) ARTHROPOD 1: *Dude.* ARTHROPOD 2: Can you *please* just be normal about this? ARTHROPOD 3: *What??!*

Early Arthropods

xkcd.com/3199/

28.01.2026 19:57 👍 4566 🔁 881 💬 27 📌 24

Subthreshold Kir and Ih currents modulate excitability of Layer 1 VIP interneurons in the medial prefrontal cortex https://www.biorxiv.org/content/10.64898/2026.01.28.702118v1

29.01.2026 07:15 👍 3 🔁 1 💬 0 📌 0

I got a long email with research questions, full of long equations written in LaTeX. So instead of compiling this with LaTeX, I dumped it into Gemini, asking it to render the equations for better readability.

Gemini did it ... and started to answer the questions, which it was not asked to do 😂.

29.01.2026 07:18 👍 14 🔁 1 💬 1 📌 0
Is the Policy Gradient a Gradient? The policy gradient theorem describes the gradient of the expected discounted return with respect to an agent's policy parameters. However, most policy gradient methods drop the discount factor from t...

Writing the policy gradient lecture, which gives a great opportunity to discuss the sneaky missing discount factor that makes almost every paper wrong:
arxiv.org/abs/1906.07073

29.01.2026 03:17 👍 43 🔁 4 💬 3 📌 0

Search for τ⁻→μ⁻μ⁻μ⁺ decays at the LHCb experiment with Run 2 data arxiv.org/abs/2601.20785 - An upper limit of 1.9(2.3)×10⁻⁸ is set at the 90% (95%) confidence level on the branching fraction of the decay.

29.01.2026 07:46 👍 5 🔁 2 💬 0 📌 0

I view AI image generation as a sort of new type of computer graphics pipeline. We constantly use computers to render graphics so probably these systems will find a lot of uses, especially as they improve and we distill the algorithms

But we had to learn how to use photography. Learn the mechanics.

29.01.2026 07:54 👍 3 🔁 1 💬 1 📌 0

Converging evidence of positive selection at height-associated loci in Europe https://www.biorxiv.org/content/10.64898/2026.01.27.702172v1

29.01.2026 07:31 👍 3 🔁 1 💬 0 📌 0
Engagement, user satisfaction, and the amplification of divisive content on social media Abstract. Social media ranking algorithms typically optimize for users’ revealed preferences, i.e. user engagement such as clicks, shares, and likes. Many

This is pretty fascinating: academic.oup.com/pnasnexus/ar...
(1) Twitter's algorithm makes its users feel worse about their outgroup
(2) Users do not prefer the tweets it suggests relative to simple baselines

28.01.2026 15:35 👍 67 🔁 16 💬 3 📌 4

Canaries in the coal mine. Worth paying attention to.

(And yes, they are both obviously interested in seeing their own products used, but I am hearing enough from other, independent coders to make me believe them. I wrote more about the shift here: www.oneusefulthing.org/p/management...)

28.01.2026 20:56 👍 125 🔁 15 💬 12 📌 19

I’ve always liked programming. But I’ve been hearing similar reports from engineers who I respect, and I’m seeing this shift in my own work. IDK what the future holds, but programming as we knew it has already changed.

28.01.2026 21:42 👍 40 🔁 4 💬 5 📌 0
Text Shot: While "thinking" models are powerful, they have historically been siloed - great at math, but poor at browsing the web or running code. Qwen3-Max-Thinking bridges this gap by effectively integrating "thinking and non-thinking modes".

The model features adaptive tool-use capabilities, meaning it autonomously selects the right tool for the job without manual user prompting. It can seamlessly toggle between:

Web Search & Extraction: For real-time factual queries.
Memory: To store and recall user-specific context.
Code Interpreter: To write and execute Python snippets for computational tasks.
In "Thinking Mode," the model supports these tools simultaneously. This capability is critical for enterprise applications where a model might need to verify a fact (Search), calculate a projection (Code Interpreter), and then reason about the strategic implication (Thinking) all in one turn.

Empirically, the team notes that this combination "effectively mitigates hallucinations," as…

Qwen3-max-thinking-beats-gemini-3-pro-and-gpt-5-2-on-humanitys-last-exam venturebeat.com/technology/qwe… #AI #Qwen

29.01.2026 06:20 👍 2 🔁 1 💬 0 📌 0

Amazing

29.01.2026 07:52 👍 0 🔁 0 💬 0 📌 0

Semantic axes in the brain support analogical representations https://www.biorxiv.org/content/10.64898/2026.01.28.702241v1

29.01.2026 07:15 👍 4 🔁 1 💬 1 📌 0