Must read on Chinese open source from Kevin Xu with the very similarly named substack (story for another time)
interconnect.substack.com/p/chinese-op...
@shubhendu
Interests on bsky: ML research, applied math, and general mathematical and engineering miscellany. Also: Uncertainty, symmetry in ML, reliable deployment; applications in LLMs, computational chemistry/physics, and healthcare. https://shubhendu-trivedi.org
War is war.
AAAI sends out emails so vague that you can't even tell whether you were sent a reviewer or an area chair invitation.
Nature research paper: Compact deep neural network models of the visual cortex
go.nature.com/3OKRXZU
Surprise, then defensive, then basically acknowledging.
We put probabilistic circuits into diffusion language models and got a big boost in reasoning performance!
tbc, each time this happened, I did mention it to the student i.e. that I can't provide useful input if I am not able to discern what they know.
It's strange. I was only asking stuff like -- why this area? how did you get interested? What have you read or looked into? It was a way for me to probe their internal state to make some suggestions. Doesn't need perfect answers. I am terribly inarticulate these days, I would have empathized!
Willing to accept all this clockwork-like triumphalism about being "proven right about the AI bubble" on any correction if the above just gets done with. bsky.app/profile/shub...
Tech has continued on the distribution trend. Now in its 6th month. Since the date below you see it went up, but was again pushed below the 100 DMA. I hope the geopolitical turmoil and turbulence all around gives tech a reason to quickly puke 15-20% and get done with it. So bored of this crap.
Are AI models effective collaborators, or mere assistants awaiting your next command? (Preprint: arxiv.org/abs/2602.24188)
To find out, we make AI collaborate with itself, in private information games: tasks that require sharing private information, like this chess board ordering task.
*was easier
I mean, it was not like I was offering a position or anything, it was purely about how to navigate getting into research. So it baffled me. Also not the only time it has happened.
I tend to talk to undergrads (although collaborating as easier as a free agent). I just had a chat the other day, when a student reached out to me, mentioned interest in some area, and wanted advice on how to approach a PhD in said area. But for _even that_ he was clearly reading off LLM responses.
An arrangement of seven squares. Six of the squares are identical and are arranged so that reading from left to right they form three stacks that abut and are of heights 2,3,1, with each base square aligned with the second square in the stack to its left. A tilted larger square overlays the six and shares a vertex with the lower right vertex of the bottom-most square. The top left vertex of the uppermost of the six squares lies on an edge of the larger square. A line is drawn from the left-hand vertex of the larger square to the lower right vertex of the rightmost small square. The angle formed by this line and the left-hand edge of the larger square is marked with a question mark.
notes.mathforge.org/notes/publis...
#geometrypuzzle #UKMathsChat #mathsky
NB: I am not labeling it as "evil and demonic" -- that was Schmitt's appellation, who also argued for leaning into "technicity" and to follow it to "its logical conclusion."
master the new technology and which type of genuine friend-enemy groupings can develop on this new ground."
Also reminded me of Yuk Hui's writings, e.g. www.e-flux.com/journal/153/... which also engages with the sociotechnics of the Nomos and the "evil and demonic spirit" of Schmitt's technicity.
process of neutralization; every strong politics will make use of it. For this reason, the present century can only be understood provisionally as the century of technology. How ultimately it should be understood will be revealed only when it is known which type of politics is strong enough to
Slightly different, but reminded me of this from The Age of Neutralizations and Depoliticizations (1929): "The process of continuous neutralization of various domains of cultural life has reached its end because technology is at hand. Technology is no longer neutral ground in the sense of the
I have added a new tutorial on discrete diffusion models:
github.com/gpeyre/ot4ml
Long time lurker, first time poster. My thesis, titled "Scalable Kernel-Based Distances for Statistical Inference and Integration" is now on arxiv: arxiv.org/abs/2602.21846 .
The results in core chapters (3, 4, 5, 6) are previously published work; Chapter 5 has bonus results on UQ for BQ.
Back in Århus and TIL that Itô spent about 2 years here in the 60s, between his time at Stanford and Cornell. Somehow I had completely missed this. There is even a Springer book covering lectures he gave here:
Thank you. I was just headed to the local church of scientology to register. Now I am on an Uber back, reconsidering.
Geoffrey Hinton
Hopefully the string method won't become another ML meme. It's not common to see ML papers using it despite the natural conceptual alignment
arxiv.org/abs/2602.22122
*professional skeptic
Holier than thou 100x, like the sage who prematurely escaped the Himalayan cave. They are like a very prominent point in the choreographed persona graph, very much like the "silicon valley founder."
This is very much similar to the professional skeptical, cynical "high brow" academic persona. The only uncorrupted judges in the room. Like looking at the employer or net worth of an individual before examining anything said by them ("follow the money," "uncover the bias").
Some of the choreography is also necessary for hiring (otherwise, people would not know about you). But a lot of the time it is really about developing psychological armour. There is a lot of pressure, the odds are against you, so you are better off offloading the failures to a persona.
But it makes sense. Due to SM feedback loops over the years the whole thing has converged to a highly choreographed performance (due to mimetic imitation of highly successful founders). VCs are also not attentive (you would know if you have done pitching) and are highly reliant on pattern matching.
Not just that. They start aligning with this whole persona of being a founder. Similar ways of announcing things, mystery ("something new," "stepping back," "stay tuned," who cares), even style of pictures, the skill > destiny sort of enlightened posting. Feigning being pumped at any random thing.
Agents interact with environments to get information. But exploration (tools, retrieval, user interaction) is costly.
Calibrate-Then-Act allows LLM agents to balance exploration and cost:
- Estimate uncertainty about the environment
- Reason about cost-uncertainty tradeoffs
- Act accordingly
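
To make the three steps concrete, here is a minimal sketch of one possible cost-aware decision rule. It is not the method from the Calibrate-Then-Act work: the function names, the `reward` and `cost` parameters, and the assumption that the agent can estimate its confidence before and after exploring are all hypothetical, chosen only for illustration.

```python
def should_explore(p_now: float, p_after: float, reward: float, cost: float) -> bool:
    """Return True if exploring is expected to pay for itself.

    p_now   -- (assumed) calibrated probability of success if the agent acts now
    p_after -- (assumed) estimated probability of success after exploring
    reward  -- payoff for a successful action
    cost    -- price of one exploration step (tool call, retrieval, user query)
    """
    expected_gain = (p_after - p_now) * reward
    return expected_gain > cost


def decide(p_now: float, p_after: float, reward: float = 1.0, cost: float = 0.1) -> str:
    """Hypothetical policy: explore only when the expected gain exceeds the cost."""
    return "explore" if should_explore(p_now, p_after, reward, cost) else "act"


if __name__ == "__main__":
    # Very uncertain agent: exploring is worth the cost.
    print(decide(p_now=0.5, p_after=0.9))
    # Already confident agent: exploring adds little, so act directly.
    print(decide(p_now=0.95, p_after=0.99))
```

The rule only works if the probabilities are well calibrated, which is presumably why the uncertainty estimate comes first.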