🥳 I am incredibly humbled and grateful to share that our work, "Aligning machine and human visual representations across abstraction levels," has been published today in @nature.com ⬇️
No two humans behave exactly alike. But what about neural networks? We found early evidence that human-like individual differences in behavior emerge from networks trained with different initializations. Here's a peek at our results, to be presented at UniReps & DBM @NeurIPS. Full paper on the way!
Presenting our #NeurIPS2025 work on model–behavior alignment today.
Could we even recognize the “right” model of behavior under flexible evaluation?
Come chat about DNNs & human visual perception!
Hall C-E #2010
Friday (today!) 4:30–7:30 PM
neurips.cc/virtual/2025...
Kudos to our NeurIPS 2025 reviewers for thoughtful, human-generated reviews. I'll be presenting poster #2010 in San Diego on Fri, 5 Dec from 4:30–7:30 p.m. PT. Come say hi!
arXiv: arxiv.org/abs/2510.23321
Code and data: github.com/brainsandmachines/oddoneout_model_recovery
Our work reveals a sharp trade-off between predictive accuracy and model identifiability. Flexible mappings maximize predictivity, but blur the distinction between competing computational hypotheses.
Further analyses showed that linear probing was the culprit. The linear fit warps each model's original feature space, erasing its unique signature and making all aligned models converge toward a human-like representation.
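A minimal sketch of how to quantify that convergence, assuming raw features X1, X2 for two models and probes W1, W2 already fit to the same human judgments (the fitting itself is omitted, and linear CKA is our choice of similarity index, not necessarily the paper's):

import numpy as np

def linear_cka(X, Y):
    # Centered linear CKA between feature matrices (n_items x dim).
    X = X - X.mean(0)
    Y = Y - Y.mean(0)
    num = np.linalg.norm(X.T @ Y, 'fro') ** 2
    den = np.linalg.norm(X.T @ X, 'fro') * np.linalg.norm(Y.T @ Y, 'fro')
    return num / den

rng = np.random.default_rng(0)
X1, X2 = rng.normal(size=(1854, 512)), rng.normal(size=(1854, 768))  # placeholder features
W1, W2 = rng.normal(size=(512, 64)), rng.normal(size=(768, 64))      # placeholder probes

# With probes actually fit to the same human choices, the aligned
# similarity rises well above the raw one.
print('raw CKA:    ', linear_cka(X1, X2))
print('aligned CKA:', linear_cka(X1 @ W1, X2 @ W2))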
The key dependent measure is how often the data-generating model actually achieves the highest prediction accuracy. The surprising result: even with massive datasets (millions of trials), the best-performing model is often not the right one.
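In code, the measure reduces to an argmax over a generators x candidates accuracy matrix (a toy sketch; recovery_rate is our name, not the repo's API):

import numpy as np

def recovery_rate(acc):
    # acc[g, c]: cross-validated accuracy of candidate c on data generated by g.
    winners = acc.argmax(axis=1)
    return (winners == np.arange(acc.shape[0])).mean()

# Toy 20x20 matrix: under flexible evaluation, off-diagonal accuracies
# crowd the diagonal and the recovery rate drops toward chance.
acc = np.random.default_rng(1).uniform(0.60, 0.65, size=(20, 20))
print(recovery_rate(acc))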
Each simulation worked like this: (1) pick one model from 20 candidate NNs and fit it to human responses; (2) sample a synthetic dataset from that model using NEW triplets; (3) test all 20 models on this generated data, measuring cross-validated prediction accuracy.
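A runnable toy version of that loop, ending with the recovery rate from the previous snippet (our stand-in, not the released pipeline: "models" are random embedding matrices, the fitting step is skipped, and accuracy is plain holdout agreement):

import numpy as np
rng = np.random.default_rng(0)

def choice_probs(Z, t):
    # Softmax over pair similarities; entry k is P(image k is the odd one).
    a, b, c = Z[t[:, 0]], Z[t[:, 1]], Z[t[:, 2]]
    s = np.stack([(b * c).sum(1), (a * c).sum(1), (a * b).sum(1)], axis=1)
    e = np.exp(s - s.max(1, keepdims=True))
    return e / e.sum(1, keepdims=True)

n_img = 50
models = [rng.normal(size=(n_img, 16)) for _ in range(20)]  # 20 candidate "NNs"
triplets = np.array([rng.choice(n_img, 3, replace=False) for _ in range(2000)])

acc = np.zeros((20, 20))
for g, gen in enumerate(models):                              # (1) pick a generator
    p = choice_probs(gen, triplets)
    y = np.array([rng.choice(3, p=pi) for pi in p])           # (2) sample synthetic choices
    for m, cand in enumerate(models):                         # (3) test all 20 candidates
        acc[g, m] = (choice_probs(cand, triplets).argmax(1) == y).mean()

print('recovery rate:', (acc.argmax(1) == np.arange(20)).mean())  # high without flexible mapping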
We ran model recovery simulations using models fitted to the massive THINGS odd-one-out data shared by @martinhebart.bsky.social, @cibaker.bsky.social, et al. Each simulation tested whether a neural network model would “win” the model comparison if it had generated the behavioral data.
In our new NeurIPS 2025 paper, we ask: does better predictive accuracy necessarily mean better mechanistic correspondence between neural networks and human representations? neurips.cc/virtual/2025...
They also showed that if we nudge the NN representations toward human judgments by linearly transforming the representation space itself, cross-validated prediction accuracy is boosted almost to the reliability bound. arxiv.org/abs/2211.01201
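For intuition, a hedged sketch of the kind of objective such a linear map W could be trained on, loosely after arxiv.org/abs/2211.01201 (the actual loss, regularization, and API differ):

import numpy as np

def triplet_nll(W, feats, triplets, choices):
    # feats: (n_images, d) frozen model features; triplets: (n, 3) image ids;
    # choices: 0/1/2, index of the human-chosen odd image in each triplet.
    Z = feats @ W
    nll = 0.0
    for (a, b, c), odd in zip(triplets, choices):
        za, zb, zc = Z[a], Z[b], Z[c]
        # Choosing image k as odd means the other two form the most similar
        # pair, so the logits are the three pairwise similarities.
        logits = np.array([zb @ zc, za @ zc, za @ zb])  # odd = a, b, c respectively
        p = np.exp(logits - logits.max())
        p /= p.sum()
        nll -= np.log(p[odd])
    return nll / len(triplets)  # minimize w.r.t. W with any gradient-based optimizer

# Toy call with random placeholders:
rng = np.random.default_rng(3)
feats, W = rng.normal(size=(10, 16)), rng.normal(size=(16, 8))
trips = np.array([rng.choice(10, 3, replace=False) for _ in range(5)])
print(triplet_nll(W, feats, trips, rng.integers(0, 3, size=5)))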
@lukasmut.bsky.social, @lorenzlinhardt.bsky.social, et al. showed that neural network representations can be strong predictors of human odd-one-out judgments: the image humans select as “odd” among three is often the one whose activation pattern differs most from the other two.
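The readout itself is tiny; a sketch with dot-product similarity (the similarity measure is our assumption here; the cited work explores several):

import numpy as np

def odd_one_out(za, zb, zc):
    # Predicted "odd" image: the one left out of the most similar pair.
    z = [za, zb, zc]
    sims = {(i, j): float(z[i] @ z[j]) for i in range(3) for j in range(i + 1, 3)}
    i, j = max(sims, key=sims.get)      # the most similar pair stays together
    return ({0, 1, 2} - {i, j}).pop()   # the remaining index is the odd one out

rng = np.random.default_rng(2)
print(odd_one_out(*rng.normal(size=(3, 128))))  # -> 0, 1, or 2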
Excited to share my first paper: Model–Behavior Alignment under Flexible Evaluation: When the Best-Fitting Model Isn't the Right One (NeurIPS 2025). Link below.