We have a little new paper at ICLR led by @AntonBushuiev.
Test time training for proteins :)
arxiv.org/abs/2411.02109
For example, you know the sequence that you want to fold, so fine-tune ESM on it at test time to get a better ESMFold structure prediction!
05.03.2026 15:56
π 5
π 2
π¬ 0
π 0
Joint work with @roman-bushuiev.bsky.social, and Olga Pimenova, Nikola Zadorozhny, Raman Samusevich, Elisabet Manaskova, @eunbelivable.bsky.social , @hannes-stark.bsky.social, Jiri Sedlar, @martinsteinegger.bsky.social , @pluskal-lab.org, @josef-sivic.bsky.social
05.03.2026 12:08
π 3
π 0
π¬ 0
π 0
More broadly, ProteinTTT can be applied to any protein language model to improve structure, fitness, or function prediction. In the new version of the paper we also extend ProteinTTT to autoregressive and discrete diffusion models, and add a deeper analysis of success and failure cases
05.03.2026 12:08
π 0
π 0
π¬ 1
π 0
If you have a protein where AlphaFold or other methods struggle (e.g., viral proteins, antibodies, or low-homology targets), give ProteinTTT a try. ProteinTTT adapts ESMFold to one protein at a time before predicting structure, boosting downstream accuracy for most proteins
05.03.2026 12:08
π 3
π 1
π¬ 1
π 0
ProteinTTT is now easy to run on Hugging Face Spaces and Google Colab. Weβll also be presenting the paper at ICLR 2026 π§π·
π€ Hugging Face Space: huggingface.co/spaces/pimen...
βοΈ Google Colab: colab.research.google.com/drive/1l_h7c...
π§΅π
05.03.2026 12:08
π 39
π 9
π¬ 3
π 0
Mirdita Lab - Laboratory for Computational Biology & Molecular Machine Learning
Mirdita Lab builds scalable bioinformatics methods.
My time in @martinsteinegger.bsky.social's group is ending, but Iβm staying in Korea to build a lab at Sungkyunkwan University School of Medicine. If you or someone you know is interested in molecular machine learning and open-source bioinformatics, please reach out. I am hiring!
mirdita.org
20.01.2026 11:07
π 104
π 55
π¬ 7
π 1
γθ«ζη°‘εθ§£θͺ¬γOne protein is all you needη°‘εθ§£θͺ¬π¨ #shorts #vtuberζΊεδΈ #ζΎͺδΉγγ
YouTube video by Miono Yui ch. | ζΎͺδΉγγ
Amazing summary of our recent ProteinTTT paper youtube.com/shorts/XWueh...
03.01.2026 13:25
π 4
π 2
π¬ 1
π 0
Very happy @roman-bushuiev.bsky.social and I joined the amazing team led by @hannes-stark.bsky.social to work on BoltzGen, a generative model for binder design based on Boltz-2. Excited what it will enable!
27.10.2025 12:34
π 12
π 4
π¬ 0
π 0
Excited to release BoltzGen which brings SOTA folding performance to binder design! The best part of this project is collaborating with a broad network of leading wetlabs that test BoltzGen at an unprecedented scale, showing success on many novel targets and pushing the model to its limits!
26.10.2025 22:40
π 103
π 41
π¬ 3
π 5
BIG BIG congratulations to our PhD student
@roman-bushuiev.bsky.social for receiving the Google PhD Fellowship 2025 in Health Research! ππ° goo.gle/43wJWw8
25.10.2025 00:44
π 21
π 3
π¬ 1
π 1
Yes, exactly! This is an interesting perspective. Actually, running .ttt with msa_pth provided is effectively evotuning. We should highlight this connection better in the paper!
24.10.2025 02:53
π 1
π 0
π¬ 0
π 0
This is a joint work with @roman-bushuiev.bsky.social , and Olga Pimenova, Nikola Zadorozhny, Raman Samusevich, Elisabet Manaskova, @eunbelivable.bsky.social , @hannes-stark.bsky.social , Jiri Sedlar, @martinsteinegger.bsky.social , @pluskal-lab.org , @josef-sivic.bsky.social
23.10.2025 13:08
π 5
π 0
π¬ 0
π 0
Here, customization primarily benefits challenging, out-of-distribution proteins that are poorly represented in sequence databases (as measured by MSA size)
23.10.2025 13:08
π 2
π 0
π¬ 1
π 0
For example, ProteinTTT applied to ESMFold improves 19% of AlphaFold2-predicted viral protein structures in BFVD
23.10.2025 13:08
π 3
π 0
π¬ 1
π 0
This consistently improves performance of various protein language models across protein structure, fitness and function prediction, particularly on challenging targets
23.10.2025 13:08
π 2
π 0
π¬ 1
π 0
ProteinTTT overview
ProteinTTT enables *customizing* protein language models to one target protein at a time without assuming any additional data via on-the-fly self-supervised fine-tuning on the single protein
23.10.2025 13:08
π 3
π 0
π¬ 1
π 1
We train machine learning models on millions of proteins. But when it comes to making predictions, do we need them to understand all proteins at once? Often, we need an accurate model for the specific protein we are studying or designing. We address this with ProteinTTT arxiv.org/abs/2411.02109 1/π§΅
23.10.2025 13:08
π 68
π 25
π¬ 2
π 0
βοΈ @pluskal-lab.org from IOCB Prague, together with his student @roman-bushuiev.bsky.social and colleagues from #CIIRC CTU, Josef Ε ivic and @anton-bushuiev.bsky.social, have developed a machine learning model called #DreaMS β which accelerates the analysis of previously unknown molecules.
28.05.2025 13:42
π 12
π 5
π¬ 1
π 0
Self-supervised learning of molecular representations from millions of tandem mass spectra using DreaMS - Nature Biotechnology
A transformer model is used to construct the DreaMS Atlasβa molecular network of 201 million MS/MS spectra.
Mass spectrometry is a key method to discover and identify molecules in biological and environmental samples. Yet, >90% of mass spectra remain hard to interpret. In our recent paper, we present DreaMS β a foundation model to interpret mass spectra of small molecules.
www.nature.com/articles/s41...
26.05.2025 13:44
π 25
π 10
π¬ 1
π 0
This paper represents a great effort by @roman-bushuiev.bsky.social and his brother @anton-bushuiev.bsky.social. The DreaMS foundation model for mass spectra of small molecules now opens lots of avenues for possible downstream applications. It might be a game changer for computational metabolomics.
24.05.2025 08:21
π 52
π 20
π¬ 0
π 1
Back in July 2023 we organized a small bioML symposium at @iocbprague.bsky.social, and it turned out to be a very pleasant and successful event. This summer we are following up with a great line-up of speakers. Please register for free, deadline May 30. tinyurl.com/bioMLPrague25
11.04.2025 11:34
π 17
π 8
π¬ 0
π 0
π Exciting MassSpecGym leaderboard update π
Two new machine learning models achieve up to a 300% improvement in de novo molecular generation given mass spectra and corresponding chemical formulae. π₯ 1/n
07.03.2025 14:58
π 11
π 3
π¬ 1
π 0
MassSpecGym - the first comprehensive benchmark for the discovery and identification of molecules from MS/MS data.
@roman-bushuiev.bsky.social
@anton-bushuiev.bsky.social
@josef-sivic.bsky.social
@pluskal-lab.org
NeurIPS 2024 paper: arxiv.org/abs/2410.23326
#ChemSky #MassSpec #AI4Science
20.02.2025 17:16
π 8
π 4
π¬ 1
π 0
π€ In April 2024, brothers Roman and Anton Bushuiev from the teams of @pluskal-lab.org @iocbprague.bsky.social and Josef Ε ivic #CIIRC_CTU initiated a collaboration among 14 research institutes across the globe to benchmark #AI methods for the discovery of molecules from mass spectrometry data. 1/2
20.02.2025 14:06
π 11
π 5
π¬ 1
π 1
MassSpecGym: A benchmark for the discovery and identification of molecules
YouTube video by PolarisHQ
MassSpecGym is the largest publicly available collection of mass spectra data with 231K spectra for 29K unique molecular structures. 33% of the dataset was generated from newly measured, in-house data.
π‘οΈThe dataset is now certified on Polaris! polarishub.io/datasets/rom...
youtu.be/G8ZnVRm0ogc
24.01.2025 18:55
π 16
π 5
π¬ 1
π 0
The MassSpecGym team presenting the poster at #NeurIPS2024! Fei Wang,
Adamo Young,
@roman-bushuiev.bsky.social,
@anton-bushuiev.bsky.social,
Raman Samusevich
14.12.2024 17:27
π 20
π 3
π¬ 0
π 0
Met the twins @roman-bushuiev.bsky.social @anton-bushuiev.bsky.social
13.12.2024 01:43
π 14
π 2
π¬ 0
π 0