Most benchmark papers are boring but this is a masterpiece. Authors present a comprehensive breakdown of different inference strategies on variant effect prediction (VEP) in viruses. There were two surprising results that IMO deserve more attentionπ§΅
22.01.2026 15:45
π 16
π 3
π¬ 1
π 0
This line graph illustrates the percentage change in agency staff levels from the previous year for nine major U.S. federal scientific and health organizations between the fiscal years 2016 and 2025. The agencies tracked include the CDC, Department of Energy, EPA, FDA, NASA, NIH, NIST, NOAA, and NSF. For the majority of the timeline between 2016 and 2023, the agencies show relatively stable fluctuations, generally staying within a range of +5% to -5% change per year. However, there is a dramatic and uniform plummet starting in the 2024β25 period. Every agency depicted shows a sharp downward trajectory, with staffing losses ranging from approximately -15% to over -25%. The Environmental Protection Agency (EPA) shows the most significant decline, dropping to roughly -26%, while the National Institute of Standards and Technology (NIST) shows the least severe but still substantial drop at approximately -15%.
This is the most astonishing graph of what the Trump regime has done to US science. They have destroyed the federal science workforce across the board. The negative impacts on Americans will be felt for generations, and the US might never be the same again.
www.nature.com/immersive/d4...
20.01.2026 22:53
π 14449
π 8317
π¬ 90
π 765
This was fun work and a remarkable effort across the computational and wet-lab teams!
Strategies for in-silico filtering and ranking of antibody designs have been under-discussed in the literature, e.g. in most technical reports on antibody design that I've seen. Let's talk about these here! [1/n]
15.01.2026 11:19
π 18
π 8
π¬ 1
π 0
This complements our prior evaluation of deep mutational scans showcasing PLM underperformance, even when there is especially low sequence diversity.
There is still much room for improvement for viral modeling!
13.01.2026 19:06
π 0
π 0
π¬ 0
π 0
Computational variant effect predictors effectively predict viral evolution β even more so than deep mutational scans (DMS).
Yet, PLM or hybrid approaches (even with data-leakage inflating performance) provide little benefit over the best alignment-based model (EVE).
13.01.2026 19:06
π 2
π 1
π¬ 1
π 0
π¦ π We ask whether models can predict mutations that actually rose in frequency in natureβproviding the clearest available evaluation of their utility for pandemic preparedness.
13.01.2026 19:06
π 1
π 0
π¬ 1
π 0
We now evaluate protein language models (PLMs), alignment-based models, and hybrids across 30 clades from 4 well-sequenced viruses (SARS-CoV-2, HIV, H3N2, and H1N1) - the first large scale evaluation of these models on real viral evolution forecasting.
13.01.2026 19:06
π 1
π 0
π¬ 1
π 0
Weβve updated the EVEREST benchmark to include real-world viral evolution! www.biorxiv.org/content/10.1...
Co-led by Noor Youssef and me, along with co-authors Navami Jain, Aarushi Mehrotra, Sarrah Leung, Abigail Jackson, @deboramarks.bsky.social, and with @cepi.net @futurehousesf.bsky.social!
13.01.2026 19:06
π 5
π 2
π¬ 1
π 0
Fresh conflicts erupt around giant database for flu and COVID-19 sequences
Critics say βautocraticβ behavior by GISAID could hamper response to a future pandemic
The fundamental problem with GISAID is this - data on the platform are neither open, nor FAIR.
Such data - key for combating infectious diseases - are, by a large margin, paid for by taxpayers and hence must be openly available to all.
With GISAID, they are not.
www.science.org/content/arti...
07.01.2026 17:23
π 21
π 8
π¬ 1
π 0
New preprint! We measured temperature- and pH-induced aggregation for over 18,000 natural and de novo designed protein domains!
19.11.2025 21:16
π 121
π 42
π¬ 4
π 3
End-to-end protein design in the browser through evedesign. Generate and interactively explore designs in 2D/3D and export them as codon-optimized DNA. The underlying open source framework (released soon) is build to easily add new methods, more on that soon.
π evedesign.bio
22.10.2025 14:30
π 93
π 29
π¬ 2
π 1
Thrilled to announce our new preprint, βProtein Hunter: Exploiting Structure Hallucination within Diffusion for Protein Design,β in collaboration with @Griffin, @GBhardwaj8 and @sokrypton.org
π§¬Code and notebooks will be released by the end of this week.
π§Golden- Kpop Demon Hunters
13.10.2025 15:45
π 52
π 16
π¬ 3
π 2
Large AI models are reported to achieve high accuracy (AUROC) predicting pathogenic variants across the genome.
A preprint reports that the predictions are based on splice variants. Using only this info (no sequences, no AI) achieves AUROC=0.944 across noncoding variants.
1/2
09.09.2025 23:01
π 15
π 7
π¬ 1
π 0
Recent advances in the inference of deep viral evolutionary history | Journal of Virology
Phylogenetic studies examining the origins, emergence, and spread of viruses have arguably been one of the most active and successful areas of evolutionary biology and form the bedrock of the flourishing field of genomic epidemiology. This, in part, reflects the ability of viruses, particularly those with RNA genomes, to evolve at rates much greater than their cellular counterparts (1). The rapid rate at which viruses evolve and accumulate mutations enables evolutionary signals to be identified through comparative genomics at short timescales relevant for outbreak investigation and response. The integration of phylogenetics and epidemiology, known as phylodynamics, has become a vital tool in response to numerous viral outbreaks, epidemics, and pandemics, including Ebola (2), Zika (3), and, more recently, COVID-19 (4) and mpox (5).
Thereβs been a bunch of new approaches looking at deep viral evolutionary history. Weβve put together a mini review highlighting some recent advancements in structural phylogenetics and time-dependent rate models and what they could do for the field π¦
π journals.asm.org/doi/full/10....
25.08.2025 20:32
π 28
π 13
π¬ 2
π 2
Divergent viral phosphodiesterases for immune signaling evasion
Cyclic dinucleotides (CDNs) and other short oligonucleotides play fundamental roles in immune system activation in organisms ranging from bacteria to humans. In response, viruses use phosphodiesterase...
Excited to share our new preprint co-led by @jnoms.bsky.social!
Here we reveal an exceptional diversity of viral 2H phosphodiesterases (PDEs) that enable immune evasion by selectively degrading oligonucleotide-based messengers. This 2H PDE fold has evolved striking substrate breath & specificity.
22.08.2025 19:02
π 43
π 28
π¬ 2
π 3
Thanks!
20.08.2025 20:50
π 0
π 0
π¬ 0
π 0
πAmazing collaboration co-led with Noor Youssef
and Navami Jain, @deboramarks.bsky.social, and our funders @cepi.net!
11/12
17.08.2025 03:42
π 2
π 0
π¬ 1
π 0
This matters for:
β οΈ Future-proof vaccine and therapeutics design
β οΈ Monitoring of high-pandemic risk viruses
β οΈ Dual-use biosecurity risk assessment
Without reliable models, we risk underestimating viral evolutionβand overestimating our ability to counter it.
10/12
17.08.2025 03:42
π 3
π 0
π¬ 1
π 0
EVEREST highlights:
β
Where models failβand why
β
Which viruses are least/most predictable
β
How to estimate per-protein, model-specific reliability
β
Concrete steps to improve ML for viral mutation prediction
9/12
17.08.2025 03:42
π 6
π 0
π¬ 2
π 0
πCurrent models fail to reliably predict mutations in more than half of the high-priority viruses identified by the WHO.
8/12
17.08.2025 03:42
π 4
π 0
π¬ 1
π 0
πͺIs bigger always better? Maybe not for other taxa but for viruses - yes! For viruses, models continue to improve with increased numbers of parameters.
7/12
17.08.2025 03:42
π 4
π 0
π¬ 1
π 0
π€Why? Viruses are severely underrepresented in training datasets (<1%) and are further downsampled after common clustering approaches.
6/12
17.08.2025 03:42
π 8
π 0
π¬ 1
π 0
πDespite the hype, protein language models trained across the βprotein universeβ are outperformed by even the simplest, site-independent alignment-based model.
5/12
17.08.2025 03:42
π 14
π 2
π¬ 1
π 0
πImagine: Itβs Day 0 of an outbreak and thereβs little experiment data. Computational mutational effect predictions could provide valuable informationβ¦if we could trust them. Can we?
EVEREST doesnβt just assess performance. It also quantifies reliability for new viruses.
4/12
17.08.2025 03:42
π 2
π 0
π¬ 1
π 0
πTo find out, we built EVEREST: Evolutionary Variant Effect prediction with Reliability ESTimation.
We benchmark models across 45 viral deep mutational scanning datasets spanning >340,000 mutations.
3/12
17.08.2025 03:42
π 2
π 0
π¬ 1
π 0
π¦ Protein language models (PLMs) have shown impressive performance in predicting mutation effects. But... viruses are a different beast.
They evolve fast, cross species, and are under pressure from host immunity. Do PLMs still work here?
2/12
17.08.2025 03:42
π 4
π 0
π¬ 1
π 0
π¨New paper π¨
Can protein language models help us fight viral outbreaks? Not yet. Hereβs why π§΅π
1/12
17.08.2025 03:42
π 56
π 23
π¬ 3
π 2