A new 156 page paper defines Scientific General Intelligence as end to end research capability and introduces SGI Bench with 1,000 plus expert tasks. Results show LLMs fail badly at full scientific workflows. arxiv.org/abs/2512.16969
@simonemarini
AI, BioML, compbio, data science | pathogens, AMR antimicrobial and antibiotic res, inflammation, CBD, single-cell RNA seq, metagenomics | Asst Prof @ University of Florida, AI advisor @ enGenome | Prev: UMich, UniPV, KyotoU | Immigrant πͺπΊππ°π―π΅πΊπΈ
A new 156 page paper defines Scientific General Intelligence as end to end research capability and introduces SGI Bench with 1,000 plus expert tasks. Results show LLMs fail badly at full scientific workflows. arxiv.org/abs/2512.16969
suspect a big reason why many academics and others who work in areas where getting facts RIGHT is key are disinterested in using LLMs for research:
theyβve tried it, they keep noticing major errors in output, and they conclude that having to verify all that doesnβt actually save them time.
Ampicillin/Sulbactam in Combination with Ceftazidime/Avibactam against Metallo-Γ-Lactamase-Producing Carbapenem-Resistant Acinetobacter baumannii: A Genomics-Informed Mechanism-Based Model
β
Just Accepted
π https://bit.ly/3WUrcmU
An activate-to-eliminate approach in HIV
Joel N. Blankson & team report on a strategy to selectively eliminate antigen-specific HIV infected CD4+ T cells by stimulation with cognate peptides and incubation with antiproliferative drugs: doi.org/10.1172/JCI1...
National Academy of Medicine launches AI patient safety initiative to develop harm prevention frameworks across healthcare systems www.beckershospitalreview.com/healthcare-i...
Thanks for sharing! These 1st hand experiences are really important, especially for students.
Clifti-GPT: Privacy-preserving federated fine-tuning and transferable inference of foundation models on clinical single-cell data #SingleCell π§ͺπ§¬π₯οΈ
https://www.researchsquare.com/article/rs-7917089/latest
A table showing profit margins of major publishers. A snippet of text related to this table is below. 1. The four-fold drain 1.1 Money Currently, academic publishing is dominated by profit-oriented, multinational companies for whom scientific knowledge is a commodity to be sold back to the academic community who created it. The dominant four are Elsevier, Springer Nature, Wiley and Taylor & Francis, which collectively generated over US$7.1 billion in revenue from journal publishing in 2024 alone, and over US$12 billion in profits between 2019 and 2024 (Table 1A). Their profit margins have always been over 30% in the last five years, and for the largest publisher (Elsevier) always over 37%. Against many comparators, across many sectors, scientific publishing is one of the most consistently profitable industries (Table S1). These financial arrangements make a substantial difference to science budgets. In 2024, 46% of Elsevier revenues and 53% of Taylor & Francis revenues were generated in North America, meaning that North American researchers were charged over US$2.27 billion by just two for-profit publishers. The Canadian research councils and the US National Science Foundation were allocated US$9.3 billion in that year.
A figure detailing the drain on researcher time. 1. The four-fold drain 1.2 Time The number of papers published each year is growing faster than the scientific workforce, with the number of papers per researcher almost doubling between 1996 and 2022 (Figure 1A). This reflects the fact that publishersβ commercial desire to publish (sell) more material has aligned well with the competitive prestige culture in which publications help secure jobs, grants, promotions, and awards. To the extent that this growth is driven by a pressure for profit, rather than scholarly imperatives, it distorts the way researchers spend their time. The publishing system depends on unpaid reviewer labour, estimated to be over 130 million unpaid hours annually in 2020 alone (9). Researchers have complained about the demands of peer-review for decades, but the scale of the problem is now worse, with editors reporting widespread difficulties recruiting reviewers. The growth in publications involves not only the authorsβ time, but that of academic editors and reviewers who are dealing with so many review demands. Even more seriously, the imperative to produce ever more articles reshapes the nature of scientific inquiry. Evidence across multiple fields shows that more papers result in βossificationβ, not new ideas (10). It may seem paradoxical that more papers can slow progress until one considers how it affects researchersβ time. While rewards remain tied to volume, prestige, and impact of publications, researchers will be nudged away from riskier, local, interdisciplinary, and long-term work. The result is a treadmill of constant activity with limited progress whereas core scholarly practices β such as reading, reflecting and engaging with othersβ contributions β is de-prioritized. What looks like productivity often masks intellectual exhaustion built on a demoralizing, narrowing scientific vision.
A table of profit margins across industries. The section of text related to this table is below: 1. The four-fold drain 1.1 Money Currently, academic publishing is dominated by profit-oriented, multinational companies for whom scientific knowledge is a commodity to be sold back to the academic community who created it. The dominant four are Elsevier, Springer Nature, Wiley and Taylor & Francis, which collectively generated over US$7.1 billion in revenue from journal publishing in 2024 alone, and over US$12 billion in profits between 2019 and 2024 (Table 1A). Their profit margins have always been over 30% in the last five years, and for the largest publisher (Elsevier) always over 37%. Against many comparators, across many sectors, scientific publishing is one of the most consistently profitable industries (Table S1). These financial arrangements make a substantial difference to science budgets. In 2024, 46% of Elsevier revenues and 53% of Taylor & Francis revenues were generated in North America, meaning that North American researchers were charged over US$2.27 billion by just two for-profit publishers. The Canadian research councils and the US National Science Foundation were allocated US$9.3 billion in that year.
The costs of inaction are plain: wasted public funds, lost researcher time, compromised scientific integrity and eroded public trust. Today, the system rewards commercial publishers first, and science second. Without bold action from the funders we risk continuing to pour resources into a system that prioritizes profit over the advancement of scientific knowledge.
We wrote the Strain on scientific publishing to highlight the problems of time & trust. With a fantastic group of co-authors, we present The Drain of Scientific Publishing:
a π§΅ 1/n
Drain: arxiv.org/abs/2511.04820
Strain: direct.mit.edu/qss/article/...
Oligopoly: direct.mit.edu/qss/article/...
Three critical vulnerabilities (now patched) have been discovered in a portable ONT DNA sequencing device.
While our attention is focused on the security of genomic data storage, this is a strong reminder we have to factor in hardware security as well.
www.nature.com/articles/s41...
The use of GenAI in health research necessitates transparent reporting standards.
New guidelines help: CHART for chatbot health advice, TRIPOD-LLM for model development or prediction, and GAMER for GenAI-assisted manuscript writing.
#MedSky #MedAI #MLSky
This is why we build knowledgebases & build our own agentic AI tools for querying them www.nature.com/articles/s42... #llms
WHO reports 1 in 6 bacterial infections worldwide are antibiotic-resistant, with resistance rising sharply since 2018. Gram-negative bacteria like E. coli and K. pneumoniae pose the biggest threat. Action on #AMR surveillance and responsible antibiotic use is needed.
www.who.int/news/item/13...
MetaGraph by Karasikov and coauthors makes the worldβs DNA searchable. By turning 67 petabases of raw sequences into a compressed graph structure, it enables fast, low-cost search across global genomic dataβbringing biology closer to having its own βsearch engineβ. www.nature.com/articles/s41...
New preprint reveals bacteria can't just collect all resistance genes like Pokemon cards.
We found mutually exclusive evolutionary pathways to multidrug resistance in E. coli & P. aeruginosa - some resistance mechanisms actively prevent others from coexisting www.biorxiv.org/content/10.1...
The Impact of Tokenizer Selection in Genomic Language Models. #GenomicLanguageModels #Genomics #LLMs #Bioinformatics
academic.oup.com/bioinformati...
White mold fungi split their genome across several nuclei, with implications for future gene editing - Could enable dramatic revolutions in gene editing: theconversation.com/white-mold-f... #CdnSci #genomics #science #SciChat
PBMCpedia: A Harmonized PBMC scRNA-seq Database With Unified Mapping and Enhanced Celltype Annotation #SingleCell π§ͺπ§¬π₯οΈ
https://www.biorxiv.org/content/10.1101/2025.08.06.668843v1
Paper out! SARITA, a Genomic language model to predict SARS-CoV-2 mutations and support the design of antigenic variants for downstream use in therapy development and immunological studies.
academic.oup.com/bib/article/...
Respiratory infections reactivate dormant metastatic breast cancer cells in mouse lungs, correlating with evidence in humans that #SARS-CoV-2 infection increases the risk of cancer-related mortality & lung metastasis
π biology but double whammy for cancer patients
www.nature.com/articles/s41...
π§ͺ
Paper out on HIV epidemiology + network science!
We identify risk groups and clusters of transmission and exposure throughout Florida, via a rich database of multi-year contact tracing interviews.
publichealth.jmir.org/2025/1/e65573/
Every time I read about experiments showing AI cheating, deceiving, tricking--the last one finding how o3 is a master schemer when playing Diplomacy against other AIs--I keep thinking how we are slowly approaching the proof that philosophical zombies are real.
π§π· The Brazilian Reproducibility Initiative published the results of 143 replication attempts in biomedical science.
Success rates ranged from just 15β45%.
Now, the project team reflects on what made replication so hard and what needs to change.
buff.ly/55j9Sax
I am very happy (and anxious) to share with you our most recent work in which we evaluated four of the most popular long-read assemblers,
www.biorxiv.org/content/10.1...
and tell you just a little bit about it in the following π§΅
The global antibiotic market is projected to grow from $51B in 2025 to $65B by 2032, driven by rising infections and R&D. Key challenges include #AMR and high costs, with new opportunities in phage therapy and diagnostics.
#InfectiousDisease
www.htfmarketinsights.com/report/41017...
Illumina announced the discontinuation of the old MiSeq, MiniSeq and old i100 instruments.
NIH is the best investment there is.
www.forbes.com/sites/michae...
This example, along with others, seems to tell us that, after all, philosophical zombies could be a thing.
I mean, you get all these AI behaviors that we naively ascribed to conscious (or partially conscious) beings only. Because
they not getting conscious, one step at the time, right? Right...?
Composite image of women scientists from the Joyce lab over the past 20 years
Happy International Day of Women and Girls in Science!
Its been a privilege to work with these extraordinary scientists in my lab over the years - all making an incredible impact throughout the world π
We must never give up on #equality #diversity & #inclusion πͺ
#WomeninScience #GirlsInSTEM π§ͺπ¬ππ