Constantin Ahlmann-Eltze's Avatar

Constantin Ahlmann-Eltze

@const-ae

Postdoc at UCL with James Reading. Previously at EMBL working with Wolfgang Huber. Biostats, R, cancer immunology

2,021
Followers
645
Following
65
Posts
28.08.2023
Joined
Posts Following

Latest posts by Constantin Ahlmann-Eltze @const-ae

I have also seen, from people who know base really well, incredibly elegant code using tapply, merge, and clever indexing. But, in my experience, the typical code of a medium experienced person that uses the tidyverse is more robust than the one that uses base.

05.03.2026 07:58 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 2 ๐Ÿ“Œ 0

It's an example. As I said, the tidyverse has the advantage that it provides functions for typical data analysis tasks all in one place with great documentation and common semantics. Also, personally I'm not a fan that merge auto-sorts the output.
Another example would be pivot_longer :)

05.03.2026 07:54 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

Teach the tidyverse! I have seen the code from people, who only learned base, that reinvents a join function and it's not pretty ๐Ÿ˜…
In the end, the tidyverse is a collection of useful functions for typicy data analysis tasks and beginners benefit from being told those exist :)

05.03.2026 07:33 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
Preview
Biological Laboratory | LEGOยฎ Ideas This is a two-floor realistic Biological Laboratory building. In this design, I try to tell stories about biologists, introduce biology concepts to general publโ€ฆ

I just made an account and voted for this biology laboratory made of LEGO to "promote biological research and inspire more people into the world of biology". They need 10K votes. Reposting and/or voting below would be helpful. ๐Ÿค—
beta.ideas.lego.com/product-idea...

16.10.2025 08:07 ๐Ÿ‘ 221 ๐Ÿ” 131 ๐Ÿ’ฌ 8 ๐Ÿ“Œ 12
Preview
How far back in time can you understand English? An experiment in language change

www.deadlanguagesociety.com/p/how-far-ba...

I cannot speak as to the accuracy of the piece yet I found it a fascinating exercise @adamroberts.bsky.social @drlauravarnam.bsky.social

22.02.2026 07:05 ๐Ÿ‘ 117 ๐Ÿ” 63 ๐Ÿ’ฌ 22 ๐Ÿ“Œ 39

screaming this from the rooftops

16.02.2026 18:56 ๐Ÿ‘ 15 ๐Ÿ” 2 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
Screenshot of Job Ad

Screenshot of Job Ad

EMBL site in Heidelberg

EMBL site in Heidelberg

Zurich, incl. a bit of UZH/USZ

Zurich, incl. a bit of UZH/USZ

Job offer: PhD-level biostatistician with ML&omics experience to work on prediction models for CAR T cell therapy, in an exciting clinical trial - spatial omics - data science consortium (LOOP/INTeRCePT3.0 centered in Zurich; this position in Heidelberg)

embl.wd103.myworkdayjobs.com/en-GB/EMBL/j...

13.02.2026 08:41 ๐Ÿ‘ 10 ๐Ÿ” 5 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

It definitely helps, but I think it's also simply that most scientists have a LinkedIn account and occasionally check it, whereas still only a minority has a bluesky account.

31.01.2026 19:38 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

Yeah, the amount of engagement on LinkedIn compared to bluesky is crazy. I recently posted about a new paper and it got 480 likes on LinkedIn vs 7 on bluesky (and I have more followers here than there...) :/

31.01.2026 11:00 ๐Ÿ‘ 2 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

We wrote a review of representation learning methods of single-cell RNA-seq data, where we compare factor models, autoencoders, contrastive learning, and foundation models ๐ŸŽ‰

rnajournal.cshlp.org/content/earl...

21.01.2026 09:56 ๐Ÿ‘ 8 ๐Ÿ” 2 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
Preview
Postdoctoral Researcher or Senior Scientist (AI ร— Biology)

Boeva Lab is #hiring postdocs and senior scientists in AI/ML for Cancer Biology! Check the offer and apply at: jobs.ethz.ch/job/view/JOP...

20.01.2026 14:49 ๐Ÿ‘ 8 ๐Ÿ” 8 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0
View from the hotel room

View from the hotel room

Poster session 2024, with Valentina Boeva, Constantin Ahlmann-Eltze and others

Poster session 2024, with Valentina Boeva, Constantin Ahlmann-Eltze and others

Wednesday afternoon hike incl. swim in the mountain river

Wednesday afternoon hike incl. swim in the mountain river

Another view from the hotel room

Another view from the hotel room

Apply for the Ascona workshop "Statistical and AI methods for multi-modal multi-scale modeling of biological systems", 28 Jun-3 Jul 2026 on Monte Veritร , Lago Maggiore at the foot of the Swiss Alps.

ascona2026.sciencesconf.org

15.01.2026 16:16 ๐Ÿ‘ 21 ๐Ÿ” 16 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 3
Post image

Save the date: April 9 from 4pm to 6pm CET. Our department is hosting an online seminar with @noeliaferruz.bsky.social @sdomcke.bsky.social @const-ae.bsky.social who will talk about models for protein design, large-scale perturbation screens, and benchmarking of perturbation prediction models.

14.01.2026 13:26 ๐Ÿ‘ 6 ๐Ÿ” 3 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

Join me in 5.5h, when we discuss our benchmark of perturbation prediction models and what the right metric is to assess if a gene expression prediction is good!

02.12.2025 14:20 ๐Ÿ‘ 7 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

๐Ÿš€ Excited to share our new preprint: msBayesImpute - A Versatile Framework for Addressing Missing Values in Biomedical Mass Spectrometry Proteomics Data
๐Ÿ‘‰ Improves imputation accuracy, normalization, and differential expression detection
๐Ÿ“https://www.biorxiv.org/content/10.1101/2025.10.02.679746v1

07.10.2025 08:47 ๐Ÿ‘ 16 ๐Ÿ” 2 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 1

Heads up: ignore samtools dot org, similarly minimap2 dot com and likely others. It's owned by a known phishing site and while the binaries they offer look valid currently (but note they may be serving us different binaries to others), that could change.

Ie: it's not us (Samtools team)! Be warned

15.09.2025 08:40 ๐Ÿ‘ 146 ๐Ÿ” 127 ๐Ÿ’ฌ 2 ๐Ÿ“Œ 5
Post image

We're excited to share that our preprint on anndataR, a new package bringing Python's AnnData to R, is now available on bioRxiv ๐ŸŽ‰

๐Ÿ”— Read the paper: www.biorxiv.org/content/10.1...
๐Ÿ’ป Check the package in action: anndatar.data-intuitive.com

25.08.2025 15:24 ๐Ÿ‘ 22 ๐Ÿ” 6 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 1
An arrow with a LaTeX equation

An arrow with a LaTeX equation

Trigonometric functions and a unit circle

Trigonometric functions and a unit circle

A bivariate change model with structured residuals

A bivariate change model with structured residuals

A hierarchical model of cognitive abilities

A hierarchical model of cognitive abilities

Now on CRAN, ggdiagram is a #ggplot2 extension that draws diagrams programmatically in #Rstats. Allows for precise control in how objects, labels, and equations are placed in relation to each other.
wjschne.github.io/ggdiagram/ar...

20.08.2025 10:43 ๐Ÿ‘ 180 ๐Ÿ” 73 ๐Ÿ’ฌ 10 ๐Ÿ“Œ 9

Makes sense. I imagine this would simply be my primary use case, and I would prefer not having to refer to `x` twice. Something like:

replace_values <- function(x, ..., from=NULL, to=NULL, lookup=NULL){
if(! is.null(lookup)){
from <- names(lookup)
to <- lookup
}
...
}

10.08.2025 10:22 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

This looks great! I think it would be neat if the functions also supported named look-up vectors in addition to the `to` and `from` arguments :)

10.08.2025 09:58 ๐Ÿ‘ 0 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
Beeswarm plot of the prediction error across different methods of double perturbations showing that all methods (scGPT, scFoundation, UCE, scBERT, Geneformer, GEARS, and CPA) perform worse than the additive baseline.

Beeswarm plot of the prediction error across different methods of double perturbations showing that all methods (scGPT, scFoundation, UCE, scBERT, Geneformer, GEARS, and CPA) perform worse than the additive baseline.

Line plot of the true positive rate against the false discovery proportion showing that none of the methods is better at finding non additive interactions than simply predicting no change.

Line plot of the true positive rate against the false discovery proportion showing that none of the methods is better at finding non additive interactions than simply predicting no change.

Our paper benchmarking foundation models for perturbation effect prediction is finally published ๐ŸŽ‰๐Ÿฅณ๐ŸŽ‰

www.nature.com/articles/s41...

We show that none of the available* models outperform simple linear baselines. Since the original preprint, we added more methods, metrics, and prettier figures!

๐Ÿงต

04.08.2025 13:52 ๐Ÿ‘ 126 ๐Ÿ” 57 ๐Ÿ’ฌ 2 ๐Ÿ“Œ 6
Preview
Pre-Cancer Immunology The Pre-Cancer Immunology Lab (James Reading Lab) is mapping pre-invasive T cell dynamics during carcinogenesis to detect and intercept cancer development.

๐Ÿšจ PhD Position available in our lab ๐Ÿšจ exploring the power of blood immune multi-omics to detect lung cancer years prior to clinical diagnosis in a unique cohort of >10,000 CT screened individuals.
โœ… Wet & dry lab
โœ… September 2025 enrolment
โœ… UK tuition fees only

www.ucl.ac.uk/medical-scie...

05.08.2025 17:57 ๐Ÿ‘ 16 ๐Ÿ” 15 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

I wrote about AI foundation models for biology last year: www.nytimes.com/2024/03/10/s... Benchmarking tests since then aren't finding that they're better than simpler models of how genes and cells work.

04.08.2025 16:45 ๐Ÿ‘ 38 ๐Ÿ” 13 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
Preview
Deep-learning-based gene perturbation effect prediction does not yet outperform simple linear baselines - Nature Methods The analysis presented in this Brief Communication shows that, despite their complexity, current deep learning models do not outperform linear baselines in predicting gene perturbation effects, thus e...

An analysis shows that current deep learning models do not beat linear baselines in predicting gene perturbation effects, thus emphasizing the importance of further method development and evaluation. @const-ae.bsky.social @wkhuber.bsky.social @s-anders.bsky.social

www.nature.com/articles/s41...

04.08.2025 16:07 ๐Ÿ‘ 64 ๐Ÿ” 25 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 3

Haha, would also be a succinct summary of most of my academic work ๐Ÿ˜…

04.08.2025 15:38 ๐Ÿ‘ 4 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
Preview
Deep-learning-based gene perturbation effect prediction does not yet outperform simple linear baselines Nature Methods - The analysis presented in this Brief Communication shows that, despite their complexity, current deep learning models do not outperform linear baselines in predicting gene...

And lastly, a big shout-out to @wkhuber.bsky.social and @s-anders.bsky.social!

Link to pdf: rdcu.be/ey7x0

04.08.2025 13:52 ๐Ÿ‘ 7 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

- Li et al. doi.org/10.1101/2024.12.23.630036
- Chen Li et al. doi.org/10.1101/2024.12.20.629581
- Wong et al. doi.org/10.1093/bioinformatics/btaf317

And probably many more that I am missing here.

04.08.2025 13:52 ๐Ÿ‘ 10 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

- @kasparmartens.bsky.social et al. openreview.net/forum?id=eb3ndUlkt4
- Gaudelet et al. doi.org/10.48550/arXiv.2404.16907
- @aaronwtr.bsky.social et al. openreview.net/forum?id=t04D9bkKUq
- Bendidi et al. doi.org/10.48550/arXiv.2410.13956
- Wu et al. doi.org/10.48550/arXiv.2408.10609

04.08.2025 13:52 ๐Ÿ‘ 9 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0

I also encourage everyone to checkout the parallel efforts by groups around the world who came to similar conclusions:
- @ekernf01.bsky.social et al. doi.org/10.1101/2023.07.28.551039
- Csendes et al. doi.org/10.1186/s12864-025-11600-2
- @kasia.codes et al. doi.org/10.1186/s13059-025-03574-x

04.08.2025 13:52 ๐Ÿ‘ 13 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0
A recap of virtual cell releases circa June 2025 In October 2024, I twote that โ€œsomething is deeply wrongโ€ with what we now call virtual cell models. A lot has happened since then: modelers are advancing new architectures and mining new sources of i...

*We benchmarked scGPT, scFoundation, GEARS, CPA (which claim predictive ability), and scBERT, Geneformer, and UCE (which do not claim this ability). I can't comment on methods released in the last 5 months. For a summary of recent developments, see @ekernf01.bsky.social's post

04.08.2025 13:52 ๐Ÿ‘ 9 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0