I have also seen, from people who know base really well, incredibly elegant code using tapply, merge, and clever indexing. But, in my experience, the typical code of a medium experienced person that uses the tidyverse is more robust than the one that uses base.
05.03.2026 07:58
๐ 1
๐ 0
๐ฌ 2
๐ 0
It's an example. As I said, the tidyverse has the advantage that it provides functions for typical data analysis tasks all in one place with great documentation and common semantics. Also, personally I'm not a fan that merge auto-sorts the output.
Another example would be pivot_longer :)
05.03.2026 07:54
๐ 0
๐ 0
๐ฌ 1
๐ 0
Teach the tidyverse! I have seen the code from people, who only learned base, that reinvents a join function and it's not pretty ๐
In the end, the tidyverse is a collection of useful functions for typicy data analysis tasks and beginners benefit from being told those exist :)
05.03.2026 07:33
๐ 1
๐ 0
๐ฌ 1
๐ 0
Biological Laboratory | LEGOยฎ Ideas
This is a two-floor realistic Biological Laboratory building. In this design, I try to tell stories about biologists, introduce biology concepts to general publโฆ
I just made an account and voted for this biology laboratory made of LEGO to "promote biological research and inspire more people into the world of biology". They need 10K votes. Reposting and/or voting below would be helpful. ๐ค
beta.ideas.lego.com/product-idea...
16.10.2025 08:07
๐ 221
๐ 131
๐ฌ 8
๐ 12
How far back in time can you understand English?
An experiment in language change
www.deadlanguagesociety.com/p/how-far-ba...
I cannot speak as to the accuracy of the piece yet I found it a fascinating exercise @adamroberts.bsky.social @drlauravarnam.bsky.social
22.02.2026 07:05
๐ 117
๐ 63
๐ฌ 22
๐ 39
screaming this from the rooftops
16.02.2026 18:56
๐ 15
๐ 2
๐ฌ 1
๐ 0
Screenshot of Job Ad
EMBL site in Heidelberg
Zurich, incl. a bit of UZH/USZ
Job offer: PhD-level biostatistician with ML&omics experience to work on prediction models for CAR T cell therapy, in an exciting clinical trial - spatial omics - data science consortium (LOOP/INTeRCePT3.0 centered in Zurich; this position in Heidelberg)
embl.wd103.myworkdayjobs.com/en-GB/EMBL/j...
13.02.2026 08:41
๐ 10
๐ 5
๐ฌ 0
๐ 0
It definitely helps, but I think it's also simply that most scientists have a LinkedIn account and occasionally check it, whereas still only a minority has a bluesky account.
31.01.2026 19:38
๐ 0
๐ 0
๐ฌ 0
๐ 0
Yeah, the amount of engagement on LinkedIn compared to bluesky is crazy. I recently posted about a new paper and it got 480 likes on LinkedIn vs 7 on bluesky (and I have more followers here than there...) :/
31.01.2026 11:00
๐ 2
๐ 0
๐ฌ 1
๐ 0
We wrote a review of representation learning methods of single-cell RNA-seq data, where we compare factor models, autoencoders, contrastive learning, and foundation models ๐
rnajournal.cshlp.org/content/earl...
21.01.2026 09:56
๐ 8
๐ 2
๐ฌ 1
๐ 0
Postdoctoral Researcher or Senior Scientist (AI ร Biology)
Boeva Lab is #hiring postdocs and senior scientists in AI/ML for Cancer Biology! Check the offer and apply at: jobs.ethz.ch/job/view/JOP...
20.01.2026 14:49
๐ 8
๐ 8
๐ฌ 0
๐ 0
View from the hotel room
Poster session 2024, with Valentina Boeva, Constantin Ahlmann-Eltze and others
Wednesday afternoon hike incl. swim in the mountain river
Another view from the hotel room
Apply for the Ascona workshop "Statistical and AI methods for multi-modal multi-scale modeling of biological systems", 28 Jun-3 Jul 2026 on Monte Veritร , Lago Maggiore at the foot of the Swiss Alps.
ascona2026.sciencesconf.org
15.01.2026 16:16
๐ 21
๐ 16
๐ฌ 0
๐ 3
Save the date: April 9 from 4pm to 6pm CET. Our department is hosting an online seminar with @noeliaferruz.bsky.social @sdomcke.bsky.social @const-ae.bsky.social who will talk about models for protein design, large-scale perturbation screens, and benchmarking of perturbation prediction models.
14.01.2026 13:26
๐ 6
๐ 3
๐ฌ 0
๐ 0
Join me in 5.5h, when we discuss our benchmark of perturbation prediction models and what the right metric is to assess if a gene expression prediction is good!
02.12.2025 14:20
๐ 7
๐ 0
๐ฌ 0
๐ 0
๐ Excited to share our new preprint: msBayesImpute - A Versatile Framework for Addressing Missing Values in Biomedical Mass Spectrometry Proteomics Data
๐ Improves imputation accuracy, normalization, and differential expression detection
๐https://www.biorxiv.org/content/10.1101/2025.10.02.679746v1
07.10.2025 08:47
๐ 16
๐ 2
๐ฌ 1
๐ 1
Heads up: ignore samtools dot org, similarly minimap2 dot com and likely others. It's owned by a known phishing site and while the binaries they offer look valid currently (but note they may be serving us different binaries to others), that could change.
Ie: it's not us (Samtools team)! Be warned
15.09.2025 08:40
๐ 146
๐ 127
๐ฌ 2
๐ 5
We're excited to share that our preprint on anndataR, a new package bringing Python's AnnData to R, is now available on bioRxiv ๐
๐ Read the paper: www.biorxiv.org/content/10.1...
๐ป Check the package in action: anndatar.data-intuitive.com
25.08.2025 15:24
๐ 22
๐ 6
๐ฌ 1
๐ 1
An arrow with a LaTeX equation
Trigonometric functions and a unit circle
A bivariate change model with structured residuals
A hierarchical model of cognitive abilities
Now on CRAN, ggdiagram is a #ggplot2 extension that draws diagrams programmatically in #Rstats. Allows for precise control in how objects, labels, and equations are placed in relation to each other.
wjschne.github.io/ggdiagram/ar...
20.08.2025 10:43
๐ 180
๐ 73
๐ฌ 10
๐ 9
Makes sense. I imagine this would simply be my primary use case, and I would prefer not having to refer to `x` twice. Something like:
replace_values <- function(x, ..., from=NULL, to=NULL, lookup=NULL){
if(! is.null(lookup)){
from <- names(lookup)
to <- lookup
}
...
}
10.08.2025 10:22
๐ 0
๐ 0
๐ฌ 1
๐ 0
This looks great! I think it would be neat if the functions also supported named look-up vectors in addition to the `to` and `from` arguments :)
10.08.2025 09:58
๐ 0
๐ 0
๐ฌ 1
๐ 0
Beeswarm plot of the prediction error across different methods of double perturbations showing that all methods (scGPT, scFoundation, UCE, scBERT, Geneformer, GEARS, and CPA) perform worse than the additive baseline.
Line plot of the true positive rate against the false discovery proportion showing that none of the methods is better at finding non additive interactions than simply predicting no change.
Our paper benchmarking foundation models for perturbation effect prediction is finally published ๐๐ฅณ๐
www.nature.com/articles/s41...
We show that none of the available* models outperform simple linear baselines. Since the original preprint, we added more methods, metrics, and prettier figures!
๐งต
04.08.2025 13:52
๐ 126
๐ 57
๐ฌ 2
๐ 6
Pre-Cancer Immunology
The Pre-Cancer Immunology Lab (James Reading Lab) is mapping pre-invasive T cell dynamics during carcinogenesis to detect and intercept cancer development.
๐จ PhD Position available in our lab ๐จ exploring the power of blood immune multi-omics to detect lung cancer years prior to clinical diagnosis in a unique cohort of >10,000 CT screened individuals.
โ
Wet & dry lab
โ
September 2025 enrolment
โ
UK tuition fees only
www.ucl.ac.uk/medical-scie...
05.08.2025 17:57
๐ 16
๐ 15
๐ฌ 0
๐ 0
I wrote about AI foundation models for biology last year: www.nytimes.com/2024/03/10/s... Benchmarking tests since then aren't finding that they're better than simpler models of how genes and cells work.
04.08.2025 16:45
๐ 38
๐ 13
๐ฌ 1
๐ 0
Haha, would also be a succinct summary of most of my academic work ๐
04.08.2025 15:38
๐ 4
๐ 0
๐ฌ 1
๐ 0
- Li et al. doi.org/10.1101/2024.12.23.630036
- Chen Li et al. doi.org/10.1101/2024.12.20.629581
- Wong et al. doi.org/10.1093/bioinformatics/btaf317
And probably many more that I am missing here.
04.08.2025 13:52
๐ 10
๐ 0
๐ฌ 1
๐ 0
- @kasparmartens.bsky.social et al. openreview.net/forum?id=eb3ndUlkt4
- Gaudelet et al. doi.org/10.48550/arXiv.2404.16907
- @aaronwtr.bsky.social et al. openreview.net/forum?id=t04D9bkKUq
- Bendidi et al. doi.org/10.48550/arXiv.2410.13956
- Wu et al. doi.org/10.48550/arXiv.2408.10609
04.08.2025 13:52
๐ 9
๐ 0
๐ฌ 1
๐ 0
I also encourage everyone to checkout the parallel efforts by groups around the world who came to similar conclusions:
- @ekernf01.bsky.social et al. doi.org/10.1101/2023.07.28.551039
- Csendes et al. doi.org/10.1186/s12864-025-11600-2
- @kasia.codes et al. doi.org/10.1186/s13059-025-03574-x
04.08.2025 13:52
๐ 13
๐ 0
๐ฌ 1
๐ 0
A recap of virtual cell releases circa June 2025
In October 2024, I twote that โsomething is deeply wrongโ with what we now call virtual cell models. A lot has happened since then: modelers are advancing new architectures and mining new sources of i...
*We benchmarked scGPT, scFoundation, GEARS, CPA (which claim predictive ability), and scBERT, Geneformer, and UCE (which do not claim this ability). I can't comment on methods released in the last 5 months. For a summary of recent developments, see @ekernf01.bsky.social's post
04.08.2025 13:52
๐ 9
๐ 0
๐ฌ 1
๐ 0