Elizabeth Atkinson (@egatkinson)

Thanks to all of our SMaHT colleagues and especially to @sedlazeck.bsky.social who led the hackathon which spawned the prototype of this pipeline!

09.12.2025 18:06 👍 0 🔁 0 💬 0 📌 0

MosaicSim offers a realistic, scalable approach for assessing detection limits, with immediate applications to large sequencing efforts including those within the SMaHT Network, which was the springboard for this work.

09.12.2025 18:06 👍 0 🔁 0 💬 1 📌 0

A key (surprising) result was that ultra-high coverage (300×–450×) yields diminishing returns for mosaic variant detection. In many settings, 150× coverage performs comparably or better, highlighting opportunities for cost-effective study design.

09.12.2025 18:06 👍 2 🔁 0 💬 1 📌 0

Using MosaicSim, we benchmarked DRAGEN and found strong VAF- and depth-dependent performance limits. Sensitivity decreases sharply at low VAF, especially in complex genomic regions.

09.12.2025 18:06 👍 0 🔁 0 💬 1 📌 0
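
One way to build intuition for the VAF- and depth-dependence described in the post above is an idealized binomial sampling sketch: if each read at a site independently carries the variant with probability equal to the VAF, how often do at least `k` supporting reads show up at a given depth? The function name and the `k = 3` threshold are illustrative assumptions, not MosaicSim or DRAGEN settings. The model deliberately ignores sequencing error, mapping artifacts, and caller filters, so it does not reproduce the benchmark results.

```python
from math import comb


def prob_at_least_k_alt_reads(depth: int, vaf: float, k: int = 3) -> float:
    """P(X >= k) for X ~ Binomial(depth, vaf).

    Idealized assumption: every read independently carries the variant
    with probability `vaf`; no sequencing error or mapping bias.
    """
    # Probability of observing fewer than k variant-supporting reads.
    p_fewer = sum(
        comb(depth, i) * vaf**i * (1 - vaf) ** (depth - i) for i in range(k)
    )
    return 1.0 - p_fewer


if __name__ == "__main__":
    # Hypothetical grid of depths and VAFs, chosen only for illustration.
    for depth in (150, 300, 450):
        for vaf in (0.005, 0.01, 0.05):
            p = prob_at_least_k_alt_reads(depth, vaf)
            print(f"depth={depth:>3}x  VAF={vaf:.3f}  P(>=3 alt reads)={p:.3f}")
```

In this toy model, raw read support only grows with depth, so it says nothing about why very deep coverage might stop helping in practice; that is the kind of question an empirical, noise-preserving benchmark is designed to answer.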

Detecting mosaic variants is challenging due to low VAFs and real sequencing noise. MosaicSim layers user-defined variants directly onto empirical WGS data, preserving true read-level properties while providing a controlled ground-truth set for benchmarking.

09.12.2025 18:06 👍 0 🔁 0 💬 1 📌 0

MosaicSim: A Novel Mosaic Variant Simulator Reveals Diminishing Returns of Ultra-High Coverage for Mosaic Variant Detection Genetic mutations within select cells of a tissue, termed mosaic variants (MV), are being increasingly recognized for their role in human disease. This growing interest underscores the need for specia...

We are pleased to share our new preprint introducing MosaicSim, a framework for generating realistic mosaic variants! Mosaic variants - mutations present in only a subset of cells - are crucial for development, disease, and cancer, but are notoriously hard to call.
www.biorxiv.org/content/10.6...

09.12.2025 18:06 👍 3 🔁 0 💬 1 📌 1

A fun lab outing to the zoo ahead of conference season! 🦒

12.10.2025 02:27 👍 2 🔁 0 💬 0 📌 0

Since we only include variants with MAF >0.1% in this article, we can't address ultra-rare variants, but check out Supp Fig 3: when comparing ancestry-specific AFs, many variants deviate from the 1:1 line. We plotted this on the log₁₀(AF) scale to help magnify the low-frequency range.

10.10.2025 15:32 👍 0 🔁 0 💬 0 📌 0

To limit the noise from ultra-rare alleles we only looked at variants ≥0.1% MAF. Totally appreciate that's still quite low frequency, but even with that filter, we still saw the noted ancestry-specific frequency differences.

10.10.2025 15:01 👍 0 🔁 0 💬 0 📌 0

Great point; we thought about that too! Pragati stratified by whether variants were monomorphic or not to capture at least that aspect, but you’re right that the impact depends on where a variant sits on the SFS. Rare ones can show big fold-changes but small absolute shifts.

10.10.2025 14:58 👍 0 🔁 0 💬 0 📌 0
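
The point in the post above about fold-changes versus absolute shifts can be made concrete with a little arithmetic. The sketch below uses made-up allele frequencies (not values from the paper) and a hypothetical helper name, purely to show how a rare variant can have a large ratio between two ancestry-specific frequencies while the difference stays tiny.

```python
def fold_and_absolute_shift(af_a: float, af_b: float) -> tuple[float, float]:
    """Return (fold-change, absolute difference) of af_b relative to af_a."""
    return af_b / af_a, af_b - af_a


if __name__ == "__main__":
    # Hypothetical frequencies for illustration only.
    examples = {
        "rare variant": (0.001, 0.005),
        "common variant": (0.30, 0.36),
    }
    for label, (af_a, af_b) in examples.items():
        fold, shift = fold_and_absolute_shift(af_a, af_b)
        print(f"{label}: fold-change={fold:.2f}, absolute shift={shift:.3f}")
```

With these assumed inputs the rare variant has the larger fold-change and the common variant has the larger absolute shift, which is why the position on the site frequency spectrum matters when interpreting the comparison.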

Texas Children's/Baylor College of Medicine Researchers Create Groundbreaking Tool to Improve Accuracy of #GeneticTesting @egatkinson.bsky.social @bcmgenetics.bsky.social @bcmhouston.bsky.social #TCHResearchNews #TexasChildrens @natcomms.nature.com tinyurl.com/jj6kyrrv

06.10.2025 14:16 👍 6 🔁 1 💬 0 📌 0

Thrilled to share our new @natcomms.nature.com paper on local ancestry informed allele frequencies in gnomAD, which are live now on the browser! Check out my stellar PhD student @pragskore.bsky.social’s Bluetorial on how this brings finer detail to variant interpretation 🧬🖥️

06.10.2025 18:44 👍 14 🔁 4 💬 1 📌 0

Pan-UK Biobank genome-wide association analyses enhance discovery and resolution of ancestry-enriched effects - Nature Genetics Genome-wide analyses for 7,266 traits leveraging data from several genetic ancestry groups in UK Biobank identify new associations and enhance resources for interpreting risk variants across diverse p...

After many years in the making, we’re pleased to present our work on multi-ancestry meta-analysis across a boatload of traits in the UK Biobank: www.nature.com/articles/s41...

18.09.2025 17:25 👍 64 🔁 25 💬 1 📌 0

Delighted to amplify my talented PhD student’s work! Check it out for a great way to streamline and harmonize Tractor analyses.

13.09.2025 00:47 👍 6 🔁 2 💬 0 📌 0

Thanks for the interest! The tutorial code is available to download as supplemental information of the paper, and has been deposited as a community workspace in the All of Us Researcher Workbench.

23.07.2025 15:05 👍 2 🔁 0 💬 0 📌 0

In summary, we present a replicable training model that empowers early-career researchers - including and especially those new to computational genomics - to responsibly bring large-scale biobank data into their research programs and teaching.

22.07.2025 16:36 👍 0 🔁 0 💬 1 📌 0

Across years 1–3, outcomes that scholars reported as stemming directly from this training included:
📊 17 conference presentations
🔬 Multiple funded research grants
🎓 Numerous genomics modules added in undergrad courses
🤝 Sustained collaborations across institutions

22.07.2025 16:36 👍 0 🔁 0 💬 1 📌 0

During the summit, scholars used real short-read WGS data to:
• Prepare phenotypes & covariates
• Run GWAS via Hail
• Visualize results with PCA, Manhattan & QQ plots
• Manage compute costs
All in ~4 hours with no prior coding required.

22.07.2025 16:36 👍 0 🔁 0 💬 1 📌 0

Our training was part of the All of Us Biomedical Researcher Scholars Program through @bcmgenetics.bsky.social focused on mentoring early-stage faculty in genomic data science. The curriculum launches with an intensive Faculty Summit, where scholars get hands-on experience working with genomic data.

22.07.2025 16:36 👍 0 🔁 0 💬 1 📌 0

Access to big genomic data is growing, but access to the skills needed to use it hasn’t kept pace.
We created an accessible, cloud-based genomic analysis training bootcamp using real All of Us data, Jupyter notebooks, and the Hail framework to lower the barrier for early-career researchers.

22.07.2025 16:36 👍 0 🔁 0 💬 1 📌 0

🚨 New perspective piece in @ajhgnews.bsky.social! 🚨
We developed a hands-on training resource for large-scale genomic data analysis in the All of Us Researcher Workbench, now published here:

22.07.2025 16:36 👍 13 🔁 8 💬 1 📌 0

Tractor-Mix builds on Tractor’s strengths to detect ancestry-enriched signals while adding power and robust false-positive control for relatedness via a GRM. By modeling both admixture and relatedness, it overcomes key GWAS barriers and enables more accurate, representative genomic discovery.

09.06.2025 18:31 👍 2 🔁 0 💬 0 📌 0

Tractor-Mix uses ancestry-specific genotypes as predictors, outputting ancestry-specific effect sizes and P values. We benchmark our new tool in simulations and apply it to multiple admixed cohorts (including UK Biobank and the Mexico City Prospective Study), uncovering signals missed by standard GWAS.

09.06.2025 18:31 👍 2 🔁 0 💬 1 📌 0

In this work, we introduce Tractor-Mix, a new GWAS method that extends Tractor to handle related admixed samples. It combines a mixed model framework (like GMMAT) with local ancestry-aware genotypes (like Tractor) in a 2 d.o.f. test.

09.06.2025 18:31 👍 2 🔁 0 💬 1 📌 0
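
For readers who want to see what "a mixed model with local ancestry-aware genotypes in a 2 d.o.f. test" could look like on paper, here is a generic sketch for the two-ancestry case. The notation is an assumption made for illustration and is not taken from the preprint: $X_i$ are covariates, $G_i^{(1)}$ and $G_i^{(2)}$ are the counts of the tested allele on haplotypes of ancestry 1 and ancestry 2, $K$ is the GRM, and $\tau$ is a variance component. The actual Tractor-Mix specification may include further terms.

```latex
% Generic GLMM with ancestry-specific genotype terms (illustrative notation)
g(\mu_i) = X_i \alpha + G_i^{(1)} \beta_1 + G_i^{(2)} \beta_2 + b_i,
\qquad b \sim \mathcal{N}(0, \tau K)

% Joint test of both ancestry-specific effects
H_0 : \beta_1 = \beta_2 = 0
\quad \text{vs.} \quad
H_1 : \beta_1 \neq 0 \ \text{or} \ \beta_2 \neq 0
```

Because two effect sizes are tested jointly, the test has two degrees of freedom, and each ancestry gets its own effect estimate, while the random effect $b$ absorbs relatedness through the GRM.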

As biobanks and global cohorts grow, so does the inclusion of admixed individuals with close or cryptic relatedness. This introduces the statistical challenge of two interwoven sources of stratification: admixture and relatedness, which are rarely handled together.

09.06.2025 18:31 👍 2 🔁 0 💬 1 📌 0

We previously developed Tractor, a local ancestry-aware GWAS method that’s been widely used to uncover ancestry-enriched signals and refine genetic architecture in admixed populations. But Tractor (being a GLM) only works on unrelated samples, limiting its use in many real-world datasets.

09.06.2025 18:31 👍 2 🔁 0 💬 1 📌 0

We're excited to introduce Tractor-Mix, our new method for GWAS in admixed cohorts with relatedness, led by the fantastic @doubletaotan.bsky.social! Read the full preprint here: www.medrxiv.org/content/10.1...
Thanks to all our amazing collaborators who helped make this work possible!

09.06.2025 18:31 👍 12 🔁 3 💬 1 📌 0

Human Genetics | Genomic Scientist Fellows | UCLA Medical School The Emerging Genomic Scientist Fellows Program is a cornerstone of justice, equity, diversity, and inclusion initiatives in the Department of Human Genetics.

Check out my stellar PhD student Pragati's talk on our work generating local ancestry-informed frequency estimates in gnomAD as part of the prestigious Emerging Genomic Scientist Symposium next week! Congrats on being selected for this amazing event!

09.04.2025 15:57 👍 5 🔁 1 💬 0 📌 0

I'm delighted to be part of this symposium, put on by University of Pennsylvania Perelman School of Medicine, and led by @bpasaniuc.bsky.social and @sarahtishkoff.bsky.social. See you in a few weeks! upenn.co1.qualtrics.com/jfe/form/SV_...

03.04.2025 14:48 👍 8 🔁 4 💬 0 📌 2

👏 Huge thanks to all our amazing LAGC collaborators! Special shoutout to Estela Bruxel and Diego Rovaris for leading this crucial work, and of course @janitzamontalvo.bsky.social and @giustilab.bsky.social for co-founding the LAGC and co-leading it alongside me. 💪

02.04.2025 15:24 👍 2 🔁 1 💬 0 📌 0
