Allen Downey's Avatar

Allen Downey

@allendowney

Former professor at Olin College, principal data scientist at PyMC Labs, author of Think Python, and Probably Overthinking It -- blog and book -- and stark raving Bayesian.

5,069
Followers
73
Following
160
Posts
22.06.2023
Joined
Posts Following

Latest posts by Allen Downey @allendowney

Thanks to my friends at @datascienceweekly for featuring Probably Overthinking It ... now available in paperback!

11.12.2025 22:59 πŸ‘ 7 πŸ” 1 πŸ’¬ 1 πŸ“Œ 0

Honestly, this is a weakness of my writing -- I don't do enough signposting. But contrary to what I actually do, I think there should be something at the beginning that presents the value proposition and something at the end that provides the big picture.

04.12.2025 18:51 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Preview
The Lost Chapter - Probably Overthinking It I’m happy to report that Probably Overthinking It is available now in paperback. If you would like a copy, you can order from Bookshop.org and Amazon (affiliate links). To celebrate, I’m publishing Th...

Probably Overthinking It is now available in paperback.

To celebrate, I'm publishing The Lost Chapter, which is about the strangest paradox in probability, the girl named Florida problem.

The key to the problem is Captain Chelsea Sullenberger.

www.allendowney.com/blog/2025/12...

04.12.2025 13:48 πŸ‘ 3 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

Modeling the SAT Math Gap with #PyMC

Why do male test takers score ~30 points higher? Ability or selection bias?

At PyData Boston 2025, @allendowney.bsky.social shows how Bayesian modeling untangles confounding to reveal what’s real.

#BayesianModeling #DataScience

01.12.2025 16:02 πŸ‘ 1 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0
Preview
It's Levels - Probably Overthinking It A recent Reddit post asks β€œAmateur athletes of Reddit: what’s your β€˜There’s levels to this shit’ experience from your sport?” Responses included: We have some good runners who can win local races … An...

β€œI’m way closer to LeBron James than you are to me.”
-- Brian Scalabrine

He's probably right, because in a lognormal distribution of ability, it's levels to this …

www.allendowney.com/blog/2025/11...

04.11.2025 15:52 πŸ‘ 5 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0
Preview
Clojure DSP meeting 2025-11-02 Once in a while, a few of the Scicloj freinds will meet to learn about signal processing, following the Think DSP book by Allen B. Downey, and implementing things in Clojure. Our notes will be publish...

The #Scicloj group is initiating a new study group:

#DSP in #Clojure.

We will follow @allendowney.bsky.social's #ThinkDSP book and practice it in Clojure.

clojureverse.org/t/clojure-ds...

02.11.2025 10:22 πŸ‘ 4 πŸ” 2 πŸ’¬ 0 πŸ“Œ 0
Video thumbnail

The newest chapter of Think Linear Algebra is up now!

It is about least squares regression, QR decomposition, and orthogonality:

allendowney.github.io/ThinkLinearA...

29.10.2025 14:30 πŸ‘ 16 πŸ” 4 πŸ’¬ 0 πŸ“Œ 1
Preview
Cancer Survival Rates Are Misleading - Probably Overthinking It Five-year survival might be the most misleading statistic in medicine. For example, suppose 5-year survival for a hypothetical cancer is What can we infer from these statistics? In fact, none of these...

4. And if 5-year survival increases over time, it is tempting to conclude that treatment has improved.

In fact, none of these inferences are correct.
www.allendowney.com/blog/2025/10...

27.10.2025 13:58 πŸ‘ 5 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

2. Looking at the difference in survival between early and late detection, it is tempting to conclude that more screening would save lives.

3. In a case where a patient is diagnosed late and dies of cancer, it is tempting to say that they would have survived if their cancer had been caught early.

27.10.2025 13:58 πŸ‘ 0 πŸ” 0 πŸ’¬ 2 πŸ“Œ 0

Five-year survival rates might be the most misleading statistics in medicine.

Even smart people can make incorrect inferences. Here are the top four:

1. If a patient is diagnosed early, it is tempting to think the probability is 91% that they will survive five years after diagnosis.

27.10.2025 13:58 πŸ‘ 0 πŸ” 2 πŸ’¬ 1 πŸ“Œ 0

And we've got the graduates to prove we were right.

22.10.2025 19:29 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

The curriculum at Olin College is our answer to this question:

1) Engineering early and often,

2) Emphasize design, entrepreneurship, teamwork, and communication,

And my focus was on

3) Use computational tools before or instead of math

22.10.2025 19:28 πŸ‘ 1 πŸ” 0 πŸ’¬ 2 πŸ“Œ 0

The effect of this error on engineering education is like the effect of the iceberg on the Titanic.

22.10.2025 18:52 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Preview
The Foundation Fallacy - Probably Overthinking It At Olin College recently, I met with a group from the Kyiv School of Economics who are creating a new engineering program. I am very impressed with the work they are doing, and their persistence despi...

The original sin of the engineering curriculum is the Foundation Fallacy:

The assumption that math (especially calculus) and science (especially physics) are (1) the foundations of engineering, and therefore (2) the prerequisites of engineering education.

www.allendowney.com/blog/2025/10...

22.10.2025 18:52 πŸ‘ 10 πŸ” 0 πŸ’¬ 2 πŸ“Œ 0

My new auxiliary emergency backup team is taking it down to the wire...

21.10.2025 02:51 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Video thumbnail

I just posted a new chapter of Think Linear Algebra.

It's about projection, rejection, rotation, and pool!

allendowney.github.io/ThinkLinearA...

19.10.2025 17:54 πŸ‘ 11 πŸ” 2 πŸ’¬ 0 πŸ“Œ 0

Yes, I had not made that distinction, and you are very right.

I'm not sure I would have bothered debunking it if I had realized.

16.10.2025 21:02 πŸ‘ 1 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Oh, no -- it gets worse! It looks like they also including missing data in the analysis, treating it as zero. That explains the black line in the figure.

16.10.2025 20:19 πŸ‘ 4 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Preview
Simpson's What? - Probably Overthinking It I like Simpson’s paradox so much I wrote three chapters about it in Probably Overthinking It. In fact, I like it so much I have a Google alert that notifies me when someone publishes a new example (or...

I love a good Simpson's paradox. Sadly, this is not one of them
www.allendowney.com/blog/2025/10...

In fact, I think the whole paper is nonsense.

Published in Nature, too.

16.10.2025 20:05 πŸ‘ 13 πŸ” 2 πŸ’¬ 2 πŸ“Œ 0
Allen Downey - Extremes, outliers, and GOATS: on life in a lognormal world | PyData Global 2023
Allen Downey - Extremes, outliers, and GOATS: on life in a lognormal world | PyData Global 2023 YouTube video by PyData

I gave a talk about that chapter here: www.youtube.com/watch?v=44D1...

14.10.2025 23:25 πŸ‘ 0 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0

Thanks! It was a fun interview to record.

Now if only the playback had speed control :(

14.10.2025 23:25 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Sadly, my primary team (the Red Sox) and emergency backup team (the Padres) were both knocked out of the playoffs yesterday.

Now I am left to cheer for my team of last resort (the Notyankees).

03.10.2025 17:40 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

Sometimes we can use Bayesian methods to infer the effect of selection bias and produce an unbiased estimate.

Here's an example that uses PyMC to solve a classic probability puzzle (the image shows what I think is the original version).

www.allendowney.com/blog/2025/09...

26.09.2025 13:43 πŸ‘ 8 πŸ” 2 πŸ’¬ 0 πŸ“Œ 0

At this point I'm just barely making enough $ on this cohort to cover the platform fees.

It's going to run anyway but I'd really love to have a few more people signed up.

Use the code SIXTY for 60% off at registration.

24.09.2025 20:02 πŸ‘ 7 πŸ” 8 πŸ’¬ 1 πŸ“Œ 2
Video thumbnail

I have published five new chapters of Think Linear Algebra!

Read about the project here
allendowney.com/blog/2025/09...

Or jump straight to the book
allendowney.github.io/ThinkLinearA...

And now… Asteroids!

22.09.2025 14:41 πŸ‘ 9 πŸ” 2 πŸ’¬ 0 πŸ“Œ 0
Preview
Presentation Night: A future of data science (Allen Downey), Wed, Sep 3, 2025, 6:00 PM | Meetup **A future of data science** *Allen Downey* In the hype cycle of data science, I suggest that the "peak of inflated expectations" was in 2012, the "trough of disillusionme

On September 3 I'm giving a talk for the Boston Python User Group, called "A Future of Data Science"

www.meetup.com/bostonpython...

This is a talk from posit::conf last year, updated with new data and the experience of an interesting year.

19.08.2025 19:41 πŸ‘ 13 πŸ” 3 πŸ’¬ 0 πŸ“Œ 0
Post image

Between 2021 and 2024, marijuana was legalized in eight states totaling 18% of the US population. During this time, adult use increased and youth use was unchanged.

Data from NSDUH 2024.

13.08.2025 17:16 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

As a graduate of an all-boys high school, I am very interested to see the results...

24.07.2025 22:04 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

For anyone who likes LLMs and daytime game shows.

24.07.2025 14:05 πŸ‘ 8 πŸ” 2 πŸ’¬ 0 πŸ“Œ 0
Preview
Are Young Men Veering Right? Not really. On most issues, gender gaps are small and not much different among young and older people

Some news articles suggest young men are conservative Republicans with sexist attitudes.

But big picture, young men's views are pretty much on trend.

Here's the data: allendowney.substack.com/p/are-young-...

23.07.2025 14:15 πŸ‘ 5 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0