Probably Overthinking It

Probably Overthinking It is for anyone who wants to use data to better understand the world. It explains the most important ideas using clear prose and data visualization instead of equations and code.

This book is my tribute to the power of data to answer questions, settle debates and help us make better decisions. But working with data is easy to get wrong, and sometimes mistakes have consequences.

Order from Bookshop.org or Amazon (affiliate links). Read more about the book at Goodreads.

The book is based on my blog, also called Probably Overthinking It, where I have posted some excerpts.

Supporting code for the book is in this GitHub repository.

I have presented several talks based on chapters of the book:

  • “The Inspection Paradox is Everywhere” at PyData NYC 2019. Slides, Video.
  • “Chasing the Overton Window” at PyData NYC 2022. Slides, Video.
  • “Taming Black Swans” at SciPy 2023. Slides, Video.
  • “Extremes, outliers, and GOATS: On life in a lognormal world” at PyData Global 2023. Slides, Video.
  • “Causation, Collision, and Confusion”, Talks at Google. Slides, Video.
  • “Who Wants To Live Forever”, ODSC East 2024. Slides.