Reading Shelf
Books I keep coming back to.
Books that shaped how I think about machine learning, AI engineering, and shipping production-grade systems. Listed in no particular order. The reviews are honest; some are warmer than others.
-
O'Reilly Media 2024
AI Engineering Chip Huyen
If most AI books feel like they were written by someone who’s never shipped anything, AI Engineering by Chip Huyen is the opposite. It’s a hands-on, no-nonsense tour of what it actually takes to build and deploy AI systems without lighting your infrastructure, and sanity, on fire.This was my first actual AI book, not the “let’s re-explain linear regression for the 47th time” kind, and it delivered. What makes AI Engineering stand out is that it treats AI like what it really is: an engineering problem. Not a research paper. Not a hype machine. A messy, real-world system that has to work in production without breaking every five minutes. The book pulls off a rare trick, it’s comprehensive without being exhausting. Chip Huyen covers a ton of ground, but never drowns you in theory you’ll forget in two days. Everything feels intentional, practical, and immediately useful. The parts that really hit for me:- RAG and agents: finally explained in a way that makes you think, “oh, so that’s how people are actually improving these systems,” instead of vague architectural diagrams that go nowhere.
- Fine-tuning: concepts like quantization and PEFT are usually treated like insider secrets. Here, they’re explained clearly enough that you can actually use them, not just nod along pretending you understood.
- Observability, testing, monitoring: the least sexy part of AI, and therefore the one most teams ignore until everything explodes. This section alone should be mandatory reading for anyone shipping LLM apps.
-
StatQuest 2022
The StatQuest Illustrated Guide To Machine Learning Josh Starmer
This book takes brain-melting machine learning algorithms and chops them into snackable, bite-sized chunks you can actually digest without a PhD.Josh Starmer is basically the Gandalf of machine learning—he makes all the scary, math-heavy stuff approachable without turning you into a zombie from endless equations. This is the book you grab if you want a solid grip on ML without drowning in jargon or crying over derivatives.
Sure, it won’t replace a full-on math bootcamp, but it gives you the big picture and shows how the gears turn behind the scenes.
My favorite bit? Watching Josh walk you through turning raw input data into predictions.- First, he fiddles with weights and biases like a chef seasoning a dish.
- Then, he reveals the “magic” of gradient descent so your model actually learns something.
-
StatQuest 2025
The StatQuest Illustrated Guide to Neural Networks and AI Josh Starmer
This book takes you on a wild ride from basic neural networks all the way to state-of-the-art Transformers powering modern AI like ChatGPT, and yes, it even throws in hands-on PyTorch tutorials.I went in super hyped because I loved Josh Starmer's previous book. And while this one covers a lot of ground, the PyTorch examples felt a bit like Groundhog Day—helpful for building architectures from scratch, but light on the kind of juicy variations that make you truly understand the concepts.
Still, it’s a solid resource if you want a high-level grip on neural networks without drowning in math. Like the last book, this isn’t about deep equations—it’s the big picture view, showing how networks tick.
Where it shines is the peek behind the curtain at the latest AI toys: encoder-only Transformers, decoder-only Transformers, and the many flavors of attention mechanisms. Josh grabs your hand and walks you step by step through the math that magically turns raw words into numbers and then back into words. It’s like watching a wizard translate gibberish into sense, and somehow, it works. -
O'Reilly Media 2025
Learning Langchain Mayo Oshin & Nuno Campos
If you’re building LLM apps and aren’t using LangChain as your skeleton, what are you even doing? This book is basically your backstage pass to all the secret sauce hidden in the LangChain ecosystem. It smartly juggles theory and code without making you feel like you need a PhD in AI. Sure, the chapters on testing and monitoring lean on a LangSmith subscription, but don’t let that scare you—this guide will take you from your first dumb prompt to a production-ready LLM setup faster than you can say 'prompt engineering.'This book doesn’t just talk the talk—it makes your code walk the walk. LangChain Expression Language gets demystified like a magician revealing tricks, with the pipe operator and batch invocations turning messy workflows into neat, readable chains. Preprocessing? Parallel inference? Check. RAG and agentic logic? Covered, with advanced toys like ColBERT, RAPTOR, and Reciprocal Rank Fusion handed to you with practical patterns for reflection and modular subgraphs. Forget just theory: you’ll learn how to build self-correcting agents and slick retrieval routing like a pro. If you want your autonomous systems to actually perform reliably, this is your blueprint. -
O'Reilly Media 2025
Managing Memory for AI Agents Benjamin Labaschin, Jim Allen Wallace, Andrew Brookins, and Manvinder Singh
Lately, I swear the universe is nudging me towards Redis, and I’m not even mad about it.It all kicked off on YouTube when Salvatore Sanfilippo, Redis’ founding wizard, popped up in my feed. I knew zilch about Redis, but curiosity got the better of me. Then Instagram decided to guilt-trip me with an O'Reilly ad for Managing Memory for AI Agents, a collab between the Redis crew and O'Reilly. The title hit me like a paper cut, so I figured, yeah, probably should read that.
Thankfully, 'should' quickly became 'did'. True to O'Reilly’s efficient style, the book is detailed yet compact—a rare combo. I absolutely geeked out over the authors’ metaphor using System 1 and 2 from Kahneman’s Thinking, Fast and Slow to explain agentic memory types and multimodal architectures. If you know that book, it’s like sliding into a warm, familiar pair of socks.
Bonus points for the section on Collective Intelligence, which was completely new to me but super fascinating. And the part dissecting framework vs custom architecture? Classic Joel Spolsky territory—timeless dilemmas, well served.