Image
user avatar
Chip Huyen
@chipro
San Francisco, CA
Joined June 2008
Posts
  • Pinned
    user avatar
    My 8000-word note on agents: huyenchip.com//2025/01/07/ag… Covering: 1. An overview of agents 2. How the capability of an AI-powered agent is determined by the set of tools it has access to and its capability for planning 3. How to select the best set of tools for your agent 4.
  • user avatar
    Image
    Small-but-happy win: If you tell ChatGPT not to use em-dashes in your custom instructions, it finally does what it's supposed to do!
  • user avatar
    $100 for anyone who can show me how to get ChatGPT to stop using emdashes. it's driving me insane
    Image
    Image
    Image
    Image
  • user avatar
    This thread is a combination of 10 free online courses on machine learning that I find the most helpful. They should be taken in order.
  • user avatar
    I wrote an 8k word doc on machine learning systems design. This covers: 1. Project setup 2. Data pipeline 3. Training & debugging 4. Serving with case studies, resources, and 27 exercises. This is the 1st draft so feedback is much needed. Thank you! github.com/chiphuyen/mach…
  • user avatar
    Machine learning engineering is 10% machine learning and 90% engineering.
  • user avatar
    OMG I'M SO HAPPY IT'S FINALLY HERE!!!
    Image
  • user avatar
    Things I’d prioritize learning if I was to study to become a ML engineer again: 1. Version control 2. SQL + NoSQL 3. Python 4. Pandas/Dask 5. Data structures 6. Prob & stats 7. ML algos 8. Parallel computing 9. REST API 10. Kubernetes + Airflow 11. Unit/integration tests
  • user avatar
    Finally got my copy! “AI Engineering” is officially out 🙏 🎉
    Image
  • user avatar
    It’s done! 150,000 words, 200+ illustrations, 250 footnotes, and over 1200 reference links. My editor just told me the manuscript has been sent to the printers. - The ebook will be coming out later this week. - Paperback copies should be available in a few weeks (hopefully
    Image
  • user avatar
    My date: “You’re my number 1.” Me: “Are you zero indexed or one indexed?” Me: *single*
  • user avatar
    Sooo I wrote a 13,000-word lecture note on data distribution shifts, monitoring, and causes of ML failures. This was very difficult for me to write, because academia & industry literature use very different terminology. 
Feedback appreciated 🙏 docs.google.com/document/d/14u…
    Image
  • user avatar
    So I wrote a 5400-word lecture note on the basics of data engineering for my students, covering: * data formats (row- vs. column-based, text vs. binary) * ETL * batch processing vs. stream processing * training datasets WIP. Feedback much appreciated! docs.google.com/document/d/1b9…
    Image
  • user avatar
    My editors just shared with me the feedback from early reviewers and I'm in tears 😭 With the help of so many people, I worked really hard on this book. I'm grateful that people gave it a chance. Read the book online: learning.oreilly.com/library/view/d… Pre-order: amazon.com/Designing-Mach…
    Image
    Image