How I Published Three Papers This Year — Without a PhD or Research JobA mid-career engineer’s story of becoming a published ML researcherOct 18A response icon1Oct 18A response icon1
LLM Multi-GPU Batch Inference With AccelerateAn Implementation WalkthroughSep 10, 2023A response icon2Sep 10, 2023A response icon2
Solving The Issue of Falcon Text Generation Never StoppingHow to make an overly chatty bird stop talking.Jul 26, 2023A response icon1Jul 26, 2023A response icon1
Scalable Streaming of OpenAI Model Responses with FastAPI and asyncioA tutorialJul 13, 2023Jul 13, 2023