Published in BentoML · A Guide to Model Composition · Gain an overview of model composition to build compound AI systems. · Jul 30, 2024
Serving A LlamaIndex RAG App as REST APIs · Build and serve a LlamaIndex RAG app as REST APIs with BentoML. · May 28, 2024
Published in BentoML · Building RAG with Open-Source and Custom AI Models · Retrieval-Augmented Generation (RAG) is a widely used application pattern for Large Language Models (LLMs). It uses information retrieval… · May 6, 2024
Published in BentoML · A Guide to Open-Source Image Generation Models · Understand open-source image generation models and find answers to frequently asked questions about them. · Mar 28, 2024
Published in BentoML · Deploying A Large Language Model with BentoML and vLLM · Build an LLM application with vLLM for enhanced efficiency and deploy it on BentoCloud for scalable, efficient AI solutions in the cloud. · Mar 22, 2024
Published in BentoML · Navigating the World of Large Language Models · Explore the most popular open-source large language models and find answers to common questions in using them. · Mar 22, 2024
Published in BentoML · Deploying Stable Diffusion XL with Latent Consistency Model LoRAs on BentoCloud · Accelerate image generation with LCM LoRAs on BentoCloud · Feb 29, 2024
Published in BentoML · Understanding Retrieval-Augmented Generation: Part 2 · Understand the practical applications of RAG, design ideas for a RAG system, and the prospect of this technology. · Feb 1, 2024
Published in BentoML · Understanding Retrieval-Augmented Generation: Part 1 · Learn how Retrieval-Augmented Generation (RAG) transforms AI, enhancing language models with dynamic, external data access. · Jan 25, 2024