A multi-tenant support agent with per-customer memory banks and global pattern detection so the bot knows about an outage before you finish explaining it. Built on Hindsight.
Building an AI support bot is easy. Building one that actually remembers the customer is the hard part. To solve this, Akshat Talwar and I built an agent architecture that deeply integrates with Hindsight memory. We were tired of support bots that treat every interaction like a first date. You can grind on system prompts for hours, but without a robust, isolated memory layer, the LLM is essentially an amnesiac. Customers hate repeating their order numbers every time they open a new chat window. For quite some time, we have been working on IRIS (Intelligent Recall & Issue Support). Instead of just another standalone chatbot wrapper, we built a headless, multi-tenant API service. It sits between an e-commerce brand's existing helpdesk and their order management system to provide automated support with actual long-term context. We just published a technical deep-dive on how the system is built. Here is a look under the hood at how we handle state: - Dumping raw chat logs into a single vector database is a disaster. The agent gets confused and cross-references different users. We use Hindsight (https://lnkd.in/gdCpFNpE) by Vectorize AI, Inc. to strictly segregate state into per-customer banks and global pattern banks. - Global pattern detection is a massive business lever. If 50 people report a "warehouse delay" at the same time, the global memory catches it. The bot stops troubleshooting individual users and immediately acknowledges the known outage, saving hundreds of redundant API calls. - Summarize, do not concatenate. We use intermediate reflection steps to summarize a user's profile before the main LLM call. This drastically improves accuracy and reduces token costs compared to cramming past transcripts into the context window. If you are an engineer dealing with stateful LLM applications or agent memory, we wrote up the exact code behind our memory routers and the lessons we learned. Check out the overall system here: https://lnkd.in/gEYeMKKF Read the full technical write-up on https://lnkd.in/gbMNCrUe