-
-
WorkFlow of EchoPod
-
Generates structured content using Bedrock and stores it in S3 via Lambda and Step Functions.
-
Polly runs in parallel to generate MP3s for each chapter, storing them in S3 efficiently.
-
Compresses audio parts in parallel and saves the final podcast-ready MP3 in S3.
-
Secure login and signup powered by Amazon Cognito, with built-in session and user management.
-
Browse playlists, customize topics, and enjoy voice + transcript playback—all in one place.
-
Generate, listen to, and read your AI-powered audio course with synced transcripts.
-
Welcome to EchoPod – Smart Audio Learning on the Go. Your personalized, AI-powered podcast learning experience
Inspiration
We’re full-time students, drowning in textbooks, staring at screens, and constantly multitasking. But what if learning could happen while living—walking to class, washing dishes, or just lying down with your eyes closed?
We craved a platform that could turn any topic into a personalized podcast, instantly. Not generic content. Not pre-recorded lectures. Something smart, fun, and completely tailored to us. And that's how EchoPod was born—your AI-powered podcast generator, built for the way we actually live and learn.
What it does
EchoPod takes your curiosity and turns it into a binge-worthy audio experience—in under 2 minutes.
Here’s what happens:
- 🎯 You enter a topic — anything from “Binary Trees” to “Quantum Entanglement”
- 🧠 Choose a category (Programming, Science, Math, etc.)
- 🎭 Select a tone (Factual, Conversational, or Storytelling)
- 📚 Set a difficulty level (Beginner, Intermediate, Advanced)
Then the magic begins 💫
- EchoPod generates up to 7 well-structured chapters, each with intros, examples, and takeaways — crafted using Amazon Bedrock (Claude Sonnet 3.5)
- Each chapter is transformed into audio using Amazon Polly, running in parallel for blazing-fast output
- The final result is a personalized podcast, ready to play:
- ✅ Synced transcripts
- 🎧 Chapter navigation
- 🗣️ Narration modes
- 🎙️ Speech-to-text support for entering new topics hands-free
- ✅ Synced transcripts
It’s your learning assistant, narrator, and podcast host—all in one.
How we built it
EchoPod runs on a fully serverless, event-driven architecture, designed for speed, scalability, and cost efficiency:
🧑💻 Frontend
Built with Flutter for smooth cross-platform support (Android & iOS), including gesture navigation and transcript playback.
🧠 Content Generation
- Uses Amazon Bedrock (Claude 3.5) to generate structured content
- Custom prompts based on topic, tone, difficulty, and category
- Outputs a chapter-wise structured JSON saved to S3
🔊 Audio Generation
- Amazon Polly converts each chapter to MP3
- Polly tasks run in parallel using AWS Step Functions Map State
- Audio files are stored in Amazon S3, organized by chapter
📦 Compression & Finalization
- Audio parts are compressed in parallel using another Step Function
- A dedicated Lambda function merges and finalizes the podcast-ready MP3
🔐 Authentication
- Amazon Cognito handles secure login and signup
We built the backend using FastAPI (Python), integrated Boto3 for AWS service access, and managed orchestration with Event Driven Architecture.
Challenges we ran into
- 💥 Parallel Polly Headaches: Handling multiple Polly tasks while keeping chapter order and combining parts on-the-fly was tricky
- 🔁 Syncing Audio + Transcript: Polly doesn’t return timestamps, so we had to map our own logic to keep transcript display accurate
- 🎛️ Prompt Tuning: Getting the perfect tone + difficulty match without making the content dry or too complex took so much tweaking
- 🔐 Cognito Integration: Handling auth tokens, refreshes, and mobile flow while keeping it smooth for users was a learning curve
Accomplishments that we're proud of
- 🧠 Full podcast generation in under 2 minutes, with up to 7 chapters
- ⚡ Parallel processing of both content and audio—speed without sacrificing structure
- 🎧 Beautiful mobile experience: swipe through chapters, tap to listen, read along
- 🗣️ Speech-to-text and narration modes make it fun, accessible, and easy to use
- 💻 Actually used it to revise for a Systems Design exam while cooking pasta. Yes, it worked.
What we learned
- 🔄 How to design parallel Lambda + Step Function pipelines for high-speed TTS workflows
- 📦 The power of modular, event-driven backend design
- 🧠 How prompt design affects everything—tone, quality, engagement
- 🎯 That people love personalization—especially in how they consume knowledge
- 🔊 That good UX for audio-first learning is not just possible—it’s essential
What's next for EchoPod
- 💬 Quiz Mode: AI-generated comprehension checks after each chapter
- 👥 Narration Style Toggle: Monologue vs Dialogue learning styles
- 🌍 Multi-language Support: Spanish, Hindi, Korean, Arabic & more
- 📱 Playlist Builder: Save your favorite podcasts by topic
- 🔊 Platform Integrations: Spotify, Audible, and LMS tools
- 🧩 Chapter Visualizer: Auto-generated visuals or diagrams per chapter
Built With
- amazon-amplify
- amazon-api-gateway
- amazon-bedrock
- amazon-cognito
- amazon-dynamodb
- amazon-lambda
- amazon-polly
- amazon-step-function
- amazon-web-services
- amazon-websockets
- claude
- flutter
Log in or sign up for Devpost to join the conversation.