extrapolaTED is a pioneering application designed to craft TED-like lectures on any conceivable topic on demand. Through a fusion of AI and human genius, we're pushing the boundaries of education and information dissemination. Our mission is to democratize knowledge, making every recent scientific achievement known and inspiring to all.

🚀 Features

  • Boundless Exploration: Venture into any subject with TED-like insights powered by Generative AI.
  • Holistic Understanding: Utilize a rich tapestry of multi-modal resources, including Arxiv papers and Wiki.
  • Creative Generation: Witness the blend of ChatGPT and Stable Diffusion in producing captivating stories.
  • Voice Synthesis: Experience smooth, natural narrations courtesy of ElevenLabs API.
  • Multi-step API Calls: Harness the power of Wordware for efficient API orchestration.

💡 Methodology

Technology

Embark on a technological odyssey encompassing retrieval, embedding, story, and image generation:

  • USearch is our go-to technology for Semantic Vector Search.
  • UForm takes charge of Vision Language Understanding.
  • Wordware facilitates multi-step API calls, forming a cohesive workflow.
  • ChatGPT transforms raw data into engaging narratives, bringing topics to life.
  • Stable Diffusion creates captivating visuals that resonate with the generated content.
  • ElevenLabs API gives a recognizable voice to our content, making the learning experience more immersive.

Data

Dive deep into the heart of extrapolaTED with a wide array of datasets that serve as the bedrock of our content generation:

  • TED Dataset: With over 1,000 transcripts, this dataset provides a profound understanding of the topics already covered in TED talks, aiding in exploring new territories.
  • Arxiv Abstracts (unum-cloud/ann-arxiv-2m): A treasure trove of 2 million vectorized abstracts summarizing the latest strides in scientific research.
  • WIT - Wikipedia Images Dataset: A rich collection of well over 3 million images aiding in the visual representation of generated content.
  • Wikipedia Abstracts (wikipedia): The 6 million abstracts in this dataset are a solid foundation for textual content, providing ground-truth retrieval of factual information.

Built With

Share this project:

Updates