Inspiration

Every day, we read thousands of words — articles, research papers, reports — but rarely see how ideas connect. We wanted to build a tool that could automatically visualize the relationships between concepts inside a document, transforming plain text into a structured map of understanding. That’s how LinkMosaic was born — to turn unstructured information into meaningful connections using AI and cloud automation.

What it does

LinkMosaic lets users upload a PDF or text file and automatically:

  • Extracts the content from the document.
  • Clusters related concepts and summarizes key points.
  • Generates a JSON-based knowledge map showing how ideas connect.
  • Uploads and displays this map visually through a simple web interface.

It’s a fast, serverless way to convert any document into an interactive network of insights.
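To make the idea of a JSON knowledge map concrete, here is a minimal sketch in Python. It assumes a simple nodes-and-edges schema and uses sentence-level co-occurrence of frequent terms as a stand-in for concept clustering. The schema, the stopword list, and the function name `build_knowledge_map` are illustrative assumptions, not LinkMosaic's actual format or method.

```python
import json
import re
from collections import Counter
from itertools import combinations

# Tiny illustrative stopword list (assumption; a real pipeline would use a fuller one).
STOPWORDS = {"the", "and", "for", "from", "with", "that", "this", "are", "was"}


def build_knowledge_map(text, top_n=5):
    """Return a dict with 'nodes' (frequent terms) and 'edges' (co-occurrence counts)."""
    sentences = [s for s in re.split(r"[.!?]+", text.lower()) if s.strip()]
    tokenized = [
        [w for w in re.findall(r"[a-z]+", s) if len(w) > 2 and w not in STOPWORDS]
        for s in sentences
    ]
    counts = Counter(w for words in tokenized for w in words)
    concepts = {w for w, _ in counts.most_common(top_n)}

    # Two concepts are linked each time they appear in the same sentence.
    edge_weights = Counter()
    for words in tokenized:
        present = sorted(concepts.intersection(words))
        for a, b in combinations(present, 2):
            edge_weights[(a, b)] += 1

    return {
        "nodes": [{"id": w, "count": counts[w]} for w in sorted(concepts)],
        "edges": [
            {"source": a, "target": b, "weight": n}
            for (a, b), n in sorted(edge_weights.items())
        ],
    }


if __name__ == "__main__":
    sample = "Lambda reads files in storage. Lambda writes maps to storage."
    print(json.dumps(build_knowledge_map(sample, top_n=3), indent=2))
```

A nodes-and-edges layout like this is easy to hand to a graph renderer later, which is one reason to prefer it over a nested tree for this kind of map.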

How we built it

Backend: AWS Lambda functions written in Python 3.11, orchestrated to process uploads and build maps.

Storage: Amazon S3 buckets store uploaded files and generated knowledge maps.

Integration: Amazon API Gateway connects the frontend to the backend, and files are uploaded to S3 securely via presigned URLs.

Frontend: A lightweight HTML/JS interface (hosted on GitHub Pages) lets users upload documents and view the final map.

Pipeline:

  • UploadSigner generates secure upload URLs
  • Orchestrator coordinates processing
  • MapBuilder outputs a final JSON map to S3

Everything runs serverlessly — no dedicated servers or databases required.
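The first pipeline step can be sketched as follows. This is a minimal, hedged example of what an UploadSigner-style Lambda might look like, not the project's actual code. It relies on boto3's `generate_presigned_url` for a `put_object` operation; the `UPLOAD_BUCKET` environment variable, the `filename` query parameter, the `uploads/` key prefix, and the response field names are all assumptions made for illustration.

```python
import json
import os
import uuid


def build_upload_response(s3_client, bucket, filename, expires_in=300):
    """Return an API Gateway proxy-style response carrying a presigned PUT URL."""
    # A random prefix keeps two uploads with the same filename from colliding;
    # basename() drops any directory components supplied by the caller.
    key = f"uploads/{uuid.uuid4().hex}/{os.path.basename(filename)}"
    url = s3_client.generate_presigned_url(
        "put_object",
        Params={"Bucket": bucket, "Key": key},
        ExpiresIn=expires_in,  # seconds until the URL stops working
    )
    return {
        "statusCode": 200,
        "headers": {"Content-Type": "application/json"},
        "body": json.dumps({"uploadUrl": url, "key": key}),
    }


def lambda_handler(event, context):
    # Imported here so build_upload_response can be exercised without boto3 installed.
    import boto3

    params = event.get("queryStringParameters") or {}
    filename = params.get("filename", "document.pdf")  # hypothetical parameter name
    return build_upload_response(
        boto3.client("s3"), os.environ["UPLOAD_BUCKET"], filename
    )
```

With this shape, the browser requests a URL from the API and then sends the file to S3 directly with an HTTP PUT, so the document itself never passes through Lambda. A frontend served from a different origin would also need CORS configured on both the API and the bucket.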

Challenges we ran into

Dependency packaging on AWS Lambda: Some Python libraries (like pdfminer and PyPDF2) required custom bundling to work correctly in the Lambda environment.

IAM permissions: S3 bucket access and cross-Lambda invocation policies were tricky to configure properly.

API Gateway integration: Adjusting the orchestrator to handle both S3 triggers and HTTP requests took extensive debugging.
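One way to handle both invocation paths is to normalize the incoming event before any processing starts. The sketch below assumes the standard S3 notification shape (`Records[0].s3.bucket.name` and `Records[0].s3.object.key`) and an API Gateway proxy event whose JSON body carries `bucket` and `key` fields. Those body field names and the helper name `extract_document_ref` are hypothetical, chosen only for illustration.

```python
import base64
import json
from urllib.parse import unquote_plus


def extract_document_ref(event):
    """Return (bucket, key) from an S3 notification or an API Gateway proxy event."""
    records = event.get("Records")
    if records and records[0].get("eventSource") == "aws:s3":
        s3 = records[0]["s3"]
        # Object keys are URL-encoded in S3 notifications (a space arrives as '+').
        return s3["bucket"]["name"], unquote_plus(s3["object"]["key"])

    if "body" in event:
        raw = event["body"] or "{}"
        if event.get("isBase64Encoded"):
            raw = base64.b64decode(raw).decode("utf-8")
        payload = json.loads(raw)
        # 'bucket' and 'key' are assumed field names for the HTTP request body.
        return payload["bucket"], payload["key"]

    raise ValueError("Unrecognized event shape")
```

Keeping this check in one small function means the rest of the orchestrator only ever sees a bucket and a key, regardless of how it was triggered.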

Time management: Balancing multiple moving parts — Lambda functions, front end, and API routes — was challenging under hackathon deadlines.

Accomplishments that we're proud of

  • Integrated multiple AWS services (Lambda, S3, and API Gateway) into one cohesive system.
  • Built a secure, scalable architecture with real-world use potential (document summarization, research tools, educational applications).
  • Learned how to debug complex serverless workflows across multiple AWS layers.

What we learned

  • How to design event-driven architectures on AWS.
  • Best practices for packaging Python dependencies for Lambda.
  • How to use presigned URLs for secure file uploads.
  • How modularizing cloud functions improves scalability and debugging.
  • That even small automation steps can drastically improve how we process and visualize information.

What's next for LinkMosaic

  • 🌐 Add interactive visualizations (D3.js or React Graph) for map viewing.
  • 🧩 Integrate LLMs via AWS Bedrock or OpenAI for deeper semantic analysis.
  • 🔍 Support multi-document clustering to build cross-source knowledge graphs.
  • 👥 Add user accounts for personalized maps and document histories.
  • 🧠 Expand the pipeline with intelligent summarization and citation mapping.
