🕵️ Deepsint

One username → one trusted profile card.
Turning scattered OSINT signals into explainable insights, fast.


🌟 Inspiration

We all hear warnings about data breaches and the dangers of the internet.
We've also heard that someone can reconstruct everything about you using OSINT tools, without ever meeting you.

But in practice, existing OSINT tools are powerful yet slow, complex, and fragmented across 30 different tabs.
We wanted something simpler:
➑️ A single-click experience.
➑️ One username β†’ one trusted profile card.


⚙️ What It Does

  1. Input → a single username.
  2. Crawl → Blackbird scans the open web for likely matches.
  3. Collect → a web scraper gathers relevant, publicly available data from each hit.
  4. Reason → Cohere embeddings correlate personas, detect behavioral patterns, and connect fuzzy signals.
  5. Synthesize → outputs an explainable profile card:
    • Clean facts
    • Source links
    • Timestamps
    • Per-claim confidence scores
  6. Guardrails → public data only, PII redaction, audit trail, and sensitive inferences disabled by default.
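
To make step 5 concrete, here is a minimal Python sketch of a profile card in which every claim carries its source, timestamp, and confidence. The names (Claim, ProfileCard, min_confidence) are hypothetical and chosen for illustration; they are not taken from the Deepsint codebase.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone


@dataclass(frozen=True)
class Claim:
    """One fact on the card, always stored together with its evidence."""
    fact: str
    source_url: str
    observed_at: datetime
    confidence: float  # 0.0 (weak) to 1.0 (strong)


@dataclass
class ProfileCard:
    """Hypothetical container for the claims gathered for one username."""
    username: str
    claims: list[Claim] = field(default_factory=list)

    def add(self, claim: Claim) -> None:
        # Reject scores outside [0, 1] so the card stays comparable.
        if not 0.0 <= claim.confidence <= 1.0:
            raise ValueError("confidence must be within [0, 1]")
        self.claims.append(claim)

    def render(self, min_confidence: float = 0.0) -> str:
        # Show the strongest claims first and hide those below the threshold.
        lines = [f"Profile card: {self.username}"]
        kept = [c for c in self.claims if c.confidence >= min_confidence]
        for c in sorted(kept, key=lambda c: c.confidence, reverse=True):
            lines.append(
                f"- {c.fact} [{c.confidence:.2f}] "
                f"({c.source_url}, {c.observed_at:%Y-%m-%d})"
            )
        return "\n".join(lines)


if __name__ == "__main__":
    card = ProfileCard("example_user")
    card.add(Claim("Bio links to a code-hosting profile",
                   "https://example.com/a",
                   datetime(2024, 1, 2, tzinfo=timezone.utc), 0.9))
    print(card.render(min_confidence=0.5))
```

Keeping the source link and timestamp on the claim itself, rather than in a separate log, means a claim cannot be displayed without its evidence.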

🛠️ How We Built It

Pipeline architecture:

  • Discovery → Blackbird for username enumeration and candidate gathering.
  • Extraction → site-aware scraping of bios, handles, links, timestamps.
  • Normalization → unify fields, dedupe items, standardize time/text.
  • Correlation → reasoning models score cross-platform matches using:
    • Handle similarity
    • Cross-linked bios
    • Writing-style cues
    • Semantic similarity
  • Evidence Grading → assign confidence based on independent signals + recency.
  • Profile Card → concise summary with sources, timestamps, and caveats.
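
The Correlation and Evidence Grading stages above can be sketched roughly as follows. This is an illustration under stated assumptions, not our production scoring: the weights, the one-year half-life, the noisy-OR combination, and all function names are hypothetical. Embedding vectors are assumed to be precomputed (for example by an embedding model), and writing-style cues are left out to keep the example short.

```python
import math
from difflib import SequenceMatcher


def handle_similarity(a: str, b: str) -> float:
    """Case-insensitive string similarity between two handles, in [0, 1]."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def cosine_similarity(u, v) -> float:
    """Cosine similarity of two equal-length embedding vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0


def match_score(handle_sim: float, bios_cross_linked: bool,
                semantic_sim: float, weights=(0.3, 0.4, 0.3)) -> float:
    """Weighted sum of correlation signals (weights are assumed values)."""
    w_handle, w_link, w_sem = weights
    link = 1.0 if bios_cross_linked else 0.0
    # Negative cosine values are clipped so they cannot lower the score.
    return w_handle * handle_sim + w_link * link + w_sem * max(0.0, semantic_sim)


def grade_confidence(signal_strengths, age_days: float,
                     half_life_days: float = 365.0) -> float:
    """Combine independent signals (noisy-OR), then discount by recency."""
    miss = 1.0
    for s in signal_strengths:
        miss *= 1.0 - min(max(s, 0.0), 1.0)  # chance that every signal is wrong
    recency = 0.5 ** (age_days / half_life_days)  # halves every half-life
    return (1.0 - miss) * recency


if __name__ == "__main__":
    sim = handle_similarity("deepsint", "deep_sint")
    score = match_score(sim, True, cosine_similarity([1.0, 0.0], [0.8, 0.6]))
    print(round(score, 3), round(grade_confidence([0.5, 0.5], age_days=0), 3))
```

The noisy-OR step reflects the idea that several independent signals agreeing should raise confidence more than any one alone, while the decay term lowers the score of stale evidence.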

🚧 Challenges We Ran Into

  • Entity resolution is hard → avoiding false positives requires careful scoring & explicit caveats.
  • Noisy & incomplete data → profiles change, vanish, or contradict each other.
  • Anti-automation & rate limits → building a polite, robust collector without brittle hacks.
  • UX for trust → making confidence, evidence, and caveats visible without overwhelming users.

🏆 Accomplishments

  • Built a usable “username → trusted profile card” flow that delivers results in minutes, not hours.
  • Evidence-first design → every claim is traceable, timestamped, and scored.
  • Cross-platform correlation beyond exact string matches:
    • Semantic similarity
    • Image reuse detection
  • A clean, analyst-friendly UI → facts first, exploration second.

📚 What We Learned

  • In OSINT, speed is nothing without explainability.
  • Confidence scores + links build trust and catch mistakes early.
  • Most value comes from normalization & correlation, not just bigger models.
  • Ethical defaults are essential for adoption and long-term viability.

🚀 What’s Next for Deepsint

  • Name-based discovery → privacy-respecting search by name to widen correlation.
  • Image-based discovery → profile picture correlation layered with AI reasoning & embeddings.
