GitLab Knowledge Graph

This page contains information related to upcoming products, features, and functionality. It is important to note that the information presented is for informational purposes only. Please do not rely on this information for purchasing or planning purposes. The development, release, and timing of any products, features, or functionality may be subject to change or delay and remain at the sole discretion of GitLab Inc.
Status Authors Coach DRIs Owning Stage Created
ongoing michaelangeloio michaelusa jgdoyon1 bohdanpk ahegyi shekharpatnaik andrewn dgruzd michaelangeloio devops analytics 2025-10-12

Overview

The GitLab Knowledge Graph (Orbit) is a Rust service that builds a property graph from GitLab instance data (SDLC metadata and code structure) and exposes it through a JSON-based Cypher-like DSL compiled to ClickHouse SQL. It provides a unified context API for AI systems (via MCP) and human users.

The service indexes two types of data into property graph format:

  • SDLC metadata: issues, merge requests, CI pipelines, work items, groups, projects, and other GitLab entities streamed via Siphon CDC from PostgreSQL through NATS into ClickHouse.
  • Code: call graphs, definitions, references, and repository metadata fetched from Gitaly and parsed into ClickHouse graph tables.

Architecture

flowchart LR
    GitLab[GitLab Core] -- CDC replication --> DIP[Data Insights Platform]
    GKG -- Git RPC --> GitLab
    DIP -- datalake --> CH[(ClickHouse)]
    CH <-- graph tables --> GKG[Knowledge Graph · Orbit]
    GitLab -. gRPC / AuthZ .-> GKG

    style GitLab fill:#333,color:#fff,stroke:#333
    style DIP fill:#6E49CB,color:#fff,stroke:#6E49CB
    style CH fill:#FFCC00,color:#000,stroke:#FFCC00
    style GKG fill:#FC6D26,color:#fff,stroke:#FC6D26

Design documents

The full design documents now live alongside the code in the knowledge-graph repository:

Resources

Resource Location
Repository gitlab-org/orbit/knowledge-graph
Primary epic #19744
Program page internal handbook