Docling Project

Welcome to the Docling Project

This is the GitHub organization of the Docling open-source project. We like to get continuous feedback from the community: take the poll!

Docling

Docling is our main open-source package. It is a powerful library that simplifies document processing: it parses diverse formats, including advanced PDF understanding, and provides seamless integrations with the gen AI ecosystem.

We support an amazing community that helps us drive the adoption of Docling forward. Give it a try and join the community!

The key repositories of Docling are:

  • docling - The home of the main docling package.
  • docling-core - The definition of types, transforms, serializers, etc. If it has to do with the DoclingDocument you will find it here.
  • docling-parse - The backend PDF parser used by Docling.
  • docling-serve - The FastAPI wrappers for running Docling as a REST API and distributing large jobs.
  • docling-ibm-models - The AI models powering Docling.
  • docling-sdg - Synthetic data generation (SDG) on documents for dataset generation for RAG, finetuning, etc.
  • docling-mcp - The definition of tools with the Model Context Protocol for document conversion, manipulation and generation agents.
  • docling-java - A Java API for interacting with Docling, currently based on docling-serve.

LF AI & Data

Docling is hosted as a project in the LF AI & Data Foundation.

IBM ❤️ Open Source AI

The project was started by the AI for knowledge team at IBM Research Zurich.

Pinned

  1. docling Public

    Get your documents ready for gen AI

    Python, 56.8k stars, 3.9k forks

  2. docling-serve Public

    Running Docling as an API service

    Python, 1.4k stars, 285 forks

  3. docling-core Public

    Docling core data types and transformations

    Python, 239 stars, 147 forks

  4. community Public

    6 stars, 1 fork

Repositories

Showing 10 of 25 repositories
  • docling-java Public

    A Java API for Docling

    Java, 101 stars, MIT license, 16 forks, 9 issues (1 issue needs help), 2 pull requests, updated Mar 30, 2026
  • docling-graph Public

    Transform unstructured documents into validated, rich and queryable knowledge graphs.

    Python, 117 stars, MIT license, 18 forks, 3 issues, 4 pull requests, updated Mar 30, 2026
  • docling-parse Public

    Simple package to extract text with coordinates from programmatic PDFs

    C++, 261 stars, MIT license, 58 forks, 40 issues, 8 pull requests, updated Mar 30, 2026
  • docling-core Public

    Docling core data types and transformations

    Python, 239 stars, MIT license, 147 forks, 46 issues (1 issue needs help), 28 pull requests, updated Mar 30, 2026
  • docling-eval Public

    Evaluation framework for document processing models and services.

    Python, 67 stars, MIT license, 11 forks, 10 issues, 12 pull requests, updated Mar 30, 2026
  • docling-cvat-tools Public

    Collection of CVAT parsing and campaign utilities for Docling

    Python, 1 star, MIT license, 1 fork, 0 issues, 2 pull requests, updated Mar 30, 2026
  • docling Public

    Get your documents ready for gen AI

    Python, 56,760 stars, MIT license, 3,857 forks, 830 issues (7 issues need help), 36 pull requests, updated Mar 30, 2026
  • docling-metrics Public

    Core package for type and interface definitions of docling metric implementations

    C++, 4 stars, MIT license, 1 fork, 1 issue, 1 pull request, updated Mar 30, 2026
  • docling-serve Public

    Running Docling as an API service

    Python, 1,380 stars, MIT license, 285 forks, 105 issues, 5 pull requests, updated Mar 30, 2026
  • docling-jobkit

    Python, 26 stars, MIT license, 22 forks, 10 issues, 3 pull requests, updated Mar 30, 2026