Skip to content
@NVIDIA

NVIDIA Corporation

Pinned Loading

  1. cuopt cuopt Public

    GPU accelerated decision optimization

    Cuda 746 138

  2. cuopt-examples cuopt-examples Public

    NVIDIA cuOpt examples for decision optimization

    Jupyter Notebook 415 67

  3. open-gpu-kernel-modules open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C 16.8k 1.6k

  4. aistore aistore Public

    AIStore: scalable storage for AI applications

    Go 1.8k 240

  5. nvidia-container-toolkit nvidia-container-toolkit Public

    Build and run containers leveraging NVIDIA GPUs

    Go 4.1k 489

  6. GenerativeAIExamples GenerativeAIExamples Public

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Jupyter Notebook 3.8k 991

Repositories

Showing 10 of 683 repositories
  • Megatron-LM Public

    Ongoing research training transformer models at scale

    NVIDIA/Megatron-LM’s past year of commit activity
    Python 15,587 3,672 309 (1 issue needs help) 317 Updated Mar 11, 2026
  • NeMo-Retriever Public

    NeMo Retriever Library is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, contextualize, and extract text, tables, charts and images that you can use in downstream generative applications.

    NVIDIA/NeMo-Retriever’s past year of commit activity
    Python 2,857 Apache-2.0 304 104 (1 issue needs help) 60 Updated Mar 11, 2026
  • bare-metal-manager-core Public

    NVIDIA Bare Metal Manager - Hardware Lifecycle Management and multitenant networking

    NVIDIA/bare-metal-manager-core’s past year of commit activity
    Rust 89 Apache-2.0 58 82 (3 issues need help) 32 Updated Mar 11, 2026
  • cuCascade Public

    GPU Memory Reservation Library

    NVIDIA/cuCascade’s past year of commit activity
    C++ 27 Apache-2.0 14 11 2 Updated Mar 11, 2026
  • NVSentinel Public

    NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated computing environments

    NVIDIA/NVSentinel’s past year of commit activity
    Go 200 Apache-2.0 52 44 22 Updated Mar 11, 2026
  • TensorRT-LLM Public

    TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

    NVIDIA/TensorRT-LLM’s past year of commit activity
    Python 13,056 2,161 537 560 Updated Mar 11, 2026
  • IsaacTeleop Public

    The unified framework for sim & real robot teleoperation

    NVIDIA/IsaacTeleop’s past year of commit activity
    Python 12 Apache-2.0 3 4 13 Updated Mar 11, 2026
  • tilus Public

    Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.

    NVIDIA/tilus’s past year of commit activity
    Python 447 Apache-2.0 15 8 0 Updated Mar 11, 2026
  • Model-Optimizer Public

    A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

    NVIDIA/Model-Optimizer’s past year of commit activity
    Python 2,130 Apache-2.0 287 74 108 Updated Mar 11, 2026
  • cuda-quantum Public

    C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows

    NVIDIA/cuda-quantum’s past year of commit activity
    C++ 959 345 428 (16 issues need help) 114 Updated Mar 11, 2026