Skip to content
@NVIDIA

NVIDIA Corporation

Pinned Loading

  1. cuopt cuopt Public

    GPU accelerated decision optimization

    Cuda 630 104

  2. cuopt-examples cuopt-examples Public

    NVIDIA cuOpt examples for decision optimization

    Jupyter Notebook 391 62

  3. open-gpu-kernel-modules open-gpu-kernel-modules Public

    NVIDIA Linux open GPU kernel module source

    C 16.5k 1.6k

  4. aistore aistore Public

    AIStore: scalable storage for AI applications

    Go 1.7k 231

  5. nvidia-container-toolkit nvidia-container-toolkit Public

    Build and run containers leveraging NVIDIA GPUs

    Go 3.9k 455

  6. GenerativeAIExamples GenerativeAIExamples Public

    Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

    Jupyter Notebook 3.7k 943

Repositories

Showing 10 of 645 repositories
  • cuda-quantum Public

    C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows

    NVIDIA/cuda-quantum’s past year of commit activity
    C++ 876 315 407 (16 issues need help) 85 Updated Dec 26, 2025
  • barney Public

    A Scalable (and Optionally, Data-Parallel) ANARI Multi-GPU Path Tracer

    NVIDIA/barney’s past year of commit activity
    C++ 21 Apache-2.0 4 2 0 Updated Dec 26, 2025
  • TensorRT-LLM Public

    TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

    NVIDIA/TensorRT-LLM’s past year of commit activity
    Python 12,479 1,979 520 479 Updated Dec 26, 2025
  • OSMO Public

    The developer-first platform for scaling complex Physical AI workloads across heterogeneous compute—unifying training GPUs, simulation clusters, and edge devices in a simple YAML

    NVIDIA/OSMO’s past year of commit activity
    Python 61 Apache-2.0 6 22 13 Updated Dec 26, 2025
  • stdexec Public

    `std::execution`, the proposed C++ framework for asynchronous and parallel programming.

    NVIDIA/stdexec’s past year of commit activity
    C++ 2,157 Apache-2.0 222 114 13 Updated Dec 26, 2025
  • TensorRT-Incubator Public

    Experimental projects related to TensorRT

    NVIDIA/TensorRT-Incubator’s past year of commit activity
    MLIR 116 22 37 (1 issue needs help) 12 Updated Dec 25, 2025
  • garak Public

    the LLM vulnerability scanner

    NVIDIA/garak’s past year of commit activity
    Python 6,675 Apache-2.0 735 265 (34 issues need help) 39 Updated Dec 25, 2025
  • warp Public

    A Python framework for accelerated simulation, data generation and spatial computing.

    NVIDIA/warp’s past year of commit activity
    Python 5,966 Apache-2.0 404 178 3 Updated Dec 25, 2025
  • Model-Optimizer Public

    A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM, TensorRT, vLLM, etc. to optimize inference speed.

    NVIDIA/Model-Optimizer’s past year of commit activity
    Python 1,724 Apache-2.0 223 57 56 Updated Dec 25, 2025
  • nvshmem Public

    NVIDIA NVSHMEM is a parallel programming interface for NVIDIA GPUs based on OpenSHMEM. NVSHMEM can significantly reduce multi-process communication and coordination overheads by allowing programmers to perform one-sided communication from within CUDA kernels and on CUDA streams.

    NVIDIA/nvshmem’s past year of commit activity
    C++ 427 48 20 13 Updated Dec 25, 2025