LLMs

Dec 05, 2025

NVIDIA Kaggle Grandmasters Win Artificial General Intelligence Competition

NVIDIA researchers on Friday won a key Kaggle competition many in the field treat as a real-time pulse check on humanity’s progress toward artificial general...

3 MIN READ

Dec 02, 2025

NVIDIA-Accelerated Mistral 3 Open Models Deliver Efficiency, Accuracy at Any Scale

The new Mistral 3 open model family delivers industry-leading accuracy, efficiency, and customization capabilities for developers and enterprises. Optimized...

6 MIN READ

Dec 01, 2025

Build Efficient Financial Data Workflows with AI Model Distillation

Large language models (LLMs) in quantitative finance are increasingly being used for alpha generation, automated report analysis, and risk prediction. Yet...

11 MIN READ

Nov 19, 2025

Breaking Through Reinforcement Learning Training Limits with Scaling Rollouts in BroRL

When training large language models (LLMs) with reinforcement learning from verifiable rewards (RLVR), one of the most compelling questions is how to overcome...

7 MIN READ

Nov 17, 2025

Pioneering AI Co-Scientists for Fusion Research and Cancer Treatment

AI is reshaping scientific research and innovation. Scientists can leverage AI to generate, summarize, combine, and analyze scientific data. AI models can find...

8 MIN READ

Nov 13, 2025

Achieve CUTLASS C++ Performance with Python APIs Using CuTe DSL

CuTe, a core component of CUTLASS 3.x, provides a unified algebra for describing data layouts and thread mappings, and abstracts complex memory access patterns...

9 MIN READ

Nov 10, 2025

How to Achieve 4x Faster Inference for Math Problem Solving

Large language models can solve challenging math problems. However, making them work efficiently at scale requires more than a strong checkpoint. You need the...

7 MIN READ

Nov 07, 2025

Benchmarking LLMs on AI-Generated CUDA Code with ComputeEval 2025.2

Can AI coding assistants write efficient CUDA code? To help measure and improve their capabilities, we created ComputeEval, a robust, open source benchmark for...

2 MIN READ

Nov 06, 2025

Enhancing GPU-Accelerated Vector Search in Faiss with NVIDIA cuVS

As companies collect more unstructured data and increasingly use large language models (LLMs), they need faster and more scalable systems. Advanced tools for...

11 MIN READ

Nov 06, 2025

Accelerating Large-Scale Mixture-of-Experts Training in PyTorch

Training massive mixture-of-experts (MoE) models has long been the domain of a few advanced users with deep infrastructure and distributed-systems expertise....

7 MIN READ

Nov 03, 2025

Make Sense of Video Analytics by Integrating NVIDIA AI Blueprints

Organizations are increasingly seeking ways to extract insights from video, audio, and other complex data sources. Retrieval-augmented generation (RAG) enables...

11 MIN READ

Oct 30, 2025

Streamline AI Infrastructure with NVIDIA Run:ai on Microsoft Azure

Modern AI workloads, ranging from large-scale training to real-time inference, demand dynamic access to powerful GPUs. However, Kubernetes environments have...

9 MIN READ

Oct 28, 2025

Develop Specialized AI Agents with New NVIDIA Nemotron Vision, RAG, and Guardrail Models

Agentic AI is an ecosystem where specialized language and vision models work together. They handle planning, reasoning, retrieval, and safety guardrailing....

9 MIN READ

Oct 24, 2025

How NVIDIA DGX Spark's Performance Enables Intensive AI Tasks

Today’s demanding AI developer workloads often need more memory than desktop systems provide or require access to software that laptops or PCs lack. This...

5 MIN READ

Oct 23, 2025

Train an LLM on NVIDIA Blackwell with Unsloth—and Scale for Production

Fine-tuning and reinforcement learning (RL) for large language models (LLMs) require advanced expertise and complex workflows, making them out of reach for...

5 MIN READ

Oct 22, 2025

Create Your Own Bash Computer Use Agent with NVIDIA Nemotron in One Hour

What if you could talk to your computer and have it perform tasks through the Bash terminal, without you writing a single command? With NVIDIA Nemotron Nano v2,...

14 MIN READ