BI

BitLlama

Pure Rust LLM inference engine featuring 1.58-bit ternary quantization, Test-Time Training, Soul learning system, MCP server/client, and private RAG. Supports Llama, Gemma, Mistral, Qwen, and BitNet models. Includes OpenAI-compatible API server.
Latest: 1.0.0 GitHub
Last checked: Jun 9, 2026 12:10am
Rank: 9595/15140
Also monitored via:
Site Monitor Winget
Follow to track new versions in your feed.
Report

Overview

0
License: MITWinget: Available

Version & Lifecycle

0
Current: 1.0.0 N-2: 0.15.0

Top Contributors

Top sitewide contributors:

  1. Anbarasan
  2. nico_k
  3. Bob
  4. Vigneshwaran

Community Notes

No community notes yet

Be the first to as a good question or share deployment tips, customization scripts, command lines, or troubleshooting steps.

Release Notes & Updates

0
Avg cadence:
Updates • 0

Help us match vulnerabilities

No vulnerability match yet. Pick the right product:

Looking for matching products…
Don’t see it? Paste a CPE

Also known as

Other names people use for this app — helps search and matching.

BitLlamaimonoonoko BitLlama

Packaging Notes

0

Pure Rust implementation

Notes

0

Latest version: 0.16.0