Skip to content

chore(model gallery): Add npc-llm-3-8b#8498

Merged
mudler merged 1 commit intomudler:masterfrom
rampa3:add-npc-llm-3-8b
Feb 12, 2026
Merged

chore(model gallery): Add npc-llm-3-8b#8498
mudler merged 1 commit intomudler:masterfrom
rampa3:add-npc-llm-3-8b

Conversation

@rampa3
Copy link
Copy Markdown
Contributor

@rampa3 rampa3 commented Feb 10, 2026

Description

This PR adds npc-llm-3-8b in Q4_K_M GGUF quantization to the model gallery, based upon an older request in LocalAI Discord.

This PR is one from a set of model additions I am working on currently, where I attempt to satisfy all not yet satisfied model requests I have found, which don't require any backend side changes (such as need for moderation endpoint for Qwen 3 Guard (requested on LocalAI Discord), or what would adding Mochi 1 (requested on LocalAI Discord) entail).

Notes for Reviewers

⚠️ This model is explicitly specialized to generate actions for game NPC characters based on specially formatted input prompt, and is not useful for general purpose. If you feel we don't need such model in the gallery, please close the PR without merging. ⚠️

Tests done on the model before PR:

  • pulling model from LocalAI's webUI
  • acting upon example input prompt from model card

Signed commits

  • Yes, I signed my commits.

Signed-off-by: rampa3 <68955305+rampa3@users.noreply.github.com>
@netlify
Copy link
Copy Markdown

netlify bot commented Feb 10, 2026

Deploy Preview for localai ready!

Name Link
🔨 Latest commit 783672a
🔍 Latest deploy log https://app.netlify.com/projects/localai/deploys/698b5da17a6c500008e5fce5
😎 Deploy Preview https://deploy-preview-8498--localai.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

- filename: "phi-2-orange.Q4_0.gguf"
sha256: "49cb710ae688e1b19b1b299087fa40765a0cd677e3afcc45e5f7ef6750975dcf"
uri: "huggingface://TheBloke/phi-2-orange-GGUF/phi-2-orange.Q4_0.gguf"
- url: "github:mudler/LocalAI/gallery/phi-3-chat.yaml@master"
Copy link
Copy Markdown
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

is this really using the phi-3? from the description didn't looked like. But we can test once in the gallery

Copy link
Copy Markdown
Contributor Author

@rampa3 rampa3 Feb 12, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

According to the model card on the safetensors version, it is trained Phi-3. I tested it using this setup, and it seemed to output what they shown in examples on the original. This model is a weird one...

Note: The template the model has on Hugging Face is much simpler than the Phi-3 one I used, but I guess it is expectable for a specially trained model to have just basic minimal one, no?

@mudler mudler merged commit 2ab6be1 into mudler:master Feb 12, 2026
36 of 38 checks passed
@rampa3 rampa3 deleted the add-npc-llm-3-8b branch February 12, 2026 17:13
localai-bot pushed a commit to localai-bot/LocalAI that referenced this pull request Mar 25, 2026
Signed-off-by: rampa3 <68955305+rampa3@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants