fix(realtime): Better support for thinking models and setting model parameters#8595

Merged
mudler merged 4 commits into mudler:master from richiejp:fix/realtime-functions2
Feb 18, 2026
Conversation

@richiejp
Collaborator

  • fix(realtime): Wrap functions in OpenAI chat completions format
  • feat(realtime): Set max tokens from session object
  • fix(realtime): Find thinking start tag for thinking extraction
  • fix(realtime): Don't send buffer cleared message when we automatically drop it

Description

Various fixes for realtime mode, made while testing with a thinking model that uses llama.cpp's embedded tokenizer template.

Notes for Reviewers

Signed commits

  • Yes, I signed my commits.

Signed-off-by: Richard Palethorpe <io@richiejp.com>
@netlify

netlify bot commented Feb 18, 2026

Deploy Preview for localai ready!

🔨 Latest commit fd2b0e5
🔍 Latest deploy log https://app.netlify.com/projects/localai/deploys/69958085c8092f0008a947ab
😎 Deploy Preview https://deploy-preview-8595--localai.netlify.app

@mudler mudler enabled auto-merge (squash) February 18, 2026 09:32
@mudler mudler disabled auto-merge February 18, 2026 13:36
@mudler mudler added the bug Something isn't working label Feb 18, 2026
@mudler mudler merged commit 86b3bc9 into mudler:master Feb 18, 2026
38 checks passed
localai-bot pushed a commit to localai-bot/LocalAI that referenced this pull request Mar 25, 2026
…arameters (mudler#8595)

* fix(realtime): Wrap functions in OpenAI chat completions format

Signed-off-by: Richard Palethorpe <io@richiejp.com>

* feat(realtime): Set max tokens from session object

Signed-off-by: Richard Palethorpe <io@richiejp.com>

* fix(realtime): Find thinking start tag for thinking extraction

Signed-off-by: Richard Palethorpe <io@richiejp.com>

* fix(realtime): Don't send buffer cleared message when we automatically drop it

Signed-off-by: Richard Palethorpe <io@richiejp.com>

---------

Signed-off-by: Richard Palethorpe <io@richiejp.com>

Labels

bug Something isn't working

2 participants