Prerequisites
Feature Description
In the server tool, I would like the stream returned by the /completion endpoint to return "prompt processing progress" similar to what is displayed in the server log.
slot update_slots: id 0 | task 0 | prompt processing progress, n_past = 19200, n_tokens = 256, progress = 0.827836
Motivation
This would allow me to show evaluation progress to the user for long completion processes.
Possible Implementation
No response