-
Notifications
You must be signed in to change notification settings - Fork 657
Error occurs when koboldCpp acts as a server and connects to OpenCode via OpenAI-compatible API #2087
Description
Description
When using koboldCpp as a server to connect to OpenCode through an OpenAI-compatible API, errors appear during usage. The system lags, performance is not fully utilized, and generation speed is reduced compared to using KoboldAI Lite. The function remains usable but significantly affects overall efficiency.
Steps to Reproduce
1. Start koboldCpp as an API server
2. Configure the OpenAI-compatible API endpoint in OpenCode
3. Send code generation or invocation requests
4. Errors appear with noticeable lag and slow generation speed
Error Message
Impact
AI model calls from OpenCode via koboldCpp experience lag and slow response, reducing work efficiency.
Temporary Solution
No effective temporary solution. Performance and API compatibility need to be investigated and optimized.
Notes
This issue is reproducible consistently. It does not break functionality but severely impacts generation speed and user experience.Describe the Issue**
A clear and detailed description of what the issue is, and how to duplicate it (if applicable).
Additional Information:
Please provide as much relevant information about your setup as possible, such as the Operating System, CPU, GPU, KoboldCpp Version, and relevant logs (helpful to include the launch params from the terminal output, flags and crash logs)