Skip to content

Error occurs when koboldCpp acts as a server and connects to OpenCode via OpenAI-compatible API #2087

@GarrisonDivision

Description

@GarrisonDivision

Description

When using koboldCpp as a server to connect to OpenCode through an OpenAI-compatible API, errors appear during usage. The system lags, performance is not fully utilized, and generation speed is reduced compared to using KoboldAI Lite. The function remains usable but significantly affects overall efficiency.

Steps to Reproduce

1. Start koboldCpp as an API server
2. Configure the OpenAI-compatible API endpoint in OpenCode
3. Send code generation or invocation requests
4. Errors appear with noticeable lag and slow generation speed

Error Message

q8日志.txt

 

Impact

AI model calls from OpenCode via koboldCpp experience lag and slow response, reducing work efficiency.

Temporary Solution

No effective temporary solution. Performance and API compatibility need to be investigated and optimized.

Notes

This issue is reproducible consistently. It does not break functionality but severely impacts generation speed and user experience.Describe the Issue**
A clear and detailed description of what the issue is, and how to duplicate it (if applicable).

Additional Information:
Please provide as much relevant information about your setup as possible, such as the Operating System, CPU, GPU, KoboldCpp Version, and relevant logs (helpful to include the launch params from the terminal output, flags and crash logs)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions