feat: implement persistent job queue with bbolt and maintenance worker #375

Hell1213 · 2026-01-06T06:26:30Z

The current job queue was volatile and jobs would be lost on backend restarts. This implements a bbolt-based persistent queue that stores jobs to disk and restores them on startup. Also added a cron-based maintenance worker that automatically cleans up old completed and failed job logs to prevent unbounded disk usage. All existing functionality is preserved and the new features are fully configurable via environment variables.

Description

Added persistent job queue using bbolt database to store jobs on disk. Jobs now survive backend restarts and are automatically restored on startup. Implemented maintenance worker with cron scheduler to clean up old job logs and prevent disk usage issues.

Fixes: Persistent Job Queue & Maintenance Worker for backend #367

Checklist

Ran npx prettier --write . (for formatting)
Ran gofmt -w . (for Go backend)
Ran npm test (for JS/TS testing)
Added unit tests, if applicable
Verified all tests pass
Updated documentation, if needed

Terminal Screenshot

Additional Notes

New environment variables added for configuration:

CLEANUP_CRON_SCHEDULE - Schedule for maintenance worker (default: daily at midnight)
CLEANUP_RETENTION_DAYS - How long to keep job logs (default: 7 days)
QUEUE_DB_PATH - Database file location (default: /app/data/queue.db)

The implementation gracefully falls back to in-memory queue if persistent storage fails, ensuring no breaking changes.

github-actions · 2026-01-06T06:26:39Z

Thank you for opening this PR!

Before a maintainer takes a look, it would be really helpful if you could walk through your changes using GitHub's review tools.

Please take a moment to:

Check the "Files changed" tab
Leave comments on any lines for functions, comments, etc. that are important, non-obvious, or may need attention
Clarify decisions you made or areas you might be unsure about and/or any future updates being considered.
Finally, submit all the comments!

More information on how to conduct a self review:
https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/reviewing-changes-in-pull-requests/reviewing-proposed-changes-in-a-pull-request

This helps make the review process smoother and gives us a clearer understanding of your thought process.

Once you've added your self-review, we'll continue from our side. Thank you!

backend/README.md

backend/controllers/controllers_test.go

backend/controllers/job_queue.go

Hell1213 · 2026-01-06T06:38:30Z

backend/go.mod

added required dependencies for bbolt and cron

Hell1213 · 2026-01-06T06:38:38Z

backend/go.sum

added required dependencies for bbolt and cron

backend/main.go

backend/utils/maintenance_worker.go

backend/utils/persistent_queue.go

backend/utils/persistent_queue_test.go

Hell1213

This PR solves the job persistence issue cleanly. Jobs were getting lost on backend restarts, now they're saved to disk with bbolt and restored automatically. Added a maintenance worker to clean up old logs so disk doesn't fill up. Everything is configurable and backward compatible - existing code keeps working even if the database fails. Tested live and confirmed jobs survive restarts.

Hell1213 · 2026-01-06T19:04:54Z

hey @its-me-abhishek , PR is ready for Review , open to any further changes required.

its-me-abhishek · 2026-01-07T17:05:57Z

@Hell1213, the implementation looks great. will review it once locally first and then here.

backend/go.mod

backend/utils/maintenance_worker.go

Hell1213 · 2026-01-12T15:53:30Z

Hey @its-me-abhishek, should we keep Go 1.23 or downgrade bbolt to v1.3.7 (works with Go 1.19)? The Go 1.23 bump is because bbolt v1.4.3 requires it - the link you mentioned in the issue points to use bbolt v1.4.3.

The current job queue was volatile and jobs would be lost on backend restarts. This implements a bbolt-based persistent queue that stores jobs to disk and restores them on startup. Also added a cron-based maintenance worker that automatically cleans up old completed and failed job logs to prevent unbounded disk usage. All existing functionality is preserved and the new features are fully configurable via environment variables. Resolves CCExtractor#367

Hell1213 · 2026-01-15T06:29:04Z

hey @its-me-abhishek ,
I have downgraded bbolt to v1.3.7 to keep Go 1.19 consistent with the main app. Also added cron schedule comments,
pls take a look when you got time .

backend/README.md

backend/utils/persistent_queue.go

Hell1213 · 2026-01-17T19:09:43Z

hey @its-me-abhishek ,pls check the pr ,its ready I have implemented those changes as requested

its-me-abhishek · 2026-01-17T19:17:05Z

@Hell1213 there seems to be some misunderstanding, please do add those updated creds to
production/example.backend.env and the related Docs to production/README along with their default values. Since that is the ideal way to run the backend.

Additionally, just checked that this PR will probably require to update the Kubernetes config, as well, in order to keep that working

Hell1213 · 2026-01-17T19:21:29Z

@Hell1213 there seems to be some misunderstanding, please do add those updated creds to production/example.backend.env and the related Docs to production/README along with their default values. Since that is the ideal way to run the backend.

Additionally, just checked that this PR will probably require to update the Kubernetes config, as well, in order to keep that working

thanks , apologies for misunderstanding ,I'm on it will make those changes as told .

Fixed job restoration to prevent duplicates and added queue environment variables to production files for Docker/Kubernetes deployments.