feat(ruvLLM): P1 persistent trajectory sidecar + P4 wiki-corpus pretraining by Stricttype · Pull Request #414 · ruvnet/RuVector

Stricttype · 2026-05-02T18:42:20Z

Summary

Two independent foundation patches for ruvLLM (examples/ruvLLM/), shipped together. Both are non-default — gated behind opt-in feature flags so existing builds (incl. no_std/ESP32) are unaffected.

P1 — Persistent Trajectory Sidecar (feature: `persistence`)

Replaces the lossy in-memory crossbeam::ArrayQueue trajectory buffer with a durable SQLite-backed store. Fixes the SONA principle violation "no query is wasted" in src/sona/trajectory.rs.

src/sona/persist.rs — PersistentTrajectoryStore with bounded mpsc channel + background writer thread
WAL mode, fsync NORMAL, schema versioning, drop-policy on overflow
src/bin/sidecar.rs — minimal v1 stub (embedded thread; external-process upgrade is mechanical once IPC contract exists)
4 integration tests incl. p95 record-latency measurement (target <50µs under contention)

P4 — Wiki-Corpus Pretraining Pipeline (feature: `real-inference`)

Curated cold-start replacing random-init perplexity baseline.

src/corpus/{mod,wiki,tokenize}.rs — Simple-English-Wiki dump reader, sharder, HF-tokenizers wrapper
src/training.rs — DatasetSource trait, ModelCheckpoint serde, save_checkpoint / load_checkpoint, measure_baseline_perplexity
src/bin/pretrain.rs — rewired with --corpus CLI flag
scripts/fetch-simple-wiki.sh — pulls + extracts ~250MB dump
5 fixture-based pipeline tests

Disjoint by design

P1 touches src/sona/* + new sidecar binary. P4 touches src/corpus/* + src/training.rs + src/bin/pretrain.rs. Zero file overlap.

Smoke tests (all green)

cargo check (default, persistence, real-inference, persistence+real-inference)
cargo test --test persist_integration --features persistence
cargo test --test wiki_pipeline_test --features real-inference
cargo build --bin ruvllm-sidecar --features persistence
cargo build --bin ruvllm-pretrain --features real-inference

Known follow-ups (not in this PR)

P4.1: Trainer::train computes loss but does not call optimizer.step against weight matrices — perplexity-improvement acceptance criterion is structurally blocked until this lands. Pipeline correctness validated; quality target deferred.
P2 (KV-cache mixed-precision scheduler) and P3 (multi-LoRA unified-pool / S-LoRA) intentionally deferred — both need empirical trajectory + adapter-usage data that P1 will start collecting.

Test plan

CI green on default features
CI green on --features persistence
CI green on --features real-inference
No regression on ESP32/no_std build (persistence is gated)

🤖 Generated with Claude Code

- examples/ruvLLM/Cargo.lock - examples/ruvLLM/Cargo.toml - examples/ruvLLM/docs/api-reference.md - examples/ruvLLM/docs/code-standards.md - examples/ruvLLM/docs/codebase-summary.md - examples/ruvLLM/docs/configuration-guide.md - examples/ruvLLM/docs/deployment-guide.md - examples/ruvLLM/docs/handoffs/2026-05-02-1943-auto.md - examples/ruvLLM/docs/project-overview-pdr.md - examples/ruvLLM/docs/system-architecture.md - examples/ruvLLM/docs/testing-guide.md - examples/ruvLLM/learn/260502-1900-init-ruvllm/learn-results.tsv - examples/ruvLLM/learn/260502-1900-init-ruvllm/summary.md Co-Authored-By: Pi Coding Agent <pi@localhost>

- examples/ruvLLM/Cargo.lock - examples/ruvLLM/config/example.toml - examples/ruvLLM/config/pretrain.toml - examples/ruvLLM/scripts/fetch-simple-wiki.sh - examples/ruvLLM/src/bin/pretrain.rs - examples/ruvLLM/src/bin/sidecar.rs - examples/ruvLLM/src/config.rs - examples/ruvLLM/src/lib.rs - examples/ruvLLM/src/sona/mod.rs - examples/ruvLLM/src/sona/persist.rs - examples/ruvLLM/src/training.rs - examples/ruvLLM/tests/persist_integration.rs Co-Authored-By: Pi Coding Agent <pi@localhost>

- rename src/data/ → src/corpus/ (gitignore conflict: data/ pattern blocks Rust source) - add corpus module: wiki corpus iter, tokenizer wrapper, tokenized dataset - add tests/wiki_pipeline_test.rs (5/5 PASS) - surgical fixes for pre-existing candle 0.8 API drift in src/inference_real.rs - add From<candle_core::Error> shim in src/error.rs (unblocks --features real-inference) - extend src/training.rs: DatasetSource trait, ModelCheckpoint serde, save_checkpoint, measure_baseline_perplexity P4 status: DONE_WITH_CONCERNS — pre-existing issues surfaced: - Trainer::train computes loss but does not call optimizer.step (no backprop) → perplexity-delta is structurally 0% until follow-up patch - SmallTransformer lacks from_checkpoint() constructor → trained checkpoints are saved but to_q4_weights() re-randomizes; follow-up needed to load saved checkpoint into inference path - TokenizerWrapper::from_pretrained stubbed (requires tokenizers/http feature, not currently enabled); inline whitespace WordLevel fallback works for offline pilot Smoke tests: - cargo check: PASS - cargo check --features persistence: PASS - cargo check --features real-inference: PASS - cargo check --features persistence,real-inference: PASS - cargo test --features persistence --test persist_integration: 4/4 PASS - cargo test --features real-inference --test wiki_pipeline_test: 5/5 PASS

Crew Worker and others added 3 commits May 2, 2026 20:41

Stricttype mentioned this pull request May 2, 2026

fix(ruvLLM): P4.1 wire trainer backprop endpoint gradients #416

Open

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(ruvLLM): P1 persistent trajectory sidecar + P4 wiki-corpus pretraining#414

feat(ruvLLM): P1 persistent trajectory sidecar + P4 wiki-corpus pretraining#414
Stricttype wants to merge 3 commits intoruvnet:mainfrom
Stricttype:feat/ruvllm-p1-sidecar-p4-pretrain

Stricttype commented May 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Stricttype commented May 2, 2026

Summary

P1 — Persistent Trajectory Sidecar (feature: persistence)

P4 — Wiki-Corpus Pretraining Pipeline (feature: real-inference)

Disjoint by design

Smoke tests (all green)

Known follow-ups (not in this PR)

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

P1 — Persistent Trajectory Sidecar (feature: `persistence`)

P4 — Wiki-Corpus Pretraining Pipeline (feature: `real-inference`)