Issue/4249-finish-tinyagents-migration by senamakel · Pull Request #4399 · tinyhumansai/openhuman

senamakel · 2026-07-02T02:24:20Z

Summary

What changed and why.
Keep this to 3-6 bullets focused on user-visible or architecture-impacting changes.

Problem

What issue or risk this PR addresses.
Include context needed for reviewers to evaluate correctness quickly.

Solution

How the implementation solves the problem.
Note important design decisions and tradeoffs.

Submission Checklist

If a section does not apply to this change, mark the item as N/A with a one-line reason. Do not delete items.

Tests added or updated (happy path + at least one failure / edge case) per Testing Strategy
Diff coverage ≥ 80% — changed lines (Vitest + cargo-llvm-cov merged via diff-cover) meet the gate enforced by .github/workflows/pr-ci.yml. Run pnpm test:coverage and pnpm test:rust locally; PRs below 80% on changed lines will not merge.
Coverage matrix updated — added/removed/renamed feature rows in docs/TEST-COVERAGE-MATRIX.md reflect this change (or N/A: behaviour-only change)
All affected feature IDs from the matrix are listed in the PR description under ## Related
No new external network dependencies introduced (mock backend used per Testing Strategy)
Manual smoke checklist updated if this touches release-cut surfaces (docs/RELEASE-MANUAL-SMOKE.md)
Linked issue closed via Closes #NNN in the ## Related section

Impact

Runtime/platform impact (desktop/mobile/web/CLI), if any.
Performance, security, migration, or compatibility implications.

AI Authored PR Metadata (required for Codex/Linear PRs)

Keep this section for AI-authored PRs. For human-only PRs, mark each field N/A.

Linear Issue

Key:
URL:

Commit & Branch

Branch:
Commit SHA:

Validation Run

Validation Blocked

command:
error:
impact:

Behavior Changes

Intended behavior change:
User-visible effect:

Parity Contract

Legacy behavior preserved:
Guard/fallback/dispatch parity checks:

Duplicate / Superseded PR Handling

Duplicate PR(s):
Canonical PR:
Resolution (closed/superseded/updated):

coderabbitai · 2026-07-02T02:24:35Z

Important

Review skipped

Draft detected.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 07bc6a6b-da11-41bf-a98c-3cb2d3e542d1

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Adds DomainEvent::{WorkspacePrepared, WorkspaceViolation, WorkspaceCleanup} for the 08.5 worktree-isolation workstream, plus cargo fmt normalization across session_import and payload_summarizer. Claude-Session: https://claude.ai/code/session_01Frnx4CvLQBCGoDyT6FT6Sq

Wraps OpenHuman's EmbeddingProvider as tinyagents harness::embeddings::EmbeddingModel, bridging the &[String] vs &[&str] signature and anyhow->TinyAgentsError::Embedding error mapping. Preserves dimensions()/signature() fidelity. Not yet wired into the recall path (09.2); re-exported so it is part of the crate surface. Claude-Session: https://claude.ai/code/session_01Frnx4CvLQBCGoDyT6FT6Sq
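The signature bridge and error mapping described above can be sketched roughly as follows. This is a minimal illustration under simplifying assumptions: the traits are synchronous, and `LegacyEmbeddingProvider`, `EmbeddingModel`, `EmbedError`, `ProviderAdapter`, and `LengthProvider` are hypothetical stand-ins, not the real OpenHuman or tinyagents definitions.

```rust
/// Hypothetical stand-in for the host-side provider, which takes owned strings.
pub trait LegacyEmbeddingProvider {
    fn embed(&self, texts: &[String]) -> Result<Vec<Vec<f32>>, String>;
    fn dimensions(&self) -> usize;
}

/// Hypothetical stand-in for the crate-side error type.
#[derive(Debug, PartialEq)]
pub enum EmbedError {
    Embedding(String),
}

/// Hypothetical stand-in for the crate-side trait, which takes borrowed slices.
pub trait EmbeddingModel {
    fn embed(&self, texts: &[&str]) -> Result<Vec<Vec<f32>>, EmbedError>;
    fn dimensions(&self) -> usize;
}

/// Adapter that exposes a legacy provider through the crate-side trait.
pub struct ProviderAdapter<P>(pub P);

impl<P: LegacyEmbeddingProvider> EmbeddingModel for ProviderAdapter<P> {
    fn embed(&self, texts: &[&str]) -> Result<Vec<Vec<f32>>, EmbedError> {
        // Bridge &[&str] to &[String] by copying each input into an owned String.
        let owned: Vec<String> = texts.iter().map(|t| t.to_string()).collect();
        // Map the provider's error into the crate-side Embedding variant.
        self.0.embed(&owned).map_err(EmbedError::Embedding)
    }

    fn dimensions(&self) -> usize {
        // Pass through unchanged so the reported dimensionality stays faithful.
        self.0.dimensions()
    }
}

/// Toy provider used only to exercise the adapter: one dimension, the text length.
pub struct LengthProvider;

impl LegacyEmbeddingProvider for LengthProvider {
    fn embed(&self, texts: &[String]) -> Result<Vec<Vec<f32>>, String> {
        if texts.is_empty() {
            return Err("empty batch".to_string());
        }
        Ok(texts.iter().map(|t| vec![t.len() as f32]).collect())
    }

    fn dimensions(&self) -> usize {
        1
    }
}
```

The copy into owned strings costs one allocation per input; that seems an acceptable price for leaving both trait signatures untouched while the adapter is not yet on the recall path.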

Wire GitWorktreeIsolation prepare/cleanup and a new enforce_workspace_path helper to publish DomainEvent::{WorkspacePrepared, WorkspaceCleanup, WorkspaceViolation} onto the global bus for the security audit trail. Descriptor stays a carrier; SecurityPolicy/landlock remains the enforcement authority. worktree_context.rs deletion + acting-tool migration deferred. Claude-Session: https://claude.ai/code/session_01Frnx4CvLQBCGoDyT6FT6Sq

…ch (10.2) run_turn_via_tinyagents_shared now inspects registry.diagnostics() after harness assembly and aborts the turn before any model dispatch when an error-severity diagnostic (duplicate name / dangling alias) is present, via new AgentError::RegistryValidationFailed. Warnings are logged only. Existing hand-rolled dedup left intact (deletion deferred until parity). Claude-Session: https://claude.ai/code/session_01Frnx4CvLQBCGoDyT6FT6Sq
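The pre-dispatch gate can be pictured with a small sketch. It assumes a simplified diagnostic shape; `Severity`, `Diagnostic`, `gate_on_diagnostics`, and the payload of `RegistryValidationFailed` are hypothetical here and may differ from the real types.

```rust
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Severity {
    Warning,
    Error,
}

/// Hypothetical, simplified registry diagnostic.
#[derive(Debug, Clone, PartialEq, Eq)]
pub struct Diagnostic {
    pub severity: Severity,
    pub message: String,
}

/// Hypothetical error shape; the real variant's payload is not shown in the commit.
#[derive(Debug, PartialEq, Eq)]
pub enum AgentError {
    RegistryValidationFailed(Vec<String>),
}

/// Aborts when any error-severity diagnostic is present; otherwise returns the
/// warning messages so the caller can log them and continue to dispatch.
pub fn gate_on_diagnostics(diags: &[Diagnostic]) -> Result<Vec<String>, AgentError> {
    let (errors, warnings): (Vec<&Diagnostic>, Vec<&Diagnostic>) = diags
        .iter()
        .partition(|d| d.severity == Severity::Error);

    if !errors.is_empty() {
        let messages = errors.iter().map(|d| d.message.clone()).collect();
        return Err(AgentError::RegistryValidationFailed(messages));
    }
    Ok(warnings.iter().map(|d| d.message.clone()).collect())
}
```

Running the check after harness assembly but before any model call means a broken registry costs no tokens.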

New read-only RPC projecting the CapabilityRegistry inventory (models/tools/graphs/agents with ComponentMetadata) plus a Graphviz DOT export from durable descriptor sources reachable outside a turn. Additive sibling to agent.graph_topologies; the full per-agent tool surface and per-run-only kinds are documented deferrals in the response. Claude-Session: https://claude.ai/code/session_01Frnx4CvLQBCGoDyT6FT6Sq

…(06.1) UsageInfo now carries cache_creation_tokens/reasoning_tokens through the bridge into the persisted TokenUsage record instead of hardcoding 0. claude_code provider path populates real cache-creation tokens. Providers that do not report these keep 0. No public cost RPC shape change. Claude-Session: https://claude.ai/code/session_01Frnx4CvLQBCGoDyT6FT6Sq

…onMiddleware (03.1) Retain the ContextCompressionMiddleware handle on the assembled turn and drain records() after the run, logging per-compaction provenance (source ids, before/after token estimates, reason) under a grep-friendly [context] prefix. Additive; ToolOutputMiddleware Compressed contract untouched. Claude-Session: https://claude.ai/code/session_01Frnx4CvLQBCGoDyT6FT6Sq

…ity gating (02.1) New tinyagents/routes.rs projects the 7 router tiers (chat/reasoning/agentic/coding/burst/summarization/vision) into the crate ModelRegistry as per-route ProviderModel entries with real ModelProfile (vision/reasoning/context-window). set_default_model still points at the turn's effective model so dispatch is unchanged; this enables 02.2 fallback ordering. Adds RequiredCapabilitiesMiddleware stamping the turn CapabilitySet onto each ModelRequest so unfit models are rejected pre-dispatch (vision wired; tool/reasoning/BYOK signals documented as follow-ups). Route policy stays in router.rs/factory.rs. Claude-Session: https://claude.ai/code/session_01Frnx4CvLQBCGoDyT6FT6Sq
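As a rough picture of the capability check, the sketch below compares a turn's required capabilities against a model profile before dispatch. The field set, `satisfies`, and `check_model` are hypothetical simplifications; the real `CapabilitySet` and `ModelProfile` carry more than two flags.

```rust
/// Hypothetical, reduced capability set (the real one has more signals).
#[derive(Debug, Clone, Copy, Default, PartialEq, Eq)]
pub struct CapabilitySet {
    pub vision: bool,
    pub reasoning: bool,
}

impl CapabilitySet {
    /// True when every capability demanded by `required` is offered by `self`.
    pub fn satisfies(&self, required: &CapabilitySet) -> bool {
        (!required.vision || self.vision) && (!required.reasoning || self.reasoning)
    }
}

/// Hypothetical, reduced model profile.
pub struct ModelProfile {
    pub name: &'static str,
    pub caps: CapabilitySet,
}

/// Pre-dispatch check: reject a model that cannot serve the turn's requirements.
pub fn check_model(model: &ModelProfile, required: &CapabilitySet) -> Result<(), String> {
    if model.caps.satisfies(required) {
        Ok(())
    } else {
        Err(format!("model {} lacks required capabilities", model.name))
    }
}
```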

…uman overlays (02.4) unified_model_catalog() seeds from tinyagents ModelCatalog::seed(), overlays KNOWN_MODEL_PRICING rates/windows (source of truth, identical numbers), local runtime models, and pattern-window backfill. Model-picker RPC now sources local models from config. estimate_cost_usd/context_window stay on KNOWN_MODEL_PRICING to guarantee numeric identity; duplicate-table deletion deferred until a snapshot lookup is proven identical. No cost numbers changed. Claude-Session: https://claude.ai/code/session_01Frnx4CvLQBCGoDyT6FT6Sq

Adds PromptCacheSegmentMiddleware stamping content-fingerprinted system/tools PromptSegments, then PromptCacheGuardMiddleware with protect_prompt_prefix=true; CacheLayoutEvents surface as structured [cache] warnings alongside the retained CacheAlignMiddleware. Threads deterministic_cacheable through the shared runner to attach InMemoryResponseCache only for internal deterministic runs — all three production callers (chat/channel/subagent) set false, so interactive turns are never served cached responses. CacheHit/Miss counted in the bridge; cost-footer DTO wiring deferred to 06. Claude-Session: https://claude.ai/code/session_01Frnx4CvLQBCGoDyT6FT6Sq

…ent (09.2/09.3) New tinyagents/retriever.rs wraps Memory::recall as the swappable retrieval seam: projects entries to crate ScoredDoc, carries path_scope and applies the id-keyed dedupe rule, and emits AgentEvent::MemoryLoaded. memory_context.rs and memory_loader.rs load recall through the facade; CROSS_CHAT_HEADER, citation format, and collect_recall_citations output stay byte-identical (engine unchanged, adapter-first). Concrete crate Retriever exposed as the engine-swap seam. Embedding usage/cost (09.4) deferred to coordinate with 06. Claude-Session: https://claude.ai/code/session_01Frnx4CvLQBCGoDyT6FT6Sq
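An id-keyed dedupe over projected recall entries might look like the sketch below. The commit does not spell out the rule, so this assumes the first occurrence of each id wins and input order is preserved; `ScoredDoc` here is a hypothetical reduction of the crate type.

```rust
use std::collections::HashSet;

/// Hypothetical, reduced projection of a recall entry.
#[derive(Debug, Clone, PartialEq)]
pub struct ScoredDoc {
    pub id: String,
    pub score: f32,
    pub text: String,
}

/// Keeps the first document seen for each id and preserves input order.
/// (Assumption: the real rule may prefer a different duplicate.)
pub fn dedupe_by_id(docs: Vec<ScoredDoc>) -> Vec<ScoredDoc> {
    let mut seen = HashSet::new();
    docs.into_iter()
        // HashSet::insert returns false when the id was already present.
        .filter(|d| seen.insert(d.id.clone()))
        .collect()
}
```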

… (05.2) New agent_orchestration/subagent_events.rs centralizes construction + publish of DomainEvent::Subagent{Spawned,Completed,Failed,AwaitingUser} across 24 sites in 6 files. Event variants, field values, and ordering are byte-identical, so RunLedgerFinalizeSubscriber and UI consumers see no change; this is the single hook point for future ordering/rate-limiting/journal-mirroring (05.1). Claude-Session: https://claude.ai/code/session_01Frnx4CvLQBCGoDyT6FT6Sq

…rojections (10.1) assemble_turn_harness now registers Agent descriptors deduped from the runtime AgentDefinitionRegistry and agent_registry builtins so they appear in the snapshot/diagnostics streams, and exercises to_model_registry()/to_tool_registry() as validation projections with a [registry] count summary. ComponentMetadata description/tags persistence and register_agent (needs executable blueprint) are documented follow-ups; live register_model/register_tool glue left intact. Claude-Session: https://claude.ai/code/session_01Frnx4CvLQBCGoDyT6FT6Sq

…02.2) RunPolicy.fallback now carries an ordered same-family alternate chain from the 02.1 routes; FallbackObserverMiddleware emits AgentEvent::FallbackSelected with no extra dispatch; RetryScheduled surfaces (dormant while max_attempts pinned=1 to avoid double-retry with still-wrapped ReliableProvider). ProviderModel maps permanent/billing rejections to non-retryable via OpenHuman classifiers. Adapter-first: reliable.rs annotated for deletion in the 11-testing conformance pass. Claude-Session: https://claude.ai/code/session_01Frnx4CvLQBCGoDyT6FT6Sq

Tool-argument fragments now ride ModelStreamItem::ToolCallDelta into AgentProgress::ToolCallArgsDelta instead of ThinkingForwarder. Start event + tool_name stay on the forwarder (crate ToolDelta has no name field); a shared per-turn ToolNameMap labels streamed fragments so UI timeline parity holds. Removed emit_tool_args from ThinkingForwarder; its start marker + non-streaming reasoning fallback stay live. Child-run streaming preserved via scope-aware bridge. Ledger row updated. Claude-Session: https://claude.ai/code/session_01Frnx4CvLQBCGoDyT6FT6Sq
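One way to picture the shared per-turn name map: the start event records the tool name, and later argument fragments, which carry no name of their own, are labeled by lookup. The sketch assumes fragments are keyed by a call id; that key, and the `record_start` and `label` methods, are hypothetical.

```rust
use std::collections::HashMap;
use std::sync::{Arc, Mutex};

/// Hypothetical shared map from tool-call id to tool name, cloned per consumer.
#[derive(Clone, Default)]
pub struct ToolNameMap(Arc<Mutex<HashMap<String, String>>>);

impl ToolNameMap {
    /// Called when the start event (which does carry the name) is observed.
    pub fn record_start(&self, call_id: &str, tool_name: &str) {
        self.0
            .lock()
            .unwrap()
            .insert(call_id.to_string(), tool_name.to_string());
    }

    /// Labels a streamed argument fragment with the tool name, if known.
    pub fn label(&self, call_id: &str, fragment: &str) -> (Option<String>, String) {
        let name = self.0.lock().unwrap().get(call_id).cloned();
        (name, fragment.to_string())
    }
}
```

Because clones share the same underlying map, the forwarder that sees the start event and the bridge that sees the deltas can each hold their own handle.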

…uthority (07.3) Maps Redirect/Pause/Resume/Cancel to crate SteeringCommand via a new SteeringDirective, delivered only through the registered SteeringHandle with fail-closed policy checks. SteeringPolicy tightens by run class (background subagent runs accept control-flow steering without transcript injection; interactive keeps InjectMessage+Pause). Steered event projected under [steering]. Recursion: documents why spawn_depth_context stays a thin projector (cross-process MCP-hop depth + synchronous pre-dispatch surface); cap=3 and SpawnDepthExceeded wording unchanged. run_queue mechanics retained. Claude-Session: https://claude.ai/code/session_01Frnx4CvLQBCGoDyT6FT6Sq
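A fail-closed policy check by run class could be sketched as below. The commit states that background subagent runs accept control-flow steering without transcript injection and that interactive runs keep InjectMessage and Pause; how interactive runs treat the remaining commands is not stated, so this sketch denies them as an assumption. `RunClass`, `authorize`, and this local `SteeringCommand` enum are hypothetical.

```rust
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum RunClass {
    Interactive,
    BackgroundSubagent,
}

/// Hypothetical local mirror of the steering commands named in the commit.
#[derive(Debug, Clone, PartialEq, Eq)]
pub enum SteeringCommand {
    Redirect(String),
    Pause,
    Resume,
    Cancel,
    InjectMessage(String),
}

/// Fail-closed: anything not explicitly allowed is rejected, and nothing is
/// delivered when no steering handle is registered for the run.
pub fn authorize(
    class: RunClass,
    handle_registered: bool,
    cmd: &SteeringCommand,
) -> Result<(), &'static str> {
    if !handle_registered {
        return Err("no steering handle registered");
    }
    let allowed = match (class, cmd) {
        // Background runs: control flow only, no transcript injection.
        (RunClass::BackgroundSubagent, SteeringCommand::InjectMessage(_)) => false,
        (RunClass::BackgroundSubagent, _) => true,
        // Interactive runs: the two commands the commit names explicitly.
        (RunClass::Interactive, SteeringCommand::InjectMessage(_) | SteeringCommand::Pause) => true,
        // Assumption: everything else is denied for interactive runs in this sketch.
        (RunClass::Interactive, _) => false,
    };
    if allowed {
        Ok(())
    } else {
        Err("command not permitted for this run class")
    }
}
```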

…4.1) Behind OPENHUMAN_SESSION_DUAL_WRITE (default OFF), after a successful legacy JSONL transcript write, also append the turn to the slash-free store stream session.{stem}.messages and upsert the NS_SESSIONS descriptor, reusing the session_import convert normalization so live and imported records are shape-identical. Store writes are fire-and-forget and non-fatal; OFF-default behavior is byte-identical to today. Reads stay legacy (04.2). Factored shared open_session_stores() helper. StoreChatHistory adoption evaluated + deferred. Claude-Session: https://claude.ai/code/session_01Frnx4CvLQBCGoDyT6FT6Sq
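The flag-gated, non-fatal second write can be sketched as follows. The accepted flag values (`1` and `true`) are an assumption, as are the helper names `dual_write_enabled` and `write_turn`; the two writers are passed in as closures so the ordering is easy to see.

```rust
/// Parses the raw flag value. Assumption: only "1" and "true" (any case) enable it;
/// an unset or unrecognized value keeps the default OFF behavior.
pub fn dual_write_enabled(raw: Option<&str>) -> bool {
    matches!(
        raw.map(|v| v.trim().to_ascii_lowercase()).as_deref(),
        Some("1") | Some("true")
    )
}

/// Runs the legacy write first; only on success, and only when the flag is on,
/// attempts the store write. Returns whether the store write landed.
pub fn write_turn<L, S>(dual_write: bool, legacy: L, store: S) -> Result<bool, String>
where
    L: FnOnce() -> Result<(), String>,
    S: FnOnce() -> Result<(), String>,
{
    // A legacy failure is fatal and skips the store write entirely.
    legacy()?;
    if !dual_write {
        return Ok(false);
    }
    // A store failure is swallowed: the turn still counts as written.
    Ok(store().is_ok())
}
```

A caller would typically feed the parser from the environment, for example `dual_write_enabled(std::env::var("OPENHUMAN_SESSION_DUAL_WRITE").ok().as_deref())`.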

Attaches a second EventSink listener (FanOutSink -> RedactingSink -> JournalSink -> StoreEventJournal over the 04.1 JsonlAppendStore) alongside the unchanged OpenhumanEventBridge, plus a Store-backed FileStatusStore (crate ships only in-memory) writing running/completed/failed snapshots keyed by run_id with list_by_root/thread/active. RedactingSink masks credential-valued env secrets before persistence. Adds read_run_events/read_run_status replay-reader seam for a future replay RPC. Writes are best-effort/non-fatal. BEHAVIOR NOTE: the per-turn EventSink is now created unconditionally (was gated on on_progress/pause) so a run is reconstructable without subscribing at start; all journal/status I/O is non-fatal. Subscribers untouched; no deletions. Claude-Session: https://claude.ai/code/session_01Frnx4CvLQBCGoDyT6FT6Sq
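The redaction step in that sink chain can be illustrated with a wrapper that masks known secret values before handing the event to the next sink. This is a sketch under assumptions: events are plain strings, the mask text is `[REDACTED]`, and the `EventSink` trait, `VecSink`, and this `RedactingSink` are hypothetical simplifications of the real types.

```rust
/// Hypothetical, string-based event sink.
pub trait EventSink {
    fn emit(&mut self, line: &str);
}

/// In-memory sink standing in for the journal at the end of the chain.
#[derive(Default)]
pub struct VecSink(pub Vec<String>);

impl EventSink for VecSink {
    fn emit(&mut self, line: &str) {
        self.0.push(line.to_string());
    }
}

/// Wrapper that masks secret values before forwarding to the inner sink.
pub struct RedactingSink<S> {
    inner: S,
    secrets: Vec<String>,
}

impl<S: EventSink> RedactingSink<S> {
    pub fn new(inner: S, secrets: Vec<String>) -> Self {
        // Drop empty values: replacing "" would insert the mask between every character.
        let secrets = secrets.into_iter().filter(|s| !s.is_empty()).collect();
        Self { inner, secrets }
    }

    pub fn into_inner(self) -> S {
        self.inner
    }
}

impl<S: EventSink> EventSink for RedactingSink<S> {
    fn emit(&mut self, line: &str) {
        let mut masked = line.to_string();
        for secret in &self.secrets {
            masked = masked.replace(secret.as_str(), "[REDACTED]");
        }
        self.inner.emit(&masked);
    }
}
```

Placing the redactor ahead of the journal means the unmasked value never reaches persistence, which is the ordering the commit describes.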

…3/09.4) Adds run_id/root_run_id to TokenUsage (serde default + skip_serializing_if, stamped None pending run-tree threading; rollup swap deferred). ProviderEmbeddingModel.embed records best-effort embedding usage (provider/model/dims/vectors) priced via the unified catalog, zero-cost when no embedding rate exists. Non-fatal [cost][embed] recording; public cost DTOs backward-compatible. Claude-Session: https://claude.ai/code/session_01Frnx4CvLQBCGoDyT6FT6Sq

…perative cancel (07.2) Adds reconcile_orphaned_tasks_on_boot: scans the durable DetachedTaskStore for tasks left live by a prior process, settles them terminal (CancelRequested -> Cancelled, else Failed with an orphaned-by-restart reason), and emits the 05.2 terminal lifecycle event so the run ledger finalizes. Hooked in bootstrap_core_runtime next to run-ledger recovery. Flips the CancellationToken before abort in cancel_for_thread/cancel_all so cooperative cancel is uniform; terminal store write preserved. Best-effort/non-fatal; no deletions, no shrink. Claude-Session: https://claude.ai/code/session_01Frnx4CvLQBCGoDyT6FT6Sq
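The settle rule for orphaned tasks can be sketched over an in-memory list of statuses. The `TaskStatus` enum, the `reconcile_orphans` helper, and the exact reason string are hypothetical; the real implementation works against the durable store and also emits a lifecycle event per settled task.

```rust
/// Hypothetical, reduced task status.
#[derive(Debug, Clone, PartialEq, Eq)]
pub enum TaskStatus {
    Running,
    CancelRequested,
    Completed,
    Cancelled,
    Failed(String),
}

impl TaskStatus {
    fn is_terminal(&self) -> bool {
        matches!(self, Self::Completed | Self::Cancelled | Self::Failed(_))
    }
}

/// Settles every task left live by a prior process and returns how many changed.
pub fn reconcile_orphans(tasks: &mut [TaskStatus]) -> usize {
    let mut settled = 0;
    for status in tasks.iter_mut() {
        if status.is_terminal() {
            continue;
        }
        let next = match &*status {
            // A cancel was already requested, so honor it.
            TaskStatus::CancelRequested => TaskStatus::Cancelled,
            // Anything else that was still live cannot be resumed after restart.
            _ => TaskStatus::Failed("orphaned by restart".to_string()),
        };
        *status = next;
        settled += 1;
    }
    settled
}
```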

… review gate (08.3) Adds DelegationConfig::require_review_approval: the durable delegation graph emits NodeResult::Interrupt at the review approval point (persisted Sync via the existing SqlRunLedgerCheckpointer so the pause survives restart) and resumes via Command::resume, mapping the stable ApprovalDecision RPC wire strings (approve_once/approve_always_for_tool/deny) with deny overriding. run_delegation_durable/resume_delegation added; deny_decision() preserves TTL-deny. Interactive chat approval gate untouched (durable-vs-chat boundary documented). Workflow human-review + live approval-RPC delivery noted as follow-ups. Claude-Session: https://claude.ai/code/session_01Frnx4CvLQBCGoDyT6FT6Sq
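The wire-string mapping with deny taking precedence might be sketched like this. The three wire strings come from the commit; the local enum shape, `from_wire`, `should_resume`, and the choice to treat unknown strings or an empty decision list as a denial are assumptions.

```rust
/// Hypothetical local mirror of the approval decision.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum ApprovalDecision {
    ApproveOnce,
    ApproveAlwaysForTool,
    Deny,
}

impl ApprovalDecision {
    /// Maps the stable RPC wire strings; anything else is unrecognized.
    pub fn from_wire(s: &str) -> Option<Self> {
        match s {
            "approve_once" => Some(Self::ApproveOnce),
            "approve_always_for_tool" => Some(Self::ApproveAlwaysForTool),
            "deny" => Some(Self::Deny),
            _ => None,
        }
    }
}

/// Deny overrides: the run resumes only when every decision is an approval.
/// Assumption: unknown strings and an empty list are treated as a denial.
pub fn should_resume(wire: &[&str]) -> bool {
    !wire.is_empty()
        && wire.iter().all(|s| {
            matches!(
                ApprovalDecision::from_wire(s),
                Some(ApprovalDecision::ApproveOnce | ApprovalDecision::ApproveAlwaysForTool)
            )
        })
}
```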

senamakel added 29 commits July 1, 2026 19:30

refactor(tinyagents): stage spawn parallel workers in graph

9acaaa6

refactor(tinyagents): run spawn parallel graph from wrapper

e96abe4

docs(tinyagents): retarget migration plan to tinyagents 1.3.0 (verifi…

b4d0307

…ed crate delta) (tinyhumansai#4249) Claude-Session: https://claude.ai/code/session_01UCE4k5uj5FsjFzgZSvXHQy

refactor(tinyagents): type spawn parallel validation

af3c4a8

refactor(tinyagents): call map_reduce fanout directly

b6137bc

docs(tinyagents): record map_reduce fanout cutover status

72b94cb

refactor(tinyagents): enable sqlite baseline

3785c0a

refactor(tinyagents): persist detached subagent task ledger

a62c464

refactor(tinyagents): key detached task stores by workspace

e64f9b7

refactor(tinyagents): run subagent pipeline skeleton

9c24150

refactor(tinyagents): move fanout action root into graph

1de097e

refactor(tinyagents): run fanout graph skeleton

7edb96d

refactor(tinyagents): drive fanout phases on graph

77c6588

refactor(tinyagents): stage fanout dispatch on graph

c3fefd3

refactor(tinyagents): validate fanout limit on graph

2a6f7f0

refactor(tinyagents): move fanout context lookup into graph

75f7035

refactor(tinyagents): parse fanout requests in graph

8f772c0

docs(tinyagents): refresh baseline sdk gaps

9ad382b

refactor(tinyagents): classify tool adapters

7879675

refactor(tinyagents): use sdk unknown tool recovery

60097ba

refactor(tinyagents): install sdk tool policy snapshot

bf90a7a

refactor(tinyagents): read output caps from sdk policies

496ed39

refactor(tinyagents): compact tool output in middleware

14cd070

refactor(tinyagents): log exposure and compression events

f4c7788

refactor(tinyagents): persist oversized tool outputs

4067262

docs(tinyagents): refresh tooling migration status

4f6c2b1

refactor(tinyagents): honor tokenjuice-only middleware

6baeaeb

refactor(tinyagents): index tool result artifacts

ffe0dff

refactor(tinyagents): emit compression events for tool output

c1b488b

senamakel added 30 commits July 2, 2026 09:11

feat(tinyagents): carry workspace context through parent scope

62b86c7

feat(tinyagents): pass run cancellation into spawn parallel

a9963b1

refactor(subagents): move handoff helper out of ops

6325ea5

feat(tinyagents): preserve workspace context for session memory

84b1cca

feat(subagents): resolve steering sessions from task store

5115d50

feat(subagents): resolve session cancellation from task store

e5ec77d

feat(tinyagents): scope generated media to workspace descriptor

768f1f0

feat(tinyagents): scope codegraph to workspace descriptor

a0cc609

Merge remote-tracking branch 'upstream/main' into issue/4249-finish-t…

a86a423

…inyagents-migration # Conflicts: # src/openhuman/agent_orchestration/parent_context/mod.rs

senamakel wants to merge 269 commits into tinyhumansai:main from senamakel:issue/4249-finish-tinyagents-migration
