Atlas AI

Neuroscience-inspired cognitive architecture with distributed compute

Live dashboard: atlas-sjsu.duckdns.org/schematic.html · Brand assets: erisml.org/brand/

Atlas AI is a production cognitive architecture that mirrors the vertebrate brain's cortical / subcortical split. High-level reasoning runs on frontier cloud LLMs (the cortex), fast pattern learning and procedural memory run on local GPUs (the subcortical brain), and the two tiers are coordinated by NATS JetStream acting as a global workspace (Baars, 1988). Persistent memory lives in PostgreSQL + pgvector.

A Freudian agent model — Ego, Id, and the Divine Council (acting as Superego) — negotiates decisions through structured debate. An autonomous learning loop (Erebus, the ARC Scientist) improves its own problem-solving strategies over time, and The Primer (inspired by Stephenson's Young Lady's Illustrated Primer) is an always-on Claude-style tutor that teaches Erebus through verified reference implementations written back to the wiki. A Unified Knowledge Graph now tracks both what has been taught and what is still asked-but-unanswered as first-class nodes, so the teaching loop is observable end-to-end.

Cognitive ensemble over one bus. Every Ego turn, every Council deliberation, and every Primer teaching cycle is a vMOE (virtual Mixture-of-Experts) call that fans out across heterogeneous voices — frontier LLMs on the NRP managed API for raw reasoning capacity, paired with small locally-fine-tuned specialists on Atlas for identity, memory grounding, and value alignment. Frontier cognition comes from NRP; the self comes from Atlas. The synthesizer picks or merges. This is what "higher brain in the cloud, subcortical voices at home" actually looks like in production.

One bus, many shapes

Every subsystem publishes on the same NATS fabric (agi.*). Cortex agents, subcortical GPU workers, memory tiers, and remote burst pods are all first-class subscribers — no separate dispatcher, no polling glue.

flowchart TB
  subgraph Ensemble["vMOE cognitive ensemble — one substrate across Ego, Council, Primer"]
    direction TB
    subgraph Frontier["Frontier cognition — NRP Envoy Gateway (ellm.nrp-nautilus.io/v1)"]
      direction LR
      Qwen["qwen3 397B<br/>FP8 · 262K ctx"]
      Kimi["kimi 1T<br/>MXFP4"]
      GLM["glm-4.7 358B<br/>FP8"]
      MM["minimax-m2 230B<br/>FP8"]
      GPTOSS["gpt-oss 120B<br/>MXFP4"]
      OLMO["olmo 32B"]
    end
    subgraph Local["Local specialists — Atlas GV100s (fine-tunable)"]
      direction LR
      Eth["ErisML voice<br/>7B QLoRA on<br/>2.4M ethical passages"]
      MemV["Memory-RAG voice<br/>7B + pgvector<br/>indexed via<br/>qwen3-embedding"]
      Pat["Pattern voice<br/>7B on Erebus's<br/>solved-code corpus"]
    end
    Frontier -. synthesis .-> Router(("vMOE router<br/>+ synthesizer"))
    Local -. synthesis .-> Router
  end
  NATS[("NATS JetStream — global workspace · agi.*")]
  subgraph Mem["Persistent memory — Atlas"]
    direction LR
    PG[("PostgreSQL<br/>+ pgvector")]
    Wiki[("Sensei wiki<br/>verified notes")]
    UKG[("Unified Knowledge Graph<br/>filled + gap nodes")]
  end
  Router <-->|"agi.ego / agi.council / agi.primer.*"| NATS
  Frontier <-->|"leaf link :7422 (TLS)"| NATS
  MemV -. retrieval .-> Mem
  Mem <--> NATS

Which subsystem handles what?

flowchart TD
  Q{"What kind of work?"}
  Q -->|"chat / tool-use"| Ego["Ego turn<br/>vMOE route by hint<br/>frontier + local specialists"]
  Q -->|"ethical / policy gate"| Eth["ErisML voice<br/>local 7B fine-tuned specialist"]
  Q -->|"high-stakes decision"| Council["Divine Council<br/>vMOE ensemble of heterogeneous<br/>NRP frontier models as advocates"]
  Q -->|"memory recall /<br/>episodic context"| MemV["Memory-RAG voice<br/>local 7B + live pgvector/UKG"]
  Q -->|"ARC / NeuroGolf puzzle"| Sci{"mentor note exists?"}
  Sci -->|"yes"| SciLoop["Erebus + sensei preamble<br/>strategy library + Thompson"]
  Sci -->|"no — stuck ≥10 attempts"| Prim["Primer vMOE<br/>verify-then-publish<br/>sensei note"]
  Q -->|"heavy GPU burst /<br/>parallel solver swarm"| Burst["nats-bursting →<br/>NRP Nautilus burst pool"]
  Q -->|"nightly consolidation"| Dream["Dream subsystem<br/>QLoRA specialist adapters<br/>on Atlas GV100s"]

Architecture at a glance

                         ╔═══════════════════════════════════════════════════╗
                         ║   CORTEX  —  NRP Envoy Gateway                     ║
                         ║   ellm.nrp-nautilus.io/v1 (shared, zero marginal)  ║
                         ╠═══════════════════════════════════════════════════╣
                         ║                                                    ║
                         ║   qwen3         397B  FP8  · 262K ctx             ║
                         ║   kimi          1T    MXFP4 (eval)                ║
                         ║   glm-4.7       358B  FP8  (eval)                 ║
                         ║   minimax-m2    230B  FP8                         ║
                         ║   gpt-oss       120B  MXFP4                       ║
                         ║   olmo          32B   bf16 (eval)                 ║
                         ║   qwen3-small   27B   bf16                        ║
                         ║   gemma         31B   bf16  · multimodal          ║
                         ║   gemma-small   8B    bf16  · audio               ║
                         ║   qwen3-embedding  8B  embeddings only            ║
                         ║                                                    ║
                         ║   Consumed as experts in every Ego turn, every    ║
                         ║   Council round, every Primer tick. Heterogeneous ║
                         ║   lineages (Qwen / Kimi / GLM / MiniMax / OpenAI  ║
                         ║   OSS / Allen AI / Gemma) = genuine multi-voice   ║
                         ║   deliberation, not N samples of one voice        ║
                         ║                                                    ║
                         ║   + Vision pods (GLM-4.1V on L40 / A10)           ║
                         ║   + Worker pools (nats-bursting on 33-node A10)   ║
                         ╚═══════════════════════╤════════════════════════════╝
                                                 │  NATS leaf :7422 (TLS)
                         ╔═══════════════════════╧════════════════════════════╗
                         ║   GLOBAL WORKSPACE  —  NATS JetStream :4222        ║
                         ║   Subjects: agi.*                                  ║
                         ╚═══════════════════════╤════════════════════════════╝
                                                 │
                         ╔═══════════════════════╧════════════════════════════╗
                         ║   vMOE router + synthesizer                        ║
                         ║   (agi.reasoning.vmoe — shared substrate)          ║
                         ║                                                    ║
                         ║   route by role hint → single expert call          ║
                         ║   ensemble on high-stakes → fan out, merge         ║
                         ║   cascade on failure → health-tracked fallback     ║
                         ╚═══════════════════════╤════════════════════════════╝
           ┌─────────────────────────────────────┼─────────────────────────────┐
           │                                     │                             │
  ╔════════╧══════════════════╗   ╔══════════════╧═══════════╗   ╔════════════╧══════╗
  ║ SUBCORTICAL SPECIALISTS   ║   ║ MEMORY  — PostgreSQL +   ║   ║  BRAINSTEM         ║
  ║ 2× Quadro GV100 32GB      ║   ║ pgvector · 3.3M vectors  ║   ║                    ║
  ║                           ║   ║                          ║   ║  Thermal guardian ║
  ║  ErisML voice (7B QLoRA   ║   ║  L0  Dream-consolidated  ║   ║  Watchdog         ║
  ║    on 2.4M ethical        ║   ║  L1  Sensei wiki         ║   ║  Telemetry :8085  ║
  ║    passages)              ║   ║      (Primer+human)      ║   ║  Caddy + OAuth2   ║
  ║  Memory-RAG voice (7B +   ║   ║  L2  PCA-384 IVFFlat     ║   ║  Backup (daily)   ║
  ║    live pgvector/UKG)     ║   ║  L3  tsvector full-text  ║   ║  atlas-target     ║
  ║  Pattern voice (7B on     ║   ║  L4  Episodic memory     ║   ║                    ║
  ║    Erebus solved-code)    ║   ║  L5  Unified Knowledge   ║   ║                    ║
  ║                           ║   ║      Graph (filled+gap)  ║   ║                    ║
  ║  + procedural memory      ║   ║                          ║   ║                    ║
  ║    (pattern nets + A*)    ║   ║                          ║   ║                    ║
  ╚═══════════════════════════╝   ╚══════════════════════════╝   ╚═══════════════════╝

Full systems view: docs/ARCHITECTURE_OVERVIEW.md.

The Freudian agents

Three psychoanalytic agents negotiate decisions through structured debate, all running over the same vMOE substrate. The Star Trek analogues show the functional mapping — not the reverse.

Agent	Role	How it's realized	Status
Ego (Kirk)	Balanced decision-maker — the self that speaks and learns	vMOE route per turn: frontier NRP model(s) for raw reasoning + local specialists for identity / memory / value alignment. Synthesizer merges.	Planned — next infra deliverable
Superego = Divine Council	Internalized normative deliberation. 7 advocates (Judge, Advocate, Synthesizer, Ethicist, Historian, Futurist, Pragmatist) debate in round-based fashion	vMOE ensemble with one NRP model per advocate — e.g. Judge=`glm-4.7` (formal), Advocate=`qwen3` (long-context rhetoric), Synthesizer=`minimax-m2` (integrative), Ethicist=Anthropic-proxy or `olmo` (different lineage), Historian=`kimi` (1T recall), Futurist=`gpt-oss` (speculative), Pragmatist=`qwen3-small` (fast, grounded). Genuine multi-lineage debate	Active today as `kimi`-only; heterogeneous advocate routing is the upgrade
Id	Fast, value-laden unconscious drive — the voice that reacts before deliberation	Local ErisML voice (7B QLoRA on the 2.4M-passage ethical corpus) on one GV100 + the basal-ganglia-style procedural memory (pattern nets + A*)	Planned — specialist adapter training is a dream-cycle workload

Why the Council is the Superego, not Spock: the internalized voice-of-norms is structurally multi-voice (parental figures, societal rules, ideals) — a single critic LLM was a compressed stand-in. The Council, especially with heterogeneous advocate models from different lineages, captures the actual shape of what Freud meant. The Gemma-on-GPU-0 role shrinks to "on-ramp helper / fallback," not the Superego proper.

Why the Ego isn't a single fine-tuned model: GV100s can't host frontier weights; the NRP managed API can't be fine-tuned; renting a 70B box gets a slightly-sub-frontier model with a very expensive personality transplant. The vMOE answer is cleaner — frontier capability from NRP, identity from local tunable specialists, combined at inference time. The "self" is a synthesis, which is also what Freud's Ego actually is.

Cognitive ensemble (vMOE)

One routing substrate, agi.reasoning.vmoe (promoted from agi.primer.vmoe once the Ego cutover lands), backs every cognitive act in the system:

Experts are heterogeneous. The pool mixes frontier NRP models (qwen3, kimi, glm-4.7, minimax-m2, gpt-oss, olmo, gemma) with small locally-fine-tuned specialists on the Atlas GV100s. The NRP side provides capacity we could never match locally; the Atlas side provides weights we actually own and can tune.
Orchestration policies are the same four Primer already uses. route(hint) picks one expert by role tag; cascade(hint) falls through on failure; ensemble(experts, verify) fans out in parallel and returns the first verified pass; first_verified cancels pending calls once a winner arrives. See docs/VMOE.md.
Health-tracked degradation. If glm-4.7 goes slow at 2 am, HealthTracker skips it for a 1 h cooldown and reroutes to qwen3 or minimax-m2. Same mechanism the Primer uses today.
Local specialists are fine-tuned in the dreaming window. QLoRA on the 2.4 M ErisML corpus produces the ErisML voice; QLoRA on Erebus's solved-task code produces the pattern voice; the memory-RAG voice is a small base with live retrieval over pgvector + UKG + sensei wiki. All three fit on the two GV100s with room to spare.

Retrieval subsystem (new possibility from NRP catalog): qwen3-embedding 8B is now offered as a managed embedding endpoint. This means the ErisML corpus can be embedded once, indexed into pgvector, and retrieval becomes a lookup — no local encoder to train. The "ErisML voice specialist" still earns its keep (value-laden generation, not just retrieval), but the memory-RAG layer becomes much cheaper to stand up.

Why this architecture and not a single big fine-tune:

Dimension	Rent 70B + fine-tune	vMOE ensemble
Frontier capability	no (70B is below frontier in 2026)	yes (1T kimi, 397B qwen3)
Fine-tunable identity	yes	yes (small specialists)
Capex	$4–30k box or $500–2500/mo rental	$0 (reuses GV100s)
Policy compliance on NRP	n/a (not on NRP)	clean (managed API is purpose-built for this)
Scales with model progress	upgrade = retrain 70B	upgrade = new model appears on ellm
Voice coherence	single voice, baked in	synthesis across voices
Implementation effort	~2 quarters	~weeks (vMOE substrate exists)

Autonomous learning

Erebus attempts ARC tasks continuously; when he gets stuck, he posts to a help queue; The Primer watches the queue, ensembles frontier models, verifies each candidate against task.train, and — only if some candidate passes — publishes a sensei_task_NNN.md back to the wiki. Erebus reads those notes as mentor preambles on his next attempt.

sequenceDiagram
  autonumber
  participant Sci as Erebus<br/>(ARC Scientist)
  participant Q as Help queue<br/>(erebus_help_queue.json)
  participant Pr as The Primer
  participant MoE as vMOE ensemble<br/>Kimi · GLM-4.7 · Qwen3
  participant Val as Validator<br/>(runs candidate vs task.train)
  participant Wiki as Sensei wiki
  participant UKG as Unified Knowledge Graph

  Sci->>Sci: attempt task 167 (≥10 times, still stuck)
  Sci->>Q: ask_for_help(task=167, error_types)
  Note over Pr: polling tick (default 10 min)
  Pr->>Q: read queue, import as gap nodes
  Pr->>UKG: upsert gap for task 167
  Pr->>MoE: ensemble(context = task + history + wiki)
  MoE-->>Pr: N candidate solutions (parallel)
  loop for each candidate
    Pr->>Val: validate(code, task.train)
  end
  alt any candidate passes all train examples
    Pr->>Wiki: write sensei_task_167.md (frontmatter verified_by)
    Pr->>Wiki: git commit and push (CI deploys)
    Pr->>UKG: promote gap to filled (created_at preserved)
    Sci->>Wiki: read as mentor preamble next attempt
  else none verify
    Pr->>Pr: cooldown 6h then try again
  end

Erebus (`src/agi/autonomous/arc_scientist.py`)

Closed-loop scientific reasoning against the NeuroGolf 2026 / ARC-AGI task set:

Observe — pick an unsolved task (tier-weighted by similarity to prior solves and near-miss exploitation).
Hypothesize — form a theory via LLM prompt strategy (direct / failure_aware / example_chain / diagnostic).
Experiment — generate candidate Python transform(grid) implementations.
Evaluate — verify against every training example; any failure = reject.
Learn — store what worked + what failed with structured error classification (reasoning / execution / perception / specification).
Reflect — LLM diagnoses why a transform failed; result becomes prompt context for the next attempt.
Adapt — Thompson sampling shifts strategy weights based on evidence.
Mentor preambles — task-specific sensei notes from the wiki (Primer-generated or human-written) are prepended to relevant prompts.

Live solve count: see schematic.html (the Erebus — NeuroGolf 2026 card).

The Primer (`src/agi/primer/`)

An always-on teaching daemon that meets Erebus at his current confusion and writes verified sensei notes. Details in docs/THE_PRIMER.md.

Watches the help queue for tasks where Erebus has ≥ 10 attempts and no solve.
Reads task JSON + attempt history + relevant wiki articles to assemble context.
Calls a vMOE ensemble — Kimi, GLM-4.7, Qwen3, MiniMax-M2 on NRP, Kirk-local as resilience fallback. See docs/VMOE.md.
Verifies each candidate's transform() against every training example. Only verified code is published — a wrong mentor note is worse than no note.
Publishes a sensei_task_NNN.md to the wiki, git-commits and pushes; CI deploys.
Tracks expert health with a rolling window — a persistently slow or erroring model is skipped for a cooldown so we don't burn tokens on foregone conclusions.
Emits one JSONL event per expert response (primer_events.jsonl) so the dashboard can show per-expert calls, verify pass/fail rate, and a latency histogram — signal survives daemon restarts.

Runs as atlas-primer.service.

Unified Knowledge Graph

A single append-only JSONL (/archive/neurogolf/knowledge_graph.jsonl) where filled sensei notes and gap open-questions are both first-class nodes. The dreaming and curiosity subsystems will traverse one graph rather than two separate stores.

stateDiagram-v2
  [*] --> gap: help_queue import (Erebus stuck)
  gap --> filled: Primer publishes verified note
  stub --> filled: placeholder promoted
  filled --> filled: Primer refines (newer verified_at)
  filled --> [*]
  note right of filled
    teaching-context gate
    type=filled AND verified=true
    AND status=active AND body exists
  end note

Node model (one per line, full-state snapshot):

Field	Example
`id`	`sensei_task_167`
`type`	`filled` / `gap` / `stub`
`status`	`active` / `archived`
`topic` / `topic_key`	`count distinct colors` / `count-distinct-colors`
`tags`	`[classification, count-distinct-colors, arc, primer]`
`body_ref`	`sensei_task_167.md` (null for gaps)
`verified` / `verified_at`	`true` / `1713574800`
`source`	`primer` / `help_queue` / `backfill` / `manual`
`evidence`	`["help:t167", "primer_task:167"]` (first-seen stable order)

The trust gate is_context_eligible(node) is the single place that decides whether a node is safe to show a generator as truth — filled ∧ verified ∧ active ∧ body-exists-on-disk. Everything else (gaps, unverified notes, archived nodes) stays visible to dashboards and future curiosity consumers but is never fed back as teaching context.

Retrieval mode is controlled by EREBUS_CONTEXT_READER=wiki|graph (env) so the cutover from wiki-glob to graph-query can be A/B-compared before committing. Default is wiki until graph-backed retrieval is validated in practice.

Infrastructure

Component	Description
NATS JetStream	Global workspace at `:4222`. Leaf node at `:7422` bridges to NRP.
PostgreSQL + pgvector	3.3 M PCA-384 vectors. 5-tier retrieval (L0–L4) + L5 UKG.
RAG server	Flask at `:8081` with dual-hemisphere proxy + hybrid search (BM25 + dense + HyDE + RRF).
Telemetry	At `:8085`. NATS stats, NRP pod metrics, live GPU / VRAM via `kubectl exec nvidia-smi`. Serves `/schematic.html` with a dynamic `ui:<sha> · <mtime>` version stamp.
Caddy	Reverse proxy + OAuth2. Serves `atlas-sjsu.duckdns.org`.
Thermal guardian	CPU temp monitoring (82 °C warn, 100 °C critical).
Watchdog	Health checks + automatic service restart.
Primer daemon	`atlas-primer.service` — always-on tutor; CPU-only.

NRP Nautilus integration

flowchart LR
  subgraph AtlasBox["Workstation — Atlas"]
    direction TB
    A1["Atlas services<br/>(agi.*)"]
    NH[("NATS hub :4222")]
    A1 -.-> NH
  end
  subgraph NRPBox["NRP Nautilus — namespace ssu-atlas-ai"]
    direction TB
    LP["NATS leaf pod<br/>(outbound dial only)"]
    MLLM["Managed LLM API<br/>ellm.nrp-nautilus.io/v1"]
    ANT["Anthropic-compat proxy<br/>.../anthropic"]
    BJ["Burst Jobs<br/>(vision pool · solver swarm)"]
    WP["Worker Pools<br/>(erebus-workers via nats-bursting)"]
    LP --- MLLM
    LP --- ANT
    LP --- BJ
    LP --- WP
  end
  NH <-->|"TLS over DuckDNS + NAT<br/>(leaf link :7422)"| LP

Managed LLM API (https://ellm.nrp-nautilus.io/v1): Kimi K2.5, Qwen 3.5 397B, GLM-4.7, MiniMax M2.7, Gemma 4 — shared, OpenAI-compatible, zero marginal cost.
Anthropic-compatible proxy (.../anthropic) for Claude Code tooling and the Primer.
Burst Jobs — ephemeral K8s Jobs (vision pool 4× L40/L40S/A10, GLM-4.1V; solver swarms for NeuroGolf). See deploy/k8s/.
Worker Pools — persistent K8s Deployments (erebus-workers, nats-bursting). See nats-bursting.
Live monitoring — dashboard shows pod count, GPU model, VRAM used/total, utilization. Worker pools and burst jobs appear side-by-side in the NRP Burst Jobs + Worker Pools card.
L40 reservation caveat — 15 of 17 L40 nodes carry csu-tide reservation taints; our namespace defaults to A10 × 8 (33 abundantly-available nodes) for self-hosted pods.

Safety

Three-layer architecture:

Layer	Latency	Function
Reflex	< 100 µs	Emergency stops, thermal limits
Tactical	10–100 ms	ErisML ethical evaluation, Bond Index
Strategic	> 100 ms	Policy enforcement, human oversight

ErisML provides mathematically grounded ethical reasoning with Hohfeldian analysis and SHA-256 hash-chained decision proofs.

Regression guards (CI/CD)

Workflow	Trigger	What it checks
Atlas AI — CI/CD (`ci.yaml`)	push to main	Black format, lint, tests, build, auto-deploy to Atlas (git pull + restart telemetry/rag).
Deploy Smoke (`deploy-smoke.yaml`)	push to main + 30-min cron	Compares live `/api/version` SHA to `main` HEAD. Asserts `NATS Topology`, `NRP Burst Jobs`, `NATS Live`, `Erebus Cognitive Architecture` panels are present on the rendered page.
Dashboard Render (`dashboard-render.yaml`)	dashboard path changes + 30-min cron	Playwright headless Chromium loads `schematic.html`, asserts topology SVG populates, burst table has rows, version stamp matches `ui:<sha> · <date>T<time>Z`, no console errors.

These exist because silent dashboard regressions (stale deploy, empty widget, JS error) previously hid under green Python CI. See docs/CHANGELOG.md for the post-mortem on the cp -f-through-symlink drift that motivated them.

Project layout

agi-hpc/
├── src/agi/
│   ├── autonomous/          # Erebus — ARC Scientist + ONNX scientist
│   ├── primer/              # The Primer — vMOE + validator + events + service
│   ├── knowledge/           # Unified Knowledge Graph (graph, backfill, gaps)
│   ├── core/                # gRPC, NATS/ZMQ/UCX, DHT, LLM providers
│   ├── reasoning/           # Divine Council, debate, NATS service
│   ├── lh/                  # Left hemisphere: planning, metacognition
│   ├── rh/                  # Right hemisphere: perception, world model
│   ├── memory/              # Episodic, semantic, procedural, knowledge
│   ├── safety/              # 3-layer safety, ErisML, privilege gates
│   ├── metacognition/       # Ego monitor, consistency, anomaly
│   ├── dreaming/            # Memory consolidation, QLoRA adapters
│   ├── training/            # Dungeon Master, gym environment, curriculum
│   ├── attention/           # Distractor detection and filtering
│   ├── thermal/             # Thermal management, job queue
│   ├── integration/         # Cross-subsystem orchestration
│   ├── env/                 # Gymnasium-compatible (MuJoCo / Unity)
│   └── meta/                # LLM-based metacognitive reflection
│
├── configs/                 # Service YAML
├── deploy/
│   ├── systemd/             # 20 service units under atlas.target (incl. atlas-primer)
│   └── k8s/                 # NRP pod manifests (erebus-ego PVC, vision burst)
├── proto/                   # Protocol Buffer definitions
├── infra/
│   ├── hpc/                 # Apptainer, Slurm, Docker
│   └── local/atlas-chat/    # Dashboard (schematic.html, erebus.html)
├── scripts/                 # Watchdog, telemetry, ukg_backfill_wiki, ukg_import_help_queue, …
├── tests/
│   ├── unit/                # Python unit tests (primer_*, ukg_*, dashboard_panels, …)
│   └── dashboard/           # Playwright render tests
├── wiki/                    # Sensei notes (Tier-1 RAG) — Primer writes here
├── docs/                    # Architecture, operations, design docs
└── .github/workflows/       # CI/CD pipeline

Hardware

Resource	Spec
Atlas	HP Z840, 2× Xeon E5-2690v3 (48 threads), 251 GB RAM
Local GPUs	2× Quadro GV100 32GB (Volta, no NVLink — separate CPU sockets)
Storage	15 TB at `/archive`
Network	Tailscale at `100.68.134.21`; public at `atlas-sjsu.duckdns.org`
NRP	L40 48 GB, L40S 48 GB, A10 24 GB, A40 48 GB across 100+ university nodes (A100/H100 gated behind reservation form)
NRP LLMs	Kimi K2.5 1T, Qwen 3.5 397B, GLM-4.7 358B, MiniMax M2.7, Gemma 4 (managed, zero marginal cost)

Quickstart

git clone https://github.com/ahb-sjsu/agi-hpc.git
cd agi-hpc
pip install -e ".[dev]"

# Local development (no network)
export AGI_FABRIC_MODE=local
python src/agi/lh/service.py

# Erebus — ARC Scientist (needs NRP token)
export AGI_FABRIC_MODE=nats
export NRP_LLM_TOKEN=<your-token>
python src/agi/autonomous/arc_scientist.py --task-dir /path/to/tasks

# The Primer (CPU-only)
export NRP_LLM_TOKEN=<your-token>
python -m agi.primer.service

# One-shot UKG backfill from the existing wiki
python scripts/ukg_backfill_wiki.py --verbose

Production deployment on Atlas: docs/ATLAS_OPERATIONS.md. Primer-specific: docs/THE_PRIMER.md.

Documentation

Doc	Topic
`docs/ARCHITECTURE_OVERVIEW.md`	Full systems view: agents, fabric, memory, NRP, CI/CD
`docs/THE_PRIMER.md`	The Primer — design, safety invariants, operation
`docs/VMOE.md`	Virtual Mixture-of-Experts — routing policies, health
`docs/ATLAS_OPERATIONS.md`	Atlas workstation ops: systemd, thermal, backup
`docs/DEPLOYMENT_RUNBOOK.md`	How to deploy, roll back, and perform common maintenance
`docs/ONCALL_PLAYBOOK.md`	"Something is broken, what do I look at"
`docs/SLOS_AND_KPIS.md`	Proposed SLOs, research KPIs, error budget policy
`docs/METRICS_INVENTORY.md`	What's measured / logged today, with endpoints and gaps
`docs/METRICS_CONTRIBUTOR_GUIDE.md`	Conventions for adding metrics, logs, endpoints
`docs/AGI_ROADMAP.md`	Phase status, pending work, long-horizon plan
`docs/UKG_ROLLOUT.md`	Runbook for cutting Primer retrieval over to the Unified Knowledge Graph
`docs/KNOWLEDGE_GAP_MAPPING_v1_spec.md`	v1 spec for dissatisfaction detection + gap aggregation (locked, not yet implemented)
`docs/CHANGELOG.md`	Per-date ship log

Citation

@software{bond_atlas_ai_2026,
  author = {Bond, Andrew H.},
  title  = {Atlas AI: Neuroscience-inspired cognitive architecture with distributed compute},
  year   = {2026},
  url    = {https://github.com/ahb-sjsu/agi-hpc}
}

Name		Name	Last commit message	Last commit date
Latest commit History 482 Commits
.claude/rules		.claude/rules
.github/workflows		.github/workflows
Lib/site-packages		Lib/site-packages
articles		articles
benchmarks		benchmarks
birdclef		birdclef
config/safety_rules		config/safety_rules
configs		configs
deploy		deploy
docs		docs
evals		evals
infra		infra
nemotron		nemotron
ops/jupyterhub		ops/jupyterhub
paper		paper
proto		proto
scripts		scripts
src/agi		src/agi
tests		tests
tools		tools
wiki		wiki
.env.example		.env.example
.gitignore		.gitignore
.taplo.toml		.taplo.toml
AGI-HPC-README.docx		AGI-HPC-README.docx
CONTRIBUTING.md		CONTRIBUTING.md
Caddyfile		Caddyfile
Democratically Governed Ethical Decision Modules for Autonomous Vehicles v2.pdf		Democratically Governed Ethical Decision Modules for Autonomous Vehicles v2.pdf
ERIS_AGENT.md		ERIS_AGENT.md
Full_AGI_HPC_Architecture.pdf		Full_AGI_HPC_Architecture.pdf
GLUE_ARCHITECTURE.md		GLUE_ARCHITECTURE.md
LICENSE		LICENSE
LICENSE.txt		LICENSE.txt
README.docx		README.docx
README.md		README.md
README2.md		README2.md
_sprint6_data.json		_sprint6_data.json
_sprint6_files.json		_sprint6_files.json
agi_hpc_architecture.svg		agi_hpc_architecture.svg
atlas-login.html		atlas-login.html
atlas-rag-indexer.py		atlas-rag-indexer.py
atlas-rag-server.py		atlas-rag-server.py
black		black
generate_protos.py		generate_protos.py
insert_header.py		insert_header.py
jira_crawl_pub.py		jira_crawl_pub.py
jira_probe_pub.py		jira_probe_pub.py
kaggle-agi-rules.txt		kaggle-agi-rules.txt
nexus_auto_slow_download.user.js		nexus_auto_slow_download.user.js
proto-patch.py		proto-patch.py
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
pyvenv.cfg		pyvenv.cfg
requirements.txt		requirements.txt
 2		 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Atlas AI

Neuroscience-inspired cognitive architecture with distributed compute

One bus, many shapes

Which subsystem handles what?

Architecture at a glance

The Freudian agents

Cognitive ensemble (vMOE)

Autonomous learning

Erebus (`src/agi/autonomous/arc_scientist.py`)

The Primer (`src/agi/primer/`)

Unified Knowledge Graph

Infrastructure

NRP Nautilus integration

Safety

Regression guards (CI/CD)

Project layout

Hardware

Quickstart

Documentation

Citation

License

About

Licenses found

Uh oh!

Releases 3

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Atlas AI

Neuroscience-inspired cognitive architecture with distributed compute

One bus, many shapes

Which subsystem handles what?

Architecture at a glance

The Freudian agents

Cognitive ensemble (vMOE)

Autonomous learning

Erebus (src/agi/autonomous/arc_scientist.py)

The Primer (src/agi/primer/)

Unified Knowledge Graph

Infrastructure

NRP Nautilus integration

Safety

Regression guards (CI/CD)

Project layout

Hardware

Quickstart

Documentation

Citation

License

About

Resources

License

Licenses found

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Erebus (`src/agi/autonomous/arc_scientist.py`)

The Primer (`src/agi/primer/`)

Packages