Semantic Search — Decisions

Mirrored from docs/design/orbit-search/4_decisions.md. Edit the source document in the repository, not this generated page.

ADR-style log of non-obvious orbit-search decisions. Each entry names the pressure, the choice, and the tradeoff. Entries are append-only and keyed by number; superseded entries are marked, not deleted.

Format for each entry: Status · Date · Task(s), then Context → Decision → Consequences. Every ADR names at least one cost. ADRs in this file carry status Proposed until the implementing task ships; they flip to Accepted with the implementing task ID at that point.

ADR-001 — fastembed-rs ONNX backend over Candle, llama.cpp, or external ollama

Status: Accepted · 2026-05 · [T20260510-3], [T20260510-9]

Context. Local embedding inference has four plausible backends:

Backend	Profile
fastembed-rs	Pure Rust crate wrapping ONNX Runtime; ships a small set of well-known sentence-embedding models (BGE, MiniLM, Nomic, mxbai); CPU-only fine; batch-friendly.
Candle	Pure-Rust ML framework from HuggingFace; broader model support; more code to integrate; less plug-and-play for embeddings specifically.
llama-cpp-rs	Bindings to llama.cpp; GGUF format; runs anything from tiny embedding models to large LLMs; optional GPU; C++ build dependency.
External ollama or similar always-on daemon	Outsources inference but requires the user to install and run a separate long-lived process.

This ADR addresses which backend to use. The orthogonal decision of how the backend is delivered to the user (in-process vs. companion binary vs. feature flag) is in ADR-005. Within in-process or in-companion options, fastembed-rs covers the embedding-model use case directly; Candle is more general but requires more Orbit-side code; llama-cpp-rs is overkill and adds a C++ build dependency that complicates Orbit’s release pipeline. An always-on ollama-style daemon contradicts Orbit’s no-daemon posture regardless of binary placement.

Decision. Phase 1 uses fastembed-rs as the inference backend, exposed through an Embedder trait that lives in a new orbit-embed library crate. Per ADR-005, fastembed-rs is linked into a separate orbit-embed-companion binary, not into the main orbit binary; the trait abstraction means an alternative backend can later swap in without touching orbit-store or orbit-tools. The user-facing default model is BGE-small-en-v1.5 (384 dim, ~30MB), with --model {bge-small | minilm-l6 | nomic-v1.5} selected at install time. Reject external always-on ollama: contradicts the no-daemon posture. Reject llama-cpp-rs: C++ build dependency outweighs its flexibility for embedding-only work. Reject Candle as default: more integration work for less out-of-the-box behavior; remains a viable trait-impl swap.

Consequences.

The Embedder trait isolates the choice of backend from storage and retrieval; later-arriving backends (Candle, code-tuned models) plug in without schema or query changes.
The fastembed-rs model catalog (BGE, MiniLM, Nomic, mxbai) is the menu phase-1 users pick from. Other model families require a new Embedder impl, not a config change.
Model output is well-characterized by published benchmarks (MTEB) so the default is defensible without an Orbit-specific eval (3_vision.md §1.1).
Cost: locking in to the fastembed-rs catalog means models outside that catalog (e.g., voyage-code, code-tuned models in 3_vision.md §1.7) need a different Embedder impl in a future task. The trait abstraction makes that mechanical, but it does mean the phase-1 menu is bounded by what fastembed-rs ships.

ADR-002 — Brute-force cosine over SQLite BLOBs; `sqlite-vec` reserved as phase-2 upgrade

Status: Accepted · 2026-05 · [T20260510-3], [T20260510-9]

Context. Vector storage and retrieval has three plausible shapes:

Brute-force cosine in Rust over SQLite BLOBs. No new dependency. Linear scan per query.
sqlite-vec loadable extension. HNSW-indexed nearest-neighbor inside SQLite. Same on-disk format as (1). Adds a runtime extension load.
Standalone vector DB (Qdrant, LanceDB, ChromaDB). Production-grade. Adds a binary dependency or sidecar.

At phase-1 scale — tasks-only, low thousands of artifacts × small number of fields per task = tens of thousands of vectors at 384d — brute-force cosine is sub-100ms on a modern laptop and zero new dependencies. sqlite-vec is the right answer once the corpus crosses ~100K vectors; that crossing happens with phase-2 graph integration, not phase 1. Standalone vector DBs are inappropriate for an embedded local tool.

A subtle point: the choice of embedding BLOB storage format in (1) is forward-compatible with sqlite-vec. Upgrading is a CREATE VIRTUAL TABLE plus an INSERT … SELECT, not a schema rewrite.

Decision. Phase 1 implements brute-force cosine in Rust over embeddings.embedding BLOBs. The schema preserves forward compatibility with sqlite-vec (same BLOB layout, same dim and model_id columns). Phase 2’s graph corpus revisits storage as a separate ADR; if sqlite-vec is the right call at that point, it’s an additive change, not a migration.

Consequences.

Zero new runtime dependencies in phase 1.
Schema and on-disk layout are stable across the phase-1/phase-2 boundary.
Query latency is acceptable until the corpus crosses ~100K vectors.
Cost: brute force scans every row every query. For a stable phase-1 corpus that’s fine, but it means we can’t ship “semantic search across the entire repository graph” without revisiting storage. The decision deliberately scopes phase 1 to where brute force is comfortable, and pays the upgrade cost later when there is operational evidence to size against.

ADR-003 — Per-field embeddings with chunked overflow, not whole-bundle concatenation

Status: Accepted · 2026-05 · [T20260510-3], [T20260510-9]

Context. A task bundle has structurally distinct fields (purpose, summary, plan, acceptance criteria, comments, review threads) of widely varying length. Two embedding strategies exist:

Concatenate everything into one document and embed once. Simplest; one row per task. Loses precision because a strong match in purpose is averaged with weak signal from twenty unrelated comments. Long bundles routinely exceed BGE-small’s 512-token context, forcing arbitrary truncation.
Per-field embeddings, with long fields chunked at paragraph boundaries. Multiple rows per task. Best-matching field surfaces in the result. Chunking handles the context-window limit cleanly.

The cost of per-field is mostly storage (~5–20× rows per task) and indexing CPU. At BGE-small’s 384d, even a generous 20 rows × 10K tasks = 200K rows × 1.5KB = 300MB. Fits comfortably in SQLite, comfortable for brute force at this scale.

Decision. Phase 1 indexes one row per (task_id, field, chunk_idx). Result formatting collapses multiple field hits on the same task to a single result with the highest-scoring field surfaced as the snippet. Long fields (plan.md, execution-summary.md) are split at paragraph boundaries with a target of 400 tokens per chunk and 50-token overlap.

Consequences.

Result snippets point to the actual field that matched, which makes the answer interpretable to users and agents.
Comments and review messages become independently findable, which directly addresses the “decisions buried in long threads” failure mode in 1_overview.md §1.
Schema’s field column carries the discriminator without a separate table.
Cost: 5–20× more rows per task, more storage, more indexing CPU. At phase-1 scale the cost is unproblematic; at much larger scales the per-field strategy may need revisiting alongside the storage upgrade in ADR-002.

ADR-004 — Hybrid retrieval (FTS5 BM25 + cosine, fused via RRF) from day one

Status: Accepted · 2026-05 · [T20260510-3], [T20260510-9]

Context. Three retrieval strategies were on the table:

Semantic only. Strong on vocabulary mismatch; weak on literal-identifier queries (function names, error codes, task IDs, file paths). Ignores SQLite’s already-shipped FTS5 BM25 capability.
Lexical only. The status quo without this design. Fast, free, well-understood. Cannot find tasks whose vocabulary doesn’t match the query.
Hybrid: BM25 + cosine, fused via Reciprocal Rank Fusion. Both retrievers run in parallel; ranks combine without score calibration. Published research consistently shows hybrid beats either alone across information-retrieval benchmarks.

The third option costs one extra SQL query per search and ~30 lines of fusion code. SQLite ships FTS5 with BM25 built in, so the lexical side is essentially free — the implementation is CREATE VIRTUAL TABLE tasks_fts USING fts5(...). Picking semantic-only would be a deliberate choice to fail on literal-identifier queries, which agents query frequently.

A weighted combination (e.g. 0.6 * cosine_score + 0.4 * bm25_score) was considered as an alternative fusion. Rejected because BM25 and cosine produce scores on incommensurable scales, weights become a tuning knob with no obvious right answer, and RRF demonstrates equal or better quality without the calibration burden.

Decision. Phase 1 ships hybrid retrieval. Both retrievers run on every search query. RRF (k=60) fuses the rankings. Score breakdown (bm25_rank, cosine_rank) is exposed in result payloads so consumers can detect which retriever drove a given hit. related (similar-task discovery) is cosine-only because lexical similarity adds noise for that use case.

Consequences.

Literal-identifier queries (task IDs, function names, file paths) match correctly.
Vocabulary-mismatch queries match correctly.
Score breakdown gives agents a real signal for confidence calibration without exposing raw incommensurable scores.
Cost: every search runs two SQL queries instead of one and computes one extra fusion pass. At phase-1 latency budgets (target <200ms p95) this is unproblematic, but it doubles the per-query work versus a single-retriever design and that overhead is paid even on queries where one retriever would have been enough.

ADR-005 — Companion binary installed on demand, rather than bundled in `orbit`

Status: Accepted · 2026-05 · [T20260510-3], [T20260510-9]

Context. Once fastembed-rs is the chosen backend (ADR-001), the question of where it lives matters. Linking ONNX Runtime + fastembed-rs into the main orbit binary adds ~50MB and pays that cost for every user — including users who never invoke semantic search. Three packaging shapes are plausible:

Option	Default install size	Opt-in mechanism	Inference latency
A. Bundled in `orbit`	Large (~50MB+)	None (always available)	In-process; instant after warm cache
B. Cargo feature flag, two release artifacts	Small or large depending on which artifact you download	Choose `orbit-full` at install time; replace the binary to swap	In-process; instant
C. Companion binary downloaded on demand	Small	`orbit semantic install [--model X]`	Subprocess; ~100–300ms ORT cold start, amortized across batches

Option A is what the design originally called “single binary install posture preserved.” It does preserve that, but it also means the always-pay binary cost is a permanent tax on users who don’t want semantic search. Option B requires users to swap their main binary, which is gross UX (in-flight processes, partially-applied upgrades, surprising behavior changes). Option C keeps the default install slim and gives the user explicit control over which model — and how much disk — they’re committing to, at the cost of subprocess overhead.

Decision. Phase 1 ships option C. Two new crates:

orbit-embed — small library holding the Embedder trait, JSON-RPC types, and SubprocessEmbedder (the trait impl that locates and talks to the companion). No fastembed-rs dependency. Linked into the main orbit binary.
orbit-embed-companion — binary crate. Depends on orbit-embed + fastembed-rs. Produces a standalone orbit-embed-companion binary distributed via GitHub Releases per platform.

orbit semantic install [--model bge-small | minilm-l6 | nomic-v1.5] downloads the platform-appropriate companion binary plus the chosen model files into ~/.orbit/embed/. Inference happens via stdio JSON-RPC; the subprocess is kept alive across a batch (reindex, multi-query session) and shut down at process exit. orbit semantic uninstall removes both the companion and the model. When semantic search is invoked without the companion installed, all read/write paths fail with a clear, actionable error pointing at orbit semantic install.

Consequences.

Default orbit install stays slim — no ORT, no fastembed-rs in the main binary. Users who don’t want semantic search pay no cost.
The model menu is exposed at install time, not as a runtime config knob the user has to discover. Users actively choose between MiniLM-L6 (smallest, ~23MB), BGE-small (default, ~30MB), and Nomic-v1.5 (largest, ~140MB) at the moment they’re committing to the feature.
The subprocess-RPC boundary makes the companion swappable: a future orbit-embed-companion-candle could reuse the same RPC protocol with a different inference engine.
Cost: install becomes a two-step user action (orbit install, then orbit semantic install). Users hitting orbit search without the companion installed need a clean, helpful error. The subprocess introduces ~100–300ms ORT cold-start latency per process; mitigated by reusing the subprocess across batches but still visible on first interactive query. Additionally, the companion binary requires a per-platform release pipeline (Linux x86_64, Linux arm64, macOS x86_64, macOS arm64, Windows x86_64), which is real release-engineering work for follow-up tasks.

ADR-006 — Workspace-local semantic DB separate from global audit/tool DB

Status: Accepted · 2026-05 · [T20260510-9]

Context. Orbit already has a global SQLite database at ~/.orbit/orbit.db for command audit, tool registry, and task-lock bookkeeping. Task bundles themselves are workspace-scoped under .orbit/tasks, and the scoping rules treat task data as workspace-only. Semantic rows are derived from task text, so putting embeddings in the global DB would create cross-project leakage and make stale-row accounting depend on which workspace happened to be active.

Decision. Store phase-1 semantic tables in a workspace-local SQLite database at .orbit/state/semantic.db. The semantic feature crate (orbit-embed, see ADR-007) opens and owns this file end-to-end: VectorStore::open(path) and VectorStore::open_in_memory() apply the WAL + busy_timeout pragmas, run ensure_vector_schema(conn) (CREATE TABLE IF NOT EXISTS for embeddings + tasks_fts), and return a VectorStore whose Arc<Mutex<Connection>> is the only handle into the database.

Consequences.

Task-derived vectors and FTS rows follow task scoping: one workspace cannot see another workspace’s semantic index.
orbit semantic index can rebuild only the active workspace without filtering a global table by workspace ID.
Tests use VectorStore::open_in_memory() directly — no orbit-store handle to plumb through.
semantic.db carries only the embeddings/FTS5 schema. Earlier phase-1 implementations co-located the generic orbit-store migration bundle in the same file (audit, tools, reservations, etc.); that collateral was removed when ADR-007 cut the orbit-embed → orbit-store dependency.

ADR-007 — Semantic-search ownership relocated to `orbit-embed`

Status: Accepted · 2026-05 · [T20260510-20]

Context. [T20260510-9] landed phase-1 with logic split across the wrong crate boundary:

The former orbit-store::vector module hosted VectorStore, EmbedWorker, the BLAKE3 dedup, the paragraph chunker, and the cosine helper. It imported orbit_embed::{Embedder, SubprocessEmbedder} directly, so the dep arrow was orbit-store → orbit-embed and orbit-store leaked knowledge of the embedding feature.
crates/orbit-core/src/command/semantic.rs (328 lines) hosted install / uninstall / reindex / stats logic plus HTTP companion download, model-dir wrangling, and companion version probe. None of that is OrbitRuntime orchestration.

Compare to the analogous knowledge-graph crate: orbit-knowledge owns its data + commands in commands/{search,show,overview,…}.rs and orbit-core/src/command/graph.rs is a six-line re-export. Phase-1 semantic search was three-crate-spread; the graph feature is one-crate-plus-thin-re-export.

Decision. Relocate orbit-search ownership into orbit-embed and make it self-contained:

Move the vector storage module to crates/orbit-embed/src/vector/mod.rs. orbit-store drops its dependency on orbit-embed. orbit-embed does not depend on orbit-store: it owns its own SQLite handle directly via rusqlite::Connection wrapped in Arc<Mutex<_>>, applies WAL + busy_timeout pragmas, and runs ensure_vector_schema on VectorStore::open(path) / VectorStore::open_in_memory(). This mirrors how orbit-knowledge owns graph_index.sqlite end-to-end without going through orbit-store.
Move the per-command logic to crates/orbit-embed/src/commands/{install,uninstall,reindex,stats}.rs. Each command file owns its *Params and *Result types and one public run function. crates/orbit-embed/src/commands/mod.rs aggregates the surface and holds shared helpers (parse_model, active_model, remove_file_if_exists, DEFAULT_RELEASE_BASE_URL).
Reduce crates/orbit-core/src/command/semantic.rs to a thin OrbitRuntime delegate (≤45 lines) that re-exports the param/result types and forwards each method to orbit_embed::commands::*::run. CLI ergonomics are preserved: orbit-cli still calls runtime.semantic_install(params) etc., and Execute impls are unchanged.

Consequences.

Crate dependency direction matches the graph feature exactly: orbit-embed is a near-leaf feature crate that depends only on orbit-common (and its workspace-standard libs: rusqlite, blake3, chrono, reqwest). orbit-store is storage-only; both crates are independent.
semantic.db carries only the embeddings and tasks_fts tables — no audit/tools/reservations/task_tags collateral. Pre-T-20 implementations had orbit-store::Store::open apply the full migration bundle to semantic.db as a side effect; that’s gone now.
orbit-embed gains reqwest (blocking) for the install download and inlines a small WAL helper. Neither violates ADR-005’s slim-client constraint: the prohibition is on linking ML inference (fastembed-rs / ONNX Runtime) into the main orbit binary; storage and HTTP client are fine. orbit-embed-companion remains the only crate that links fastembed.
The phase-2 graph corpus (per ADR-002, ADR-003) can land in orbit-embed::vector directly without crossing another crate boundary.
The phase-1 CLI surface is preserved exactly (install / uninstall / reindex / stats produce identical observable output). VectorStore::new(store) is replaced by VectorStore::open(path) and VectorStore::open_in_memory(). Only one in-tree call site (crates/orbit-core/src/runtime/builder.rs) is affected: it stops opening Store::open(&persistence.semantic_db) and instead calls VectorStore::open(&persistence.semantic_db) directly.
Cost: a small amount of rusqlite::Connection plumbing duplicates what orbit-store::Store does (WAL pragma helper, parent-dir creation, mutex wrapping). The duplication is small (≈30 lines) and isolates the semantic feature’s schema from migrations to other store domains, which is the whole point.

ADR-008 — Version-aware companion refresh and quiet background indexing

Status: Accepted · 2026-05 · [T20260510-26]

Context. The companion binary is installed outside the main orbit executable, so upgrading Orbit does not automatically replace an already-present ~/.orbit/embed/bin/orbit-embed-companion-<platform>. A stale companion can therefore keep old subprocess behavior after the main binary has moved on. The concrete failure was a stale companion writing execution failed: Broken pipe (os error 32) to stderr during best-effort background task indexing, after the durable task update had already succeeded. Direct semantic commands should still surface companion stderr because users explicitly invoked the semantic subsystem and need useful failure detail.

Decision. orbit semantic install probes an existing installed companion with --version-info and compares the returned version to the current Orbit package version. Missing, stale, unprobeable, or explicitly forced companions are replaced through a temporary sibling file before being moved into place; successful install output reports companion_changed. The CLI exposes --force for intentional replacement even when the probe says the companion is current. SubprocessEmbedder keeps inherited stderr as the default for direct semantic commands, while the background task-mutation worker uses a quiet spawn mode.

Consequences.

Re-running orbit semantic install after upgrading Orbit naturally refreshes stale companions without requiring users to uninstall first.
Task mutation output stays trustworthy: background indexing remains best-effort and cannot leak companion stderr into successful task.add / task.update command output.
Direct commands such as orbit search <query> --hybrid, orbit search similar <task-id>, and orbit semantic index still show actionable companion stderr because they use the inherited-stderr path.
Cost: install now trusts the companion’s --version-info protocol. If a broken companion cannot answer the probe, Orbit conservatively replaces it, which can redownload or recopy the binary even when the file might have been usable for embeddings.

ADR-0174 — Split lifecycle and query search namespaces

Status: Accepted · 2026-05-20 · [ORB-00196]

Context. orbit semantic mixed embedding-companion lifecycle (install, uninstall, stats, index) with user query verbs (search, related). The phase-1 search engine now owns both lexical and vector ranking, so leaving queries under semantic would make users choose an implementation detail before they search.

Decision. orbit semantic is only the lifecycle namespace for the local embedding companion. orbit search is the unified query surface; lexical ranking is the default, --hybrid opts into hybrid BM25 plus cosine for task vectors, and --semantic <id> performs cosine-neighbor lookup for indexed tasks.

Consequences.

Establishes a precedent that lifecycle namespaces manage local subsystems while query namespaces describe what users are trying to do.
orbit semantic search, orbit semantic related, and orbit semantic reindex are hard breaks with no shim because there are no known external consumers yet.
Per-domain search commands stay untouched for phase 1; a later task decides whether they thin-wrap orbit search, demote to filters, or retire.
At this phase, docs, learnings, and ADRs used lexical matching even when --hybrid was set; ADR-0180 later adds opt-in doc vectors while keeping learnings and ADRs lexical.

ADR-0175 — Rename search mode and neighbor flags

Status: Superseded by ADR-0179 · 2026-05-21 · [ORB-00204]

Context. Phase 1 used the semantic name for the hybrid BM25 plus cosine mode toggle and a separate related-task flag for cosine-neighbor lookup. That inverted the intuitive reading of semantic search: users expect semantic plus an ID to mean nearest neighbors, while hybrid is the honest name for the ranking algorithm.

Decision. Rename the free-text ranking toggle to --hybrid / hybrid: true and rename task-neighbor lookup to --semantic <id> / semantic: "<id>". Keep lexical search as the default and report JSON mode hybrid for hybrid free-text search and neighbor for cosine-only task-neighbor lookup.

Consequences.

The CLI and MCP surfaces match user vocabulary before external consumers depend on the phase-1 names.
Historical phase-1 audit payloads that carried semantic: true are orphaned by the hard break, matching the no-shim policy for this young surface.
Documentation and packaged skills must distinguish the orbit semantic lifecycle command from the MCP semantic: "<id>" search parameter. ADR-0179 replaces the CLI flag form with orbit search similar <id>.
Cost: Agents and docs written against phase 1 need a one-time rename sweep, and ORB-00202 may need a rebase because it edits adjacent search surfaces.
Cost: historical audit event names semantic.search and semantic.related become orphaned event types, accepted because no external audit-history consumers exist yet.

ADR-0176 — Consolidate per-domain search; cross-kind `--path` and `--tag` filters; learning list `--path` semantics flip

Status: Proposed · 2026-05-20 · [ORB-00202]

Context. After [ADR-0174] and [ADR-0175] consolidated orbit search as the unified query surface, the per-domain task, docs, and learning search subcommands of orbit became redundant for content-similarity queries. The learning variant in particular bundled three unrelated operations under one verb: substring search (content), path-glob applicability lookup (structural), and tag filter (structural). Agents pre-edit also need a single cross-kind command that answers “given this file path, what tasks / learnings / ADRs apply here?” — the context-pack query.

Decision. Hard-remove the per-domain task, docs, and learning search subcommands of orbit (CLI + MCP). Re-home their filters under the unified search surface: --tag <T> (AND semantics, repeatable, case-insensitive), --all (kind-aware status widener), --status (superseded by ADR-0179’s kind:value syntax), and path applicability lookup (superseded by ADR-0179’s orbit search path <path> CLI form; MCP keeps the path parameter). Add orbit task list --path; flip orbit learning list --path from exact-match to glob-containment. The old --include-superseded mental model from the retired per-domain doc surface is replaced by orbit search --kind adr --all. The structural-vs-content split — search for indexed content, list for structural filters — is enforced by the command layout.

Consequences.

One mental model: orbit search queries indexed content, orbit <kind> list filters structural metadata.
The agent context-pack query collapses to a single command (orbit search path <file> --kind all).
Universal --all / --status kind:value vocabulary replaces the patchwork of kind-specific flags (--include-superseded).
ORB-00203 fills the ADR filter branches by adding ADR tags and paths, without changing the public search surface.
Cost: the learning list --path semantics flip is the only observable behavior change. Scripts calling orbit learning list --path 'src/auth/**' expecting exact-match scoped lookups will now also see paths inside that glob. The migration target for ex-learning search --path callers is unchanged because the new semantics match what that deleted command already did.
Cost: during phase 2, ADR carried --tag and --path placeholders in two flag positions; ORB-00203 closes that gap by making those positions real filters.
Cost: AdrStatus has no Deprecated variant, so --all adds Superseded only on ADRs. Asymmetric with task widening (which gets multiple terminal states); revisited if a deprecated state ever becomes load-bearing.
Audit-row granularity is preserved by mapping --kind onto the subcommand field. Before consolidation, orbit task search / orbit docs search / orbit learning search produced distinct (command, subcommand) rows; after consolidation, orbit search --kind X produces (command="search", subcommand="<kind>") so downstream audit queries can still distinguish task / doc / learning / adr searches. Free-text content vs. structural lookup is not currently captured in the audit schema and is out of scope for this ADR.

ADR-0179 — Split `orbit search` modes and require per-kind statuses

Status: Accepted · 2026-05-21 · [ORB-00205]

Context. ADR-0175 corrected the search flag names after phase 1, but the resulting CLI still mixed a positional query with mode flags and allowed flat status tokens whose meaning changed by corpus kind. The real alternatives were to keep extending that single-command flag matrix, or split the user-facing CLI modes before more corpora grow vector support.

Decision. Use three explicit CLI forms: orbit search <query> for free-text search, orbit search similar <id> for cosine-neighbor lookup, and orbit search path <path> for applicability lookup. Require --status values to use kind:value tokens such as task:open, doc:active, and adr:proposed. Remove the CLI field-selection and embedding-model flags, and remove the parallel MCP field and embedding_model parameters while keeping MCP model only as provenance.

Consequences.

The CLI no longer has a top-level <query | --semantic | --path> trichotomy; each primary search operation has its own visible form.
Status filters are unambiguous across task, doc, learning, and ADR corpora.
MCP remains a parameterized tool surface, but it mirrors the reduced public parameter set and the same per-kind status parser.
Cost: similar and path become reserved words immediately after orbit search; searching those literal words requires passing a quoted/free-text query with additional context.
Cost: callers using the young mode flags, flat --status, the retired CLI field/model flags, MCP field, or MCP embedding_model surfaces must migrate with no compatibility shim.

ADR-0180 — Doc corpus embeddings use `docs index` and opt-in hybrid search

Status: Accepted · 2026-05-21 · [ORB-00206]

Context. Doc search was lexical-only after [ORB-00202] unified the query surface, while the orbit-search store already had a source_kind discriminator that could hold docs. The alternatives were to keep semantic ranking deferred, add a separate docs search verb, or reuse the existing vector store behind the unified orbit search --kind doc --hybrid path.

Decision. Use orbit docs index as the explicit admin verb that embeds configured docs roots into source_kind = "doc" rows, and keep retrieval opt-in through orbit search <query> --kind doc --hybrid. Lexical doc search remains the default, ADRs stay lifecycle-owned and lexical-only, and [docs.search].semantic_weight tunes the blend without adding another CLI flag.

Consequences.

orbit docs index shares the semantic companion, model catalog, and embeddings table with task vectors rather than creating a doc-specific store.
The search crate owns doc field extraction and stale-source sweeping, but does not depend on orbit-core; core passes a small DocEmbeddingSource.
Hybrid doc search falls back to lexical when the companion or doc rows are unavailable, preserving read-path ergonomics while making the admin indexing verb fail clearly.
Cost: docs now have a manual freshness loop separate from task mutation indexing. Background docs indexing remains a future task.

Task References

[T20260510-3] — Design semantic search over task artifacts and graph (v2). The task that produced this folder.
[T20260510-9] — Phase-1 semantic search foundation: orbit-embed + orbit-embed-companion + indexing pipeline. The task that accepted and implemented ADR-001 through ADR-006.
[T20260510-20] — Refactor: relocate orbit-search ownership to orbit-embed (vector store + commands). The task that accepted and implemented ADR-007.
[T20260510-26] — Make semantic companion install/update quiet and version-aware. The task that accepted and implemented ADR-008.
[ORB-00196] — Split orbit semantic lifecycle from the unified orbit search query surface. The task that accepted and implemented ADR-0174.
[ORB-00204] — Rename orbit search flags to --hybrid for free-text vector ranking and --semantic <id> for task-neighbor lookup. The task that accepted and implemented ADR-0175.
[ORB-00202] — Consolidate per-domain search subcommands and add cross-kind --path / --tag filters. The task that proposed and implemented ADR-0176.
[ORB-00203] — Add ADR envelope tags and paths so ADRs participate in cross-kind --tag / --path search filters.
[ORB-00205] — Split orbit search into query / similar / path forms and require per-kind --status syntax. The task that accepted and implemented ADR-0179.
[ORB-00206] — Add doc-corpus embeddings through orbit docs index and orbit search --kind doc --hybrid. The task that accepted and implemented ADR-0180.

Resolve any task above with orbit task show <ID> or git log --grep=<ID>.

Semantic Search — Decisions

ADR-001 — fastembed-rs ONNX backend over Candle, llama.cpp, or external ollama

ADR-002 — Brute-force cosine over SQLite BLOBs; sqlite-vec reserved as phase-2 upgrade

ADR-003 — Per-field embeddings with chunked overflow, not whole-bundle concatenation

ADR-004 — Hybrid retrieval (FTS5 BM25 + cosine, fused via RRF) from day one

ADR-005 — Companion binary installed on demand, rather than bundled in orbit

ADR-006 — Workspace-local semantic DB separate from global audit/tool DB

ADR-007 — Semantic-search ownership relocated to orbit-embed

ADR-008 — Version-aware companion refresh and quiet background indexing

ADR-0174 — Split lifecycle and query search namespaces

ADR-0175 — Rename search mode and neighbor flags

ADR-0176 — Consolidate per-domain search; cross-kind --path and --tag filters; learning list --path semantics flip

ADR-0179 — Split orbit search modes and require per-kind statuses

ADR-0180 — Doc corpus embeddings use docs index and opt-in hybrid search

Task References

ADR-002 — Brute-force cosine over SQLite BLOBs; `sqlite-vec` reserved as phase-2 upgrade

ADR-005 — Companion binary installed on demand, rather than bundled in `orbit`

ADR-007 — Semantic-search ownership relocated to `orbit-embed`

ADR-0176 — Consolidate per-domain search; cross-kind `--path` and `--tag` filters; learning list `--path` semantics flip

ADR-0179 — Split `orbit search` modes and require per-kind statuses

ADR-0180 — Doc corpus embeddings use `docs index` and opt-in hybrid search