Keyboard shortcuts

Press or to navigate between chapters

Press S or / to search in the book

Press ? to show this help

Press Esc to hide this help

3. Four data collectors

Date: 2026-04-22

Status

Accepted

Context

Cerebro aggregates signals from development activity to generate context. We need to decide which data sources to collect from.

Decision

We will collect from four sources:

SourceImplementationData
OpenCodeSQLite via opencode_db_pathSession history
Gitgit2 crateCommits, modified files
TODOsRegex scan on source filesTODO/FIXME/HACK/XXX comments
Manual Notesnotes/projects/{name}.mdStatus, journal, intent

Consequences

Pros

  • Comprehensive signal aggregation
  • Each collector is independent
  • Manual notes provide human context

Cons

  • Git collection can be slow on large repos
  • TODO regex may false-positive on strings

Notes

Collectors are orchestrated in collectors/mod.rs. Each is async and can run in parallel.