Benchmarks¶

Benchmarks are part of the routing story, not an afterthought.

The benchmark scripts live in scripts/benchmarks/. The MkDocs guide mirrors that directory with one page per Python benchmark script so the reported tables and current numbers are visible in the docs site instead of only in the repository README.

Pages¶

Translation overhead: translation_overhead.py
Relational direct-read benchmark: duckdb_direct_read.py
Cypher graph-path benchmark: cypher_graph_path.py
Vector single-run benchmark: vector_search.py
Vector sweep benchmark: vector_search_sweep.py
LanceDB tuning benchmark: vector_search_tune_lancedb.py

Current benchmark map¶

Translation overhead benchmark¶

python scripts/benchmarks/translation_overhead.py --warmup 100 --repetitions 1000

Current purpose:

isolate cached and uncached PostgreSQL-like SQL translation overhead
isolate Cypher parse and bind+compile overhead
keep frontend cost separate from backend execution cost when routing decisions are being discussed

Relational benchmark¶

HUMEMDB_THREADS=8 python scripts/benchmarks/duckdb_direct_read.py --rows 50000
HUMEMDB_THREADS=8 python scripts/benchmarks/duckdb_direct_read.py \
    --rows 10000000 --warmup 1 --repetitions 5 --batch-size 50000

Current takeaway:

SQLite stays stronger for point lookups and smaller filtered reads.
DuckDB wins broader grouped scans and analytical aggregates.

Graph benchmark¶

HUMEMDB_THREADS=8 python scripts/benchmarks/cypher_graph_path.py --nodes 5000 --fanout 3
HUMEMDB_THREADS=8 python scripts/benchmarks/cypher_graph_path.py \
    --nodes 1000000 --fanout 4 --tag-fanout 2 --warmup 1 --repetitions 5 --batch-size 20000

Current takeaway:

SQLite is very strong for selective graph traversal.
DuckDB becomes compelling only once the read broadens into graph-analytic shapes.

Vector benchmark¶

The vector benchmark scripts measure exact NumPy search and quantized variants so later routing choices can be based on observed crossover points instead of guesswork.

The single-run, sweep, and LanceDB-tuning vector pages are split out because they answer different questions:

vector_search.py measures one concrete scenario in depth.
vector_search_sweep.py finds crossover regions across row-count and dimension grids.
vector_search_tune_lancedb.py searches candidate indexed profiles that can still hit a recall target.