Examples Overview¶
Hands-on examples demonstrating ArcadeDB Python bindings in real-world scenarios. Each example is self-contained, well-documented, and ready to run.
DSL-first examples
Current examples and docs use SQL/OpenCypher as the default approach for schema, CRUD, and graph operations.
Available Examples¶
🏁 Getting Started¶
Dataset Downloader Download and prepare datasets used by the examples (MovieLens, Stack Exchange, MSMARCO).
01 - Simple Document Store Foundation example covering document types, CRUD operations, comprehensive data types (DATE, DATETIME, DECIMAL, FLOAT, INTEGER, STRING, BOOLEAN, LIST OF STRING), and NULL value handling (INSERT NULL, UPDATE to NULL, IS NULL queries).
02 - Social Network Graph Complete graph modeling with vertices, edges, NULL handling, and dual query languages (SQL MATCH vs Cypher). Demonstrates 8 people with optional fields, 24 bidirectional edges, graph traversal, and comprehensive queries.
03 - Vector Search Semantic similarity search with HNSW (JVector) indexing. Demonstrates vector storage, index creation, and nearest neighbor search.
04 - CSV Import - Documents Production CSV import with automatic type inference by Java, NULL handling, and index optimization. Imports MovieLens dataset (36M+ records) with comprehensive performance analysis and result validation with actual data samples.
05 - CSV Import - Graph Production graph creation from MovieLens dataset. Comprehensive performance analysis of SQL pipelines, synchronous vs async, and index effects. Includes benchmark configurations, validation queries, and export/import roundtrip testing.
06 - Vector Search - Movie Recommendations Production-ready vector embeddings and HNSW (JVector) indexing for semantic movie search.
07 - Stack Overflow Tables (OLTP) Table-oriented OLTP benchmark with mixed CRUD operations and deterministic single-thread verification.
08 - Stack Overflow Tables (OLAP) Table-oriented OLAP benchmark with fixed analytical queries, load/index timing, and repeated query runs.
09 - Stack Overflow Graph (OLTP) Graph OLTP benchmark with directed-edge semantics, result verification notes, and cross-backend workload comparison.
10 - Stack Overflow Graph (OLAP) Graph OLAP benchmark using a fixed OpenCypher query suite across multiple backends.
11 - Vector Index Build Build-only vector benchmark comparing ArcadeDB, pgvector, Qdrant, Milvus, FAISS, and LanceDB.
12 - Vector Search Search-only vector benchmark that reuses Example 11 output and sweeps backend-specific search parameters.
13 - Stack Overflow Hybrid Queries Standalone SQL + graph + vector workflow over Stack Overflow data.
14 - Lifecycle Timing Embedded lifecycle benchmark covering JVM startup, load, query, close, and reopen timing.
15 - Import Database vs Transactional Table Ingest Four-way table-ingest comparison. Repository guidance from these experiments is to prefer single-worker async SQL for bulk table/document ingest.
16 - Import Database vs Transactional Graph Ingest
Four-way graph-ingest comparison. Repository guidance from these experiments is to prefer GraphBatch for bulk graph ingest.
17 - Time Series End-to-End SQL-first time-series workflow covering type creation, tagged inserts, range queries, and hourly bucket aggregation.
18 - Geo Predicates With WKT Points And Polygons
SQL-first geospatial workflow covering WKT storage, GEOSPATIAL indexes, indexed within / intersects, polygon overlap queries, and fallback after dropping the index.
19 - Hash Index Exact-Match Lookup Workflow SQL-first HASH index workflow covering unique and non-unique exact-match lookups, schema inspection, and duplicate-key rejection.
20 - Graph Algorithms Route Planning
SQL-first graph algorithms workflow covering minimum-hop shortestPath, weighted dijkstra / astar, sqlscript variables, and route-cost comparison.
21 - Server Mode And HTTP Access
Embedded-first server workflow covering create_server(...), HTTP auth, server-managed database creation, and mixed embedded plus HTTP access to the same data.
22 - Graph Analytical View SQL Workflow
SQL-first Graph Analytical View workflow covering six-figure synthetic graph generation, CREATE / ALTER / REBUILD, schema metadata polling, stale-versus-ready lifecycle, and persisted GAV restoration.
Quick Start¶
⚠️ Important: Always run examples from the examples/ directory.
Learning Path¶
- Document Store (01) - Learn fundamentals
- Graph Operations (02) - Understand relationships
- Vector Search (03) - AI/ML integration
- CSV Import - Documents (04) - ETL to documents with MovieLens
- CSV Import - Graph (05) - Same data as graph with performance benchmarks
- Vector Search - Movies (06) - Semantic search and recommendations
- Stack Overflow Tables (OLTP/OLAP) (07/08) - Table benchmarks and fairness conventions
- Stack Overflow Graph (OLTP/OLAP) (09/10) - Directed graph benchmarks and query suites
- Vector Benchmarks (11/12) - Index build and search benchmarking across vector backends
- Hybrid Queries (13) - Combined SQL, graph, and vector workflow
- Lifecycle And Ingest Benchmarks (14/15/16) - Embedded lifecycle timing and ingest comparisons
- Time Series SQL Workflow (17) - Tagged samples, range queries, and bucket aggregation from Python
- Geo Predicate Workflow (18) - WKT points and polygons, indexed spatial filters, and fallback semantics
- Hash Index Workflow (19) - Exact-match HASH indexes, missing-key behavior, and duplicate protection
- Graph Algorithms Workflow (20) - Minimum-hop versus weighted routing with
shortestPath,dijkstra, andastar - Server Mode Workflow (21) - Start the built-in server, create schema over HTTP, and verify mixed embedded plus HTTP access
- Graph Analytical View Workflow (22) - Manage GAV lifecycle entirely through SQL and inspect
schema:graphAnalyticalViews
Start with Simple Document Store!