Examples Overview

Hands-on examples that demonstrate the ArcadeDB Python bindings in real-world scenarios. Each example is self-contained, well documented, and ready to run.

DSL-first examples

Current examples and docs use SQL/OpenCypher as the default approach for schema, CRUD, and graph operations.

Available Examples

🏁 Getting Started

Dataset Downloader: Download and prepare the datasets used by the examples (MovieLens, Stack Exchange, MSMARCO).

01 - Simple Document Store: Foundation example covering document types, CRUD operations, comprehensive data types (DATE, DATETIME, DECIMAL, FLOAT, INTEGER, STRING, BOOLEAN, LIST OF STRING), and NULL value handling (INSERT NULL, UPDATE to NULL, IS NULL queries).

02 - Social Network Graph: Complete graph modeling with vertices, edges, NULL handling, and dual query languages (SQL MATCH vs Cypher). Models 8 people with optional fields and 24 bidirectional edges, and demonstrates graph traversal and comprehensive queries.

03 - Vector Search: Semantic similarity search with HNSW (JVector) indexing. Demonstrates vector storage, index creation, and nearest neighbor search.
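To make "nearest neighbor search" concrete, here is a minimal standard-library sketch of the exact, brute-force search that an approximate index such as HNSW is designed to speed up. It does not use the ArcadeDB API. The cosine metric, the function names, and the toy vectors are assumptions chosen for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def nearest_neighbors(query, vectors, k=2):
    """Exact top-k search: score every stored vector, then keep the best k keys."""
    scored = [(cosine_similarity(query, vec), key) for key, vec in vectors.items()]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [key for _, key in scored[:k]]


# Hypothetical toy embeddings, not data from the example itself.
vectors = {
    "doc_a": [1.0, 0.0, 0.0],
    "doc_b": [0.9, 0.1, 0.0],
    "doc_c": [0.0, 1.0, 0.0],
    "doc_d": [0.0, 0.0, 1.0],
}

top = nearest_neighbors([1.0, 0.05, 0.0], vectors, k=2)
print(top)
```

A full scan like this costs one comparison per stored vector for every query, which is why larger collections rely on an index instead.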

04 - CSV Import - Documents: Production CSV import with automatic type inference performed by the Java engine, NULL handling, and index optimization. Imports the MovieLens dataset (36M+ records) with comprehensive performance analysis and result validation against actual data samples.

05 - CSV Import - Graph: Production graph creation from the MovieLens dataset. Comprehensive performance analysis of SQL pipelines, synchronous vs async execution, and index effects. Includes benchmark configurations, validation queries, and export/import roundtrip testing.

06 - Vector Search - Movie Recommendations: Production-ready vector embeddings and HNSW (JVector) indexing for semantic movie search.

07 - Stack Overflow Tables (OLTP): Table-oriented OLTP benchmark with mixed CRUD operations and deterministic single-thread verification.

08 - Stack Overflow Tables (OLAP): Table-oriented OLAP benchmark with fixed analytical queries, load/index timing, and repeated query runs.

09 - Stack Overflow Graph (OLTP): Graph OLTP benchmark with directed-edge semantics, result verification notes, and cross-backend workload comparison.

10 - Stack Overflow Graph (OLAP): Graph OLAP benchmark using a fixed OpenCypher query suite across multiple backends.

11 - Vector Index Build: Build-only vector benchmark comparing ArcadeDB, pgvector, Qdrant, Milvus, FAISS, and LanceDB.

12 - Vector Search: Search-only vector benchmark that reuses Example 11 output and sweeps backend-specific search parameters.

13 - Stack Overflow Hybrid Queries: Standalone SQL + graph + vector workflow over Stack Overflow data.

14 - Lifecycle Timing: Embedded lifecycle benchmark covering JVM startup, load, query, close, and reopen timing.

15 - Import Database vs Transactional Table Ingest: Four-way table-ingest comparison. Repository guidance from these experiments is to prefer single-worker async SQL for bulk table/document ingest.

16 - Import Database vs Transactional Graph Ingest: Four-way graph-ingest comparison. Repository guidance from these experiments is to prefer GraphBatch for bulk graph ingest.

17 - Time Series End-to-End: SQL-first time-series workflow covering type creation, tagged inserts, range queries, and hourly bucket aggregation.
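As a rough picture of what "range queries" and "hourly bucket aggregation" mean for tagged samples, the following standard-library sketch filters by tag and by a half-open time range, then averages values per hour. It does not reproduce the example's SQL. The function name, tag values, and sample data are hypothetical.

```python
from collections import defaultdict
from datetime import datetime


def hourly_average(samples, tag, start, end):
    """Average the values of one tag per hour, for timestamps in [start, end)."""
    buckets = defaultdict(list)
    for ts, sample_tag, value in samples:
        if sample_tag != tag or not (start <= ts < end):
            continue  # wrong tag or outside the requested range
        # Truncate the timestamp to the start of its hour to get the bucket key.
        bucket_start = ts.replace(minute=0, second=0, microsecond=0)
        buckets[bucket_start].append(value)
    return {hour: sum(vals) / len(vals) for hour, vals in sorted(buckets.items())}


# Hypothetical tagged samples: (timestamp, tag, value).
samples = [
    (datetime(2024, 1, 1, 9, 5), "sensor-a", 20.0),
    (datetime(2024, 1, 1, 9, 40), "sensor-a", 22.0),
    (datetime(2024, 1, 1, 9, 50), "sensor-b", 99.0),
    (datetime(2024, 1, 1, 10, 15), "sensor-a", 30.0),
    (datetime(2024, 1, 1, 12, 0), "sensor-a", 50.0),
]

averages = hourly_average(
    samples,
    tag="sensor-a",
    start=datetime(2024, 1, 1, 9, 0),
    end=datetime(2024, 1, 1, 12, 0),
)
for hour, mean in averages.items():
    print(hour.isoformat(), mean)
```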

18 - Geo Predicates With WKT Points And Polygons: SQL-first geospatial workflow covering WKT storage, GEOSPATIAL indexes, indexed within / intersects, polygon overlap queries, and fallback after dropping the index.

19 - Hash Index Exact-Match Lookup Workflow: SQL-first HASH index workflow covering unique and non-unique exact-match lookups, schema inspection, and duplicate-key rejection.
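The behaviors named here (exact-match lookup, unique versus non-unique keys, and duplicate-key rejection) can be pictured with a small in-memory model built on a Python dict. This is only a conceptual sketch, not how ArcadeDB implements HASH indexes. The `HashIndex` class, its methods, and the record ids are hypothetical.

```python
class HashIndex:
    """Toy exact-match index: maps each key to a list of record ids."""

    def __init__(self, unique=False):
        self.unique = unique
        self._entries = {}

    def put(self, key, record_id):
        ids = self._entries.setdefault(key, [])
        if self.unique and ids:
            # A unique index refuses a second record for the same key.
            raise ValueError(f"duplicate key: {key!r}")
        ids.append(record_id)

    def get(self, key):
        """Return all record ids for the key, or an empty list if it is missing."""
        return list(self._entries.get(key, []))


# Unique index: the second insert for the same key is rejected.
emails = HashIndex(unique=True)
emails.put("ada@example.com", "rec-1")
try:
    emails.put("ada@example.com", "rec-2")
    duplicate_rejected = False
except ValueError:
    duplicate_rejected = True

# Non-unique index: several records may share one key.
cities = HashIndex(unique=False)
cities.put("Rome", "rec-1")
cities.put("Rome", "rec-3")

print(duplicate_rejected, emails.get("ada@example.com"), cities.get("Rome"), cities.get("Oslo"))
```

Hash-based structures answer equality lookups directly but keep no key ordering, which is why the example focuses on exact matches.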

20 - Graph Algorithms Route Planning: SQL-first graph algorithms workflow covering minimum-hop shortestPath, weighted dijkstra / astar, sqlscript variables, and route-cost comparison.
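The difference between a minimum-hop route and a minimum-cost route is easy to see on a tiny graph. The sketch below uses plain breadth-first search and Dijkstra's algorithm from the standard library; it is not ArcadeDB's implementation. The graph, weights, and function names are made up for illustration.

```python
import heapq
from collections import deque

# Hypothetical directed graph: node -> {neighbor: edge weight}.
GRAPH = {
    "A": {"B": 10, "C": 1},
    "B": {"D": 10},
    "C": {"E": 1},
    "E": {"D": 1},
    "D": {},
}


def rebuild_path(previous, goal):
    """Walk predecessor links back from goal; None if goal was never reached."""
    if goal not in previous:
        return None
    path = []
    node = goal
    while node is not None:
        path.append(node)
        node = previous[node]
    return path[::-1]


def min_hop_path(graph, start, goal):
    """Breadth-first search: fewest edges, weights ignored."""
    previous = {start: None}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            break
        for neighbor in graph[node]:
            if neighbor not in previous:
                previous[neighbor] = node
                queue.append(neighbor)
    return rebuild_path(previous, goal)


def cheapest_path(graph, start, goal):
    """Dijkstra's algorithm: lowest total weight (weights must be non-negative)."""
    best = {start: 0}
    previous = {start: None}
    heap = [(0, start)]
    while heap:
        cost, node = heapq.heappop(heap)
        if node == goal:
            break
        if cost > best[node]:
            continue  # stale heap entry
        for neighbor, weight in graph[node].items():
            new_cost = cost + weight
            if new_cost < best.get(neighbor, float("inf")):
                best[neighbor] = new_cost
                previous[neighbor] = node
                heapq.heappush(heap, (new_cost, neighbor))
    return rebuild_path(previous, goal)


def path_cost(graph, path):
    """Sum of edge weights along a path."""
    return sum(graph[a][b] for a, b in zip(path, path[1:]))


hop_route = min_hop_path(GRAPH, "A", "D")
cheap_route = cheapest_path(GRAPH, "A", "D")
print(hop_route, path_cost(GRAPH, hop_route))
print(cheap_route, path_cost(GRAPH, cheap_route))
```

On this graph the two-hop route costs 20 while the three-hop route costs 3, which is the kind of route-cost comparison the example reports.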

21 - Server Mode And HTTP Access: Embedded-first server workflow covering create_server(...), HTTP auth, server-managed database creation, and mixed embedded plus HTTP access to the same data.

22 - Graph Analytical View SQL Workflow: SQL-first Graph Analytical View workflow covering six-figure synthetic graph generation, CREATE / ALTER / REBUILD, schema metadata polling, stale-versus-ready lifecycle, and persisted GAV restoration.

Quick Start

⚠️ Important: Always run examples from the examples/ directory.

cd bindings/python/examples/
python 01_simple_document_store.py
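If you launch examples from another script rather than a shell, the same rule can be enforced by setting the child process's working directory. The helper below is a hypothetical sketch using only the standard library; `run_example` is not part of the bindings.

```python
import subprocess
import sys
from pathlib import Path


def run_example(examples_dir, script_name):
    """Run one example script with its working directory set to examples_dir."""
    examples_dir = Path(examples_dir).resolve()
    script = examples_dir / script_name
    if not script.is_file():
        raise FileNotFoundError(f"{script_name} not found in {examples_dir}")
    # cwd makes the child process behave as if it were started from examples_dir.
    return subprocess.run(
        [sys.executable, script.name],
        cwd=examples_dir,
        check=True,
        capture_output=True,
        text=True,
    )


# Usage, assuming the repository layout shown above:
# result = run_example("bindings/python/examples", "01_simple_document_store.py")
# print(result.stdout)
```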

Learning Path

Document Store (01) - Learn fundamentals
Graph Operations (02) - Understand relationships
Vector Search (03) - AI/ML integration
CSV Import - Documents (04) - ETL to documents with MovieLens
CSV Import - Graph (05) - Same data as graph with performance benchmarks
Vector Search - Movies (06) - Semantic search and recommendations
Stack Overflow Tables (OLTP/OLAP) (07/08) - Table benchmarks and fairness conventions
Stack Overflow Graph (OLTP/OLAP) (09/10) - Directed graph benchmarks and query suites
Vector Benchmarks (11/12) - Index build and search benchmarking across vector backends
Hybrid Queries (13) - Combined SQL, graph, and vector workflow
Lifecycle And Ingest Benchmarks (14/15/16) - Embedded lifecycle timing and ingest comparisons
Time Series SQL Workflow (17) - Tagged samples, range queries, and bucket aggregation from Python
Geo Predicate Workflow (18) - WKT points and polygons, indexed spatial filters, and fallback semantics
Hash Index Workflow (19) - Exact-match HASH indexes, missing-key behavior, and duplicate protection
Graph Algorithms Workflow (20) - Minimum-hop versus weighted routing with shortestPath, dijkstra, and astar
Server Mode Workflow (21) - Start the built-in server, create schema over HTTP, and verify mixed embedded plus HTTP access
Graph Analytical View Workflow (22) - Manage GAV lifecycle entirely through SQL and inspect schema:graphAnalyticalViews

Start with Simple Document Store!