Your First Benchmark¶

Run a complete TPC-H benchmark in under 5 minutes with zero configuration.

What You’ll Do¶

Install BenchBox
Run a TPC-H power test on DuckDB
View your results

Step 1: Install BenchBox¶

# Using uv (recommended)
uv add benchbox

# Or using pip
pip install benchbox

Step 2: Run the Benchmark¶

benchbox run --platform duckdb --benchmark tpch --scale 0.01

This command:

Generates TPC-H data at scale factor 0.01 (~10MB)
Loads data into DuckDB (in-memory)
Executes all 22 TPC-H queries
Reports timing and validation results

Expected output:

BenchBox v1.1.0 - TPC-H Power Test

Platform: DuckDB (in-memory)
Benchmark: TPC-H
Scale Factor: 0.01

[1/4] Generating data... ━━━━━━━━━━━━━━━━━━━━ 100% 0:00:05
[2/4] Loading tables... ━━━━━━━━━━━━━━━━━━━━ 100% 0:00:02
[3/4] Running queries... ━━━━━━━━━━━━━━━━━━━━ 100% 0:00:08
[4/4] Validating results... ━━━━━━━━━━━━━━━━━━━━ 100% 0:00:01

✓ Benchmark complete!

Summary:
  Total Time: 16.2s
  Queries: 22/22 passed
  Validation: All row counts match expected

Step 3: View Results¶

# Show recent results
benchbox results --limit 1

# Export to JSON for analysis
benchbox export --last --format json

What Just Happened?¶

Data Generation: BenchBox used TPC-H’s data generator to create realistic business data (customers, orders, line items)
Schema Loading: Tables were created in DuckDB and data was loaded
Query Execution: All 22 TPC-H queries ran sequentially (power test)
Validation: Results were compared against expected row counts

Try Different Options¶

# Larger dataset (takes longer, more realistic)
benchbox run --platform duckdb --benchmark tpch --scale 0.1

# Run specific queries only
benchbox run --platform duckdb --benchmark tpch --queries Q1,Q6,Q17

# Preview without running (dry run)
benchbox run --dry-run ./preview --platform duckdb --benchmark tpch

Next Steps¶

Understanding Results - Learn what the metrics mean
Comparing Platforms - Run on multiple databases
DataFrame Benchmarking - Use Polars/Pandas APIs