chDB - fast, reliable, and scalable in-process database
Announcing chDB 4.0
Write Pandas, run ClickHouse, now on Hex
With the new DataStore API, write the Pandas code you already know, and it executes on the ClickHouse engine automatically.
DataStore records your operations as a lazy pipeline, compiles the full chain into optimized ClickHouse SQL at execution time, and applies filter pushdown, column pruning, and limit propagation.
Lazy execution, ClickHouse speed
Operations are recorded, not executed. The full pipeline is compiled into optimized SQL and run on ClickHouse's vectorized, multi-threaded engine.
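To make the idea of "recorded, not executed" concrete, here is a toy sketch of a lazy pipeline that only builds SQL when asked. It is not chDB's DataStore implementation or API: the `LazyTable` class, its methods, and the `hits` table are hypothetical names invented for illustration, and real optimizers handle far more cases.

```python
class LazyTable:
    """Toy lazy pipeline (hypothetical, not the DataStore API)."""

    def __init__(self, source, columns=None, filters=(), limit=None):
        self.source = source
        self.columns = columns          # None means "all columns"
        self.filters = tuple(filters)
        self.limit = limit

    def select(self, *columns):
        # Column pruning: only requested columns reach the SELECT list.
        return LazyTable(self.source, list(columns), self.filters, self.limit)

    def where(self, condition):
        # A filter applied after a limit cannot be merged into the same
        # WHERE clause without changing results, so this toy rejects it.
        if self.limit is not None:
            raise ValueError("toy pipeline: call where() before head()")
        # Filter pushdown: conditions are recorded for a single WHERE clause.
        return LazyTable(self.source, self.columns,
                         self.filters + (condition,), self.limit)

    def head(self, n):
        # Limit propagation: keep the smallest limit requested so far.
        new_limit = n if self.limit is None else min(self.limit, n)
        return LazyTable(self.source, self.columns, self.filters, new_limit)

    def to_sql(self):
        # Nothing ran until now; the whole chain becomes one statement.
        cols = ", ".join(self.columns) if self.columns else "*"
        sql = f"SELECT {cols} FROM {self.source}"
        if self.filters:
            sql += " WHERE " + " AND ".join(f"({f})" for f in self.filters)
        if self.limit is not None:
            sql += f" LIMIT {self.limit}"
        return sql


pipeline = (
    LazyTable("hits")                   # "hits" is a made-up table name
    .where("country = 'DE'")
    .where("duration > 10")
    .select("user_id", "duration")
    .head(100)
    .head(5)
)
sql = pipeline.to_sql()
print(sql)
```

Each method returns a new object rather than touching data, so the five chained calls cost nothing until `to_sql()` is invoked. The engine then sees one statement instead of five separate steps.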
Seamless engine fallback
One import migration
Try chDB 4.0 on Hex
Run Pandas-style ClickHouse queries in Hex notebooks with zero setup.
Get an extended free trial through ClickHouse.
Learn how to use chDB on Hex
chDB and the ClickHouse ecosystem
Open-source library
chDB is an open-source library, allowing you to customize and extend your database to fit your needs. Benefit from a vibrant community that keeps you updated with the latest innovations and security practices.
Query objects directly
With chDB, you can query native objects directly in your programming language of choice. This direct access reduces latency and simplifies data processing, enabling faster operations.
Supports Python DB API 2.0
chDB supports Python DB API 2.0, ensuring seamless integration with your Python applications. Enjoy consistency, reliability, and ease of use with familiar libraries and tools.
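Because DB API 2.0 (PEP 249) is a shared interface, code written against it should work with any compliant driver. The sketch below shows the connect, cursor, execute, fetch pattern with a hypothetical helper, `fetch_all`. It uses the standard library's `sqlite3` as a stand-in driver so it runs anywhere. The chDB README exposes its DB API module as `chdb.dbapi`; treat that name, and the commented-out call, as assumptions to verify against your installed version.

```python
import sqlite3


def fetch_all(connect, sql):
    """Run one query through any DB API 2.0 driver and return all rows.

    `connect` is a zero-argument callable returning a connection.
    """
    conn = connect()
    try:
        cur = conn.cursor()
        try:
            cur.execute(sql)
            return cur.fetchall()
        finally:
            cur.close()
    finally:
        conn.close()


# Stand-in driver: sqlite3 from the standard library.
rows = fetch_all(lambda: sqlite3.connect(":memory:"), "SELECT 1 + 1")
print(rows)

# Assumed chDB equivalent (not executed here; requires `pip install chdb`):
#   from chdb import dbapi
#   fetch_all(dbapi.connect, "SELECT version()")
```

Passing the `connect` callable in, rather than importing a driver inside the helper, keeps the query code independent of the engine behind it.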
Pandas-native
Write Pandas code that executes on ClickHouse. The DataStore API brings lazy execution, automatic query optimization, and seamless engine fallback.
Embedded. No need to install or run ClickHouse services
Zero-copy DataFrame exchange
Input and output support
Parquet, CSV, JSON, Arrow, ORC and 80+ more formats
chDB regularly outperforms DuckDB, Pandas, and Polars, now with zero-copy DataFrame exchange
With chDB, you have the benefit of ClickHouse's blazing speed, in-process.
247x
Faster than Pandas
COUNT(*) aggregation on 1M row DataFrame
~24%
Faster than DuckDB
Exporting 1M rows to Pandas DataFrame (2.64s vs 3.47s)
87x
Faster than chDB v1
End-to-end DataFrame query, v2+ zero-copy input
In head-to-head benchmarks across 14 common Pandas operations at 1M and 10M rows, chDB leads in the majority of challenges. See full benchmark results.
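For readers checking the DuckDB export figure, the short calculation below shows one way the "~24%" can be derived from the two quoted timings. It is an interpretation, not the benchmark's published method: it reads the percentage as the reduction in wall-clock time, and also prints the equivalent speedup ratio for comparison.

```python
# Timings quoted above for exporting 1M rows to a Pandas DataFrame.
chdb_seconds = 2.64
duckdb_seconds = 3.47

# Interpretation 1: how much less time chDB takes, relative to DuckDB.
time_reduction_pct = (duckdb_seconds - chdb_seconds) / duckdb_seconds * 100

# Interpretation 2: how many times faster chDB finishes the same work.
speedup = duckdb_seconds / chdb_seconds

print(f"time reduction: {time_reduction_pct:.1f}%")
print(f"speedup ratio:  {speedup:.2f}x")
```

The two readings give different headline numbers for the same timings, so it is worth stating which one a quoted percentage refers to when comparing results.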
What is chDB used for?
Start using chDB in minutes
pip install chdb
import chdb
query = "select count() FROM s3('s3://datasets-documentation/hackernews/hacknernews.json.gz')"
chdb.query(query, 'DataFrame')