An intro to column-based storage
Mark Needham
In this video, we explore column storage, the backbone of column stores or column databases like ClickHouse. We compare the concept to the more familiar row-based storage used in many relational databases, using a practical example of weather data to illustrate the differences.
Column storage isn't just a different way to organize data; it's a game-changer for analytical queries and data compression. We'll explore why this approach is particularly well-suited for modern data analysis needs and how it aligns with current CPU architectures.
Key points covered:
- Comparison of row-based vs. column-based storage layouts
- Advantages of column storage for data compression and efficient querying
- Examples of compression techniques like dictionary encoding and delta encoding
- How column storage enables faster analytical queries and aggregations
- The benefits of column storage for CPU cache usage and SIMD operations

ClickHouse at DoorDash
How DoorDash monitors TCP and DNS traffic helping their observability team take care of their kubernetes infrastructure.

Open House NYC: Keynote
Keynote from Open House Roadshow NYC 2025.

Open House NYC: ClickHouse for AI/ML
Zach Naimon, Principal Product Manager at ClickHouse