ClickHouse列式储存数据库
ClickHouse是俄罗斯第一大搜索引擎Yandex开发的列式储存数据库.令人惊喜的是,这个列式储存数据库的性能大幅超越了很多商业MPP数据库软件,比如Vertica,InfiniDB.
相比传统的数据库软件,ClickHouse要快100-1000X:
100Million 数据集:
-
ClickHouse比Vertica约快5倍,比Hive快279倍,比My SQL快801倍
1Billion 数据集:
-
ClickHouse比Vertica约快5倍,MySQL和Hive已经无法完成任务了
该项目当前还有一些不足:
-
pre-build包只有Ubuntu平台的可用,并且该项目当前没有任何架构文档
-
只有Github上面的C++源代码
主要功能
-
True column-oriented
-
Vectorized query execution
-
Data compression
-
Parallel and distributed query execution
-
Real-time data ingestion
-
On-disk locality of reference
-
Real-time query processing
-
Cross-datacenter replication
-
High availability
-
SQL support
-
Local and distributed joins
-
Pluggable external dimension tables
-
Arrays and nested data types
-
Approximate query processing
-
Probabilistic data structures
-
Full support of IPv6
-
Features for web analytics
-
State-of-the-art algorithms
-
Detailed documentation
-
Clean documented code
应用场景
-
Web and App analytics
-
Advertising networks and RTB
-
Telecommunications
-
E-commerce
-
Information security
-
Monitoring and telemetry
-
Business intelligence
-
Online games
-
Internet of Things