Draft:Apache Doris

Apache Doris is an open-source real-time data warehouse written mostly in Java and C++. It is a column-oriented DBMS compatible with the MySQL protocol. Its design combines the distributed storage engine of Google Mesa with the massively parallel processing (MPP) SQL query engine of Apache Impala.

History
Apache Doris originated as a project started by Baidu in 2008 to meet the requirements of the company's advertising business. It was developed into an analytic database that supported a range of data services, including multidimensional analysis, user profile analysis, ad hoc queries, and real-time dashboards. In 2017, it was open-sourced and made available on GitHub. In July 2018, Apache Doris entered the Apache Incubator, a program run by the Apache Software Foundation to guide and nurture open-source projects. In June 2022, the Apache Software Foundation announced the graduation of Apache Doris as a Top-Level Project. By then, it had accumulated 300 code contributors and over 500 enterprise users, including ByteDance, Tencent, and Xiaomi.

Features
Apache Doris executes queries using column-oriented storage, indexes, a parallel execution engine, vectorization, and a query optimizer. It is horizontally scalable and operates independently of third-party services. It uses on-demand JSON to optimize resource utilization and Bloom filter indexes to accelerate queries.
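As a general illustration of how a Bloom filter index can accelerate a lookup, the following Python sketch builds one small filter per data block and skips any block whose filter rules out the requested key. It is not the Doris implementation: the names BloomFilter, build_block, and scan, the filter size, and the use of SHA-256 for hashing are all assumptions made for this example.

```python
import hashlib


class BloomFilter:
    """Minimal Bloom filter; sizes and hashing scheme are illustrative assumptions."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits)  # one byte per bit, for simplicity

    def _positions(self, value):
        # Derive num_hashes bit positions by salting SHA-256 with a seed.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{value}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, value):
        for pos in self._positions(value):
            self.bits[pos] = 1

    def might_contain(self, value):
        # False means "definitely absent"; True means "possibly present".
        return all(self.bits[pos] for pos in self._positions(value))


def build_block(rows):
    """Pair a list of column values with a Bloom filter built over them."""
    bloom = BloomFilter()
    for row in rows:
        bloom.add(row)
    return bloom, rows


def scan(blocks, key):
    """Return matching values and the number of blocks actually read."""
    matches, blocks_read = [], 0
    for bloom, rows in blocks:
        if not bloom.might_contain(key):
            continue  # the filter proves the key is absent, so skip the block
        blocks_read += 1
        matches.extend(row for row in rows if row == key)
    return matches, blocks_read


blocks = [build_block(["u1", "u2"]), build_block(["u3", "u4"])]
matches, blocks_read = scan(blocks, "u3")
print(matches, blocks_read)
```

A Bloom filter never reports a stored key as absent, so skipping a block is always safe. It can occasionally report an absent key as possibly present, in which case the block is read unnecessarily but the result remains correct.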

Doris is compatible with big data components such as Apache Flink, Apache Hive, Apache Hudi, Apache Iceberg, Apache Spark, and Elasticsearch. It can handle data aggregation and join queries in a real-time data processing architecture, and historical data queries in a log analytics platform.