Skip to main content
Version: 0.4.x

ByConity Introduction

What is ByConity

ByConity is a new generation of open-source cloud-native data warehouse. It adopts a compute-storage separation architecture to meet various requirements of data warehouse users, such as elastic scaling of resources, read-write separation, resource isolation, and strong data consistency, while providing excellent query and write performance. ByConity uses a large number of mature OLAP technologies, such as column-based storage engines, MPP execution, intelligent query optimization, vectorized execution, code generation, indexing, data compression, etc.

ByConity's name

ByConity's name is composed of three words: Byte, Convert, and Community. "By" comes from "byte" which represents the basic unit of stored data, metaphorically referring to massive amounts of data. "Con" comes from "convert" which represents change and innovation. "Conity" also comes from "community" which represents a group of people, specifically our open-source developer community. The implication of ByConity is to bring together a group of developers who are willing to break conventional technologies and change the way we use data.

Applicable Scenarios

ByConity can meet various user scenarios for enterprise users, such as interactive queries, real-time data dashboards, and real-time data warehouses.

  • Interactive Queries: This scenario includes user-defined queries, self-service reporting, user profiling analysis, marketing effectiveness analysis, and behavior log analysis. These applications support free-dimension and multi-table association query analysis with fast response times. Behavior log analysis also supports large-scale log exploration and analysis.

  • Real-time Data Dashboards: Suitable for real-time business monitoring screens, live data statistics dashboards, business dashboards, and system link monitoring. All applications emphasize real-time characteristics, and some support statistical functions.

  • Real-time Data Warehouses: Includes real-time data ingestion and near-real-time ETL calculations, emphasizing real-time data writing and immediate data visibility while supporting complex calculations and data cleaning.

  • ELT workerloads: User can use bsp mode to do batch processing. Fault-tolerance is enabled with this mode. See ELT introduction.