
69 docs tagged with "Docs"


ANSI Compatibility

ByConity provides a rich set of SQL syntax through its ANSI SQL dialect. When using this dialect, SQL statements are parsed and validated by Apache Calcite and then sent to the servers for execution. Apache Calcite supports standard ANSI SQL; for details, refer to the BNF grammar at https://calcite.apache.org/docs/reference.html.
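
As a minimal sketch, switching a session to the ANSI dialect and running a standard query might look like the following. The `dialect_type` setting name, table, and columns here are illustrative assumptions:

```sql
-- Switch the session to the ANSI dialect so statements are parsed
-- and validated by Apache Calcite before execution (setting name assumed).
SET dialect_type = 'ANSI';

-- A standard ANSI SQL query; table and column names are hypothetical.
SELECT department, COUNT(*) AS headcount
FROM employees
GROUP BY department
HAVING COUNT(*) > 10;
```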

Background and Technical Architecture

ByConity is an open-source data warehouse system designed for modern IT architecture changes and built on a cloud-native architecture. It provides excellent query and write performance while meeting data warehouse users' needs for elastic resource scaling, read/write separation, resource isolation, and strong data consistency.

Basic Database Operations

There are a few ways to get started with ByConity: you can deploy it via package deployment, the Docker wrapper, or Kubernetes. To get started quickly, we recommend the ByConity Playground with docker-compose or the Docker wrapper.

Bucket Table Best Practice Manual

In ByConity, when using a bucket table, the system organizes table data based on one or more columns or expressions specified by the user in the table creation statement. Rows with the same values are clustered together and assigned the same bucket number.
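
A minimal sketch of creating a bucket table, assuming a `CLUSTER BY ... INTO ... BUCKETS` clause (clause placement may vary) and a hypothetical `events` table:

```sql
-- Rows with the same user_id hash to the same bucket number,
-- so data for one user is clustered together.
CREATE TABLE events
(
    user_id UInt64,
    event_time DateTime,
    payload String
)
ENGINE = CnchMergeTree
PARTITION BY toDate(event_time)
CLUSTER BY user_id INTO 10 BUCKETS
ORDER BY (user_id, event_time);
```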

ByConity 0.2.0 S3 Storage Upgrade Checklist

Some S3 object keys and S3 metadata changed after S3's preview version (from pre-0.2.0 to 0.2.0), and we provide tools to migrate from the old version. Use this checklist only if you are running an old version of ByConity and storing data in S3.

Column Storage Design Principles

Typically, transactional databases use row storage to support transactions and highly concurrent reads and writes, while analytical databases use column storage to reduce IO and facilitate compression. ByConity, on the other hand, uses column storage to ensure read and write performance while supporting transactional consistency, and it is well-suited for large-scale data computation.

Data Types

The data types provided in ByConity are adapted from ClickHouse. Visit this page for more information on ClickHouse data types.
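
For illustration, a hypothetical table using several common ClickHouse-derived types might be declared like this:

```sql
CREATE TABLE type_demo
(
    id UInt64,                          -- unsigned 64-bit integer
    name String,                        -- variable-length string
    score Nullable(Float64),            -- nullable double-precision float
    tags Array(String),                 -- array of strings
    status Enum8('ok' = 1, 'err' = 2),  -- small enumeration
    created_at DateTime                 -- date and time
)
ENGINE = CnchMergeTree
ORDER BY id;
```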

Deployment Requirements

ByConity can run on most mainstream commercial servers. We recommend that ByConity deployments comply with the following requirements:

FoundationDB Installation

In this guide, I will set up a FoundationDB cluster on three physical machines, all running Debian. I refer to two official guides: Getting Started on Linux and Building a Cluster.

Functions

ByConity provides two SQL dialects: (1) ClickHouse and (2) ANSI.

Git WorkFlow

ByConity leverages GitHub for development. Every contributor and maintainer in ByConity must follow this workflow:

HDFS Installation

In this guide, I will set up HDFS on three machines: one for the NameNode and the other two for DataNodes. I refer to the official documents SingleCluster and ClusterSetup. I will install HDFS version 3.3.4, so I need Java 8, the recommended Java version for this Hadoop release.

Hive External Catalog

Besides creating tables with the CnchHive engine to access external Hive tables, ByConity also supports accessing external tables through an external catalog.
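
A hedged sketch of the idea; the property names, metastore URI, and three-part naming below are assumptions and may differ from the actual syntax:

```sql
-- Register an external Hive catalog pointing at a metastore.
CREATE EXTERNAL CATALOG hive_catalog
PROPERTIES
    type = 'hive',
    hive.metastore.uri = 'thrift://127.0.0.1:9083';

-- Query a Hive table through catalog.database.table naming.
SELECT * FROM hive_catalog.hive_db.hive_table LIMIT 10;
```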

Hive External Table

CnchHive is a table engine provided by ByConity that supports federated queries in the form of external tables, so users can accelerate data queries directly without importing data. CnchHive supports querying Hive tables stored on both HDFS and S3.
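
As a sketch, creating a CnchHive external table could look like the following; the metastore URI and the Hive database/table names are placeholders:

```sql
-- Columns must match the schema of the underlying Hive table.
CREATE TABLE hive_orders
(
    order_id UInt64,
    amount Float64,
    order_date Date
)
ENGINE = CnchHive('thrift://127.0.0.1:9083', 'hive_db', 'orders')
PARTITION BY order_date;

-- Queries run directly against the Hive data; no import is needed.
SELECT order_date, sum(amount) FROM hive_orders GROUP BY order_date;
```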

Main Principles and Concepts

This chapter introduces the main principles of ByConity and its query execution. ByConity's query execution process is shown in the figure below. First, ByConity obtains the metadata required for the query through the metadata service. Then it generates an efficient query plan through the optimizer according to the user's SQL and schedules it to the corresponding compute group to read the data and execute it. Finally, the result set is aggregated and sent back to the client.

Package Deployment

One way to deploy ByConity on physical machines is with a package manager.

Query Acceleration

The preload feature loads data from remote storage into the local disk cache to speed up upcoming queries. After the preload finishes, queries read data from the local disk rather than from remote storage.
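
A hedged sketch of triggering a preload; the exact statement and the `parts_preload_level` setting name are assumptions, and the table name is hypothetical:

```sql
-- Warm the local disk cache for a table so subsequent queries
-- read from local disk instead of remote storage.
ALTER DISK CACHE PRELOAD TABLE demo.events SYNC
SETTINGS parts_preload_level = 3;
```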

Query Optimizer

The optimizer is the core of a database system. An excellent optimizer can greatly improve query performance, especially in complex query scenarios, where it can bring improvements from several-fold to hundreds-fold.
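
For example, assuming an `enable_optimizer` session setting, you might inspect the optimized plan for a join query (tables are hypothetical):

```sql
-- Enable the query optimizer for this session (setting name is an assumption).
SET enable_optimizer = 1;

-- Inspect the plan chosen for a join-plus-aggregation query.
EXPLAIN
SELECT o.user_id, sum(o.amount)
FROM orders AS o
INNER JOIN users AS u ON o.user_id = u.id
GROUP BY o.user_id;
```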

Recommended Use Cases

ByConity builds on a large number of mature OLAP technologies, such as a column storage engine, MPP execution, intelligent query optimization, vectorized execution, codegen, indexing, and data compression, and is mainly used in OLAP query and computing scenarios. It performs very well in real-time data ingestion, aggregate queries over large wide tables, complex analysis and computation over massive data, and multi-table join scenarios.

Resource Manager

The Resource Manager (RM) component provides unified management and scheduling of ByConity's computing resources and is the core component for achieving resource elasticity and improving resource utilization.

Role-based Access Control (RBAC)

RBAC in ByConity is adapted from ClickHouse's RBAC in most aspects, apart from minor syntax differences and the underlying implementation, which are explained further below.
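
Since the syntax largely follows ClickHouse, a typical flow might look like this (role, user, and database names are illustrative):

```sql
-- Create a role and grant it read access to one database.
CREATE ROLE analyst;
GRANT SELECT ON sales.* TO analyst;

-- Create a user and assign the role.
CREATE USER alice IDENTIFIED WITH plaintext_password BY 'secret';
GRANT analyst TO alice;
```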

SQL Statements

The statements supported in ByConity are similar to those in ClickHouse, but it is still recommended to follow the ByConity manual to ensure proper use. Some of the examples below are referenced from the ClickHouse documentation but have been adapted and modified to work in ByConity.
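
As a small illustration of ClickHouse-style statements adapted to ByConity (CnchMergeTree is ByConity's cloud-native table engine; database and table names are hypothetical):

```sql
CREATE DATABASE IF NOT EXISTS demo;

CREATE TABLE demo.visits
(
    visit_id UInt64,
    url String,
    ts DateTime
)
ENGINE = CnchMergeTree
ORDER BY visit_id;

INSERT INTO demo.visits VALUES (1, 'https://byconity.github.io', now());

SELECT count() FROM demo.visits;
```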

Window

ByConity supports the standard syntax of window functions. A list of window-related features is explained below.
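
For example, a standard running-total window function (the table and columns are hypothetical):

```sql
-- Compute a per-user running total of amounts ordered by timestamp.
SELECT
    user_id,
    ts,
    amount,
    sum(amount) OVER (PARTITION BY user_id ORDER BY ts) AS running_total
FROM demo.payments;
```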