Ad-hoc Import
Column ingestion
This tutorial guides users through the basic operations of creating tables, importing data, and querying data using ByConity in a Docker deployment environment.
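The statements below are a minimal sketch of that workflow, assuming a client is already connected to the Docker deployment; the database and table names are illustrative.

```sql
-- Create a database and a simple table (names are illustrative).
CREATE DATABASE IF NOT EXISTS demo_db;

CREATE TABLE demo_db.events
(
    id   UInt64,
    name String
)
ENGINE = CnchMergeTree()
ORDER BY id;

-- Import a couple of rows, then query them back.
INSERT INTO demo_db.events VALUES (1, 'first_event'), (2, 'second_event');

SELECT * FROM demo_db.events ORDER BY id;
```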
Release Date: May 24, 2023
Release Date: September 9, 2023
Release Date: December 17, 2023
Release Date: April 15, 2024
Upgrading Clusters Based on S3 Storage (Applicable Only to Versions Before 0.2.0)
Data Caching (Disk Cache)
Document Type: Tutorial
Document Type: Descriptive
Complex Query Execution Model
ByConity supports a set of column compression codecs: methods for compressing and decompressing data that save storage space and improve I/O performance.
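For illustration, a codec can be declared per column in the table definition, following ClickHouse-style CODEC syntax; the column names and codec choices here are assumptions.

```sql
CREATE TABLE codec_demo
(
    ts    DateTime CODEC(Delta, LZ4),   -- delta-encode timestamps, then compress with LZ4
    value Float64  CODEC(ZSTD(3))       -- ZSTD at compression level 3
)
ENGINE = CnchMergeTree()
ORDER BY ts;
```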
Data Sources
This tutorial guides users through exporting data from ByConity tables to different file systems or databases, as well as exporting query results. Currently, ByConity supports exporting data to local file systems and to HDFS (Hadoop Distributed File System).
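As a hedged sketch, exporting a query result with ClickHouse-style INTO OUTFILE syntax might look like the following; the paths, table name, and format are illustrative.

```sql
-- Export a query result to a local CSV file (path is illustrative).
SELECT * FROM demo_db.events
INTO OUTFILE '/tmp/events.csv'
FORMAT CSV;

-- An HDFS URI may be usable as the target instead, assuming HDFS export is configured:
-- SELECT * FROM demo_db.events INTO OUTFILE 'hdfs://namenode:9000/tmp/events.csv' FORMAT CSV;
```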
Direct Write Method Tuning
Columnar Storage Design Principles
The data types provided in ByConity are adapted from ClickHouse. Visit this page for more information on ClickHouse data types.
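For illustration, a table definition combining several of these ClickHouse-derived types might look as follows; the table and column names are hypothetical.

```sql
CREATE TABLE type_examples
(
    id         UInt64,              -- unsigned 64-bit integer
    name       String,              -- variable-length string
    score      Nullable(Float32),   -- nullable 32-bit float
    tags       Array(String),       -- array of strings
    created_at DateTime             -- date and time
)
ENGINE = CnchMergeTree()
ORDER BY id;
```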
Tutorial Objectives:
This article demonstrates how to deploy a ByConity cluster in a Kubernetes (k8s) environment.
Currently, ByConity is installed on physical machines through package managers, for example by installing Debian packages on Debian or RPM packages on CentOS. Since ByConity uses FoundationDB as its metadata storage and HDFS as its data storage, FoundationDB and HDFS must be deployed before ByConity itself. The steps are to install FoundationDB (FDB) first, then install HDFS, and finally deploy the ByConity software package. Details are as follows.
Software Requirements
Refer to the Admin Guide - Cluster Configuration Parameters
Our developers connect to a Debian 11 development environment running in a virtual machine using the Dev Containers plugin for VS Code. We therefore recommend that you use the same approach for development to avoid many dependency and compatibility issues. For more information, refer to the official Dev Containers documentation.
Importing from Hive External Tables
Importing via MySQL
This tutorial guides users through quickly deploying ByConity locally using Docker. Users can follow the steps below to deploy a simple ByConity cluster for a quick-start experience. If you want to try out and understand ByConity in full, it is recommended that you skip this section and refer directly to ByConity Cluster Deployment to experience the full performance of ByConity.
ClickHouse in OLAP Scenarios and Its Problems
In addition to supporting access to Hive data using external tables, ByConity also supports access through the External Catalog feature.
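A minimal sketch of the External Catalog approach is shown below; the exact property names, the metastore URI, and the catalog/database/table names are assumptions and should be checked against your deployment.

```sql
-- Register a Hive metastore as an external catalog (properties are assumptions).
CREATE EXTERNAL CATALOG hive_catalog
PROPERTIES
    type = 'hive',
    hive.metastore.uri = 'thrift://localhost:9083';

-- Query Hive data through the catalog using the catalog.database.table form.
SELECT * FROM hive_catalog.hive_db.hive_table LIMIT 10;
```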
CnchHive
Deployment-related
Elastic Scaling
ByConity provides two SQL dialects: ClickHouse and ANSI.
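As an illustration, the dialect can typically be switched per session with a setting; the setting name dialect_type is an assumption here and should be verified against the configuration reference.

```sql
-- Switch the current session to the ANSI dialect (setting name assumed).
SET dialect_type = 'ANSI';

-- Switch back to the default ClickHouse dialect.
SET dialect_type = 'CLICKHOUSE';
```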
ByConity supports deploying multiple ByConity Server, Resource Manager, and TSO processes within the same cluster to achieve high availability. The leader among them is determined through a Compare-And-Swap (CAS) operation on the shared FoundationDB.
Dependencies
Importing from HDFS
Importing from Kafka
Full-Text Search
Configuring Background Tasks
map
Function Definition
Common Monitoring Metrics
MACNumToString
RBAC Model
Introduction
The optimizer is the core of a database system. An excellent optimizer can greatly improve query performance, especially in complex query scenarios, where it can deliver performance improvements ranging from severalfold to hundreds of times.
The Resource Manager (RM) component is responsible for the unified management and scheduling of ByConity's computational resources. It is a core component for achieving resource elasticity and improving resource utilization.
ByConity supports containerized and modular deployment, allowing for elastic scaling. It can scale computing resources in real-time and on-demand, ensuring efficient resource utilization. This section describes how to scale ByConity on a Kubernetes cluster.
The statements supported in ByConity are similar to those in ClickHouse, but it is still recommended to follow the ByConity manual to ensure proper use. Some of the examples below are referenced from the ClickHouse documentation but have been adapted and modified to work in ByConity.
Streaming Data Import (Local Files and In-Memory Data)
The table engine determines how data is organized and stored, the method and type of indexing, which queries are supported and how, as well as some other specific features and configurations. In ByConity, the most commonly used table engines are CnchMergeTree, CnchKafka, and CnchMaterializedMySql. This article focuses on explaining the principles of these three table engines.
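As a brief illustration, a CnchMergeTree table is defined much like a ClickHouse MergeTree table, with a partition key and a sort key; the names used here are illustrative.

```sql
CREATE TABLE demo_db.user_events
(
    event_date Date,
    user_id    UInt64,
    event_type String
)
ENGINE = CnchMergeTree()
PARTITION BY toYYYYMM(event_date)   -- one partition per month
ORDER BY (user_id, event_date);     -- sort key within each data part
```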
Technical Architecture
With the increasing volume and complexity of data, enterprises are increasingly turning to OLAP (On-Line Analytical Processing) engines to process large-scale data and deliver real-time analytical insights. Performance is a very important factor when choosing an OLAP engine. This article therefore conducts a performance comparison of four prominent open-source OLAP engines: ClickHouse, Doris, Presto, and ByConity, using the 99 query statements of the TPC-DS benchmark, to serve as a reference for enterprises selecting the OLAP engine that best fits their needs.
Transaction Overview
Introduction
Document Type: Tutorial
What is ByConity
ByConity supports the standard syntax of window functions. A list of window-related features is explained below.
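As a brief illustration of the standard syntax (the table and column names here are hypothetical):

```sql
-- Rank employees by salary within each department.
SELECT
    department,
    employee,
    salary,
    rank() OVER (PARTITION BY department ORDER BY salary DESC) AS salary_rank
FROM employees;
```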