列类型

请参阅数据类型以获取一般参考。

数值类型

提示

数值类型编码与小端 CPU（如 AMD64 或 ARM64）的内存布局匹配。

这允许实现非常高效的编码和解码。

整数

Int 和 UInt 的字符串，8、16、32、64、128 或 256 位，小端序。

浮点数

IEEE 754 二进制表示中的 Float32 和 Float64。

字符串

只是一个字符串数组，即（长度，值）。

FixedString(N)

N 字节序列的数组。

IP

IPv4 是 UInt32 数值类型的别名，表示为 UInt32。

IPv6 是 FixedString(16) 的别名，直接表示为二进制。

元组

元组只是列的数组。例如，Tuple(String, UInt8) 只是两个连续编码的列。

Map

Map(K, V) 由三列组成：Offsets ColUInt64、Keys K、Values V。

Keys 和 Values 列中的行数是来自 Offsets 的最后一个值。

数组

Array(T) 由两列组成：Offsets ColUInt64、Data T。

Data 中的行数是来自 Offsets 的最后一个值。

Nullable

Nullable(T) 由具有相同行数的 Nulls ColUInt8、Values T 组成。

// Nulls is nullable "mask" on Values column.
// For example, to encode [null, "", "hello", null, "world"]
//	Values: ["", "", "hello", "", "world"] (len: 5)
//	Nulls:  [ 1,  0,       0,  1,       0] (len: 5)

UUID

FixedString(16) 的别名，UUID 值表示为二进制。

Enum

Int8 或 Int16 的别名，但每个整数都映射到某个 String 值。

低基数

LowCardinality(T) 由 Index T、Keys K 组成，其中 K 是 (UInt8、UInt16、UInt32、UInt64) 之一，具体取决于 Index 的大小。

// Index (i.e. dictionary) column contains unique values, Keys column contains
// sequence of indexes in Index column that represent actual values.
//
// For example, ["Eko", "Eko", "Amadela", "Amadela", "Amadela", "Amadela"] can
// be encoded as:
//	Index: ["Eko", "Amadela"] (String)
//	Keys:  [0, 0, 1, 1, 1, 1] (UInt8)
//
// The CardinalityKey is chosen depending on Index size, i.e. maximum value
// of chosen type should be able to represent any index of Index element.

布尔值

UInt8 的别名，其中 0 为 false，1 为 true。

数值类型​

整数​

浮点数​

字符串​

FixedString(N)​

IP​

元组​

Map​

数组​

Nullable​

UUID​

Enum​

低基数​

布尔值​

数值类型

整数

浮点数

字符串

FixedString(N)

IP

元组

Map

数组

Nullable

UUID

Enum

低基数

布尔值