February 2023 Update

Documentation

Core Library

  • Add support for DECIMAL types in DecodedVector.

  • Add support for k PRECEDING/FOLLOWING window frames in ROWS mode.

  • Enhance BloomFilter so that it can be used in aggregate functions.

  • Replace ArrayBuilder class with ArrayWriter.

Presto Functions

Spark Functions

Hive Connector

  • Add support for single-level subfield pruning. #3949

  • Add support for BOOLEAN and DATE type in native Parquet reader.

  • Add options to open and prepare file splits asynchronously.

  • Fix reading of VARBINARY columns in Parquet reader.

Substrait

  • Update Substrait to 0.23.0.

  • Add support for emit

  • Add support for DATE type.

Arrow

Performance and Correctness

  • Add concurrent memory allocation benchmark.

  • Add support for CAST and TRY special forms to Fuzzer.

  • Add support to favorably select certain functions.

  • Add support to generate repro files for window fuzzers.

  • Add support for nested expression re-use.

  • Add support for DECIMAL types to VectorFuzzer.

  • Add FuzzerConnector to source randomly generated data. #4094

  • Add GeneratorSpec to support generating vectors with customized data, nulls and encoding.

  • Fix bugs in CAST found by Fuzzer.

  • Fix memory leaks in Prestissimo use cases.

Build System

Credits

Aditi Pandit, Bikramjeet Vig, ChenZhang, Chengcheng Jin, Christy Lee-Eusman, Deepak Majeti, Ge Gao, Hazem Ibrahim Nada, Hualong Gervais, Huameng Jiang, Ivan Sadikov, Jacob Wujciak-Jens, Jimmy Lu, Karteek Murthy Samba Murthy, Krishna Pai, Laith Sakka, Manikandan Somasundaram, Mark Shroyer, Masha Basmanova, Michael Shang, Open Source Bot, Orri Erling, Pedro Eugenio Rocha Pedreira, Pramod, Sergey Pershin, Wei He, Weiguo Chen, Xianda Ke, Xuedong Luan, ZJie1, joey.ljy, rui-mo, vibhatha, xiaoxmeng, yangchuan, yoha.zy, zhejiangxiaomai, zky.zhoukeyong, 张政豪