Iceberg 版本 | Apache Iceberg 0.12.1 发布

共 1519字,需浏览 4分钟

 ·

2021-11-14 08:22

Apache Iceberg 0.12.1 已于2021年11月8日发布。


Apache Iceberg 是一种用于大型分析数据集的开放式表格式。Iceberg 为包括 Spark、Trino、PrestoDB、Flink 和 Hive 在内的多种计算引擎提供高性能的 SQL 表格式。


Apache Kyuubi(Inubating) 基于 Apache Spark 为包括 Iceberg 在内的数据湖三剑客均提供了支持。



该版本提供的 spark3-runtime Jar 仅支持 Spark 3.0 和 3.1。下一次主要版本的发布将会带来 Spark 3.2 的支持,也可以尝试基于 master 分支代码的每日构建版尝鲜 Spark 3.2 的支持。


Apache Iceberg 0.12.1 包含以下主要的问题修复和改进:


#3264 fixes validation failures that occurred after snapshot expiration when writing Flink CDC streams to Iceberg tables.

#3264 fixes reading projected map columns from Parquet files written before Parquet 1.11.1.

#3195 allows validating that commits that produce row-level deltas don't conflict with concurrently added files. Ensures users can maintain serializable isolation for update and delete operations, including merge operations.

#3199 allows validating that commits that overwrite files don't conflict with concurrently added files. Ensures users can maintain serializable isolation for overwrite operations.

#3135 fixes equality-deletes using DATE, TIMESTAMP, and TIME types.

#3078 prevents the JDBC catalog from overwriting the jdbc.user property if any property called user exists in the environment.

#3035 fixes drop namespace calls with the DyanmoDB catalog.

#3273 fixes importing Avro files via add_files by correctly setting the number of records.

#3332 fixes importing ORC files with float or double columns in add_files.

浏览 61
点赞
评论
收藏
分享

手机扫一扫分享

分享
举报
评论
图片
表情
推荐
点赞
评论
收藏
分享

手机扫一扫分享

分享
举报