Apache Tajo v0.9 发布,此版本目标是优化传统 SQL 性能,改进 Tajo leading-edge 原生 SQL 支持;提高查询速度。 Apache Tajo v0.9 改进如下: - More comprehensive and powerful SQL capabilities, such as TIMESTAMP, DATE, TIME, and INTERVAL type support, as well as WINDOW functions, OVER clause support, and multiple distinct aggregation; - Performance improvements, such as offheap sort algorithm for ORDER BY and Runtime code generation for evaluating expressions push the boundaries of massive data query speeds; - Improvements to the hash shuffle I/O, boosting bottom-line speeds by 200-300% on "heavy", complex queries; - Enhanced Hadoop integration, including support for Hadoop 2.2.0 up to Hadoop 2.5.1, and expanded Hive Metastore access; - Improved catalog backup and restore feature, as well as accessibility enhancements streamline performance across disparate technology environments.
Tajo 是一个分布式数据仓库系统,基于 Hadoop 实现,特点是低延迟、高可伸缩,提供专用查询和 ETL 工具 特点:
|