News

With the latest milestone of Project Shapeshift, the real-time analytics database is morphing into a more versatile product, thanks to the addition of a multi-stage query engine. With more than 1,000 ...
this open source query engine is reminiscent of Apache Drill in that it’s source-agnostic. It can query both Hive and Cassandra using ANSI SQL commands, and developers can extend the system by ...
Cloudera, a provider of Apache Hadoop solutions for the enterprise, recently announced the general availability of Cloudera Impala, its open-source, interactive SQL query engine for analyzing data ...
A database that uses Apache DataFusion, a distributed SQL query engine, will be even more effective. DataFusion is an open source project that allows users to efficiently query data within ...
Aiming to eliminate a number of onerous data engineering tasks, MapR today updated its distribution of Hadoop to include Apache Drill 0.5. Drill is an open source distributed ANSI SQL query engine ...
Impala, part of Cloudera Distribution Including Apache Hadoop (CDH) 4.1, is a native SQL query engine that runs on Hadoop clusters, providing easy query access to raw HDFS data and to HBase databases.
PySpark is a popular open-source technology for using the Python language with the Apache Spark query engine. “The two languages that are really important for data engineers are SQL, of course ...
The PrestoSQL query engine was itself rebranded as Trino ... the industry’s first Big Data analytic platform natively integrating SQL and Apache Hadoop in 2010. The company was acquired by ...
Confluent today unveiled KSQL, a SQL engine for Apache Kafka designed to enable users ... “With KSQL, now users can query Apache Kafka topics directly without dumping data into intermediate databases ...
Trino is a highly parallel, open-source distributed SQL query engine designed to perform interactive ... OpenSearch, MongoDB and Apache Kafka. Sundstrom described additional refinements currently ...