🚀 Sign up for the bi-weekly newsletter

Join over 2000 recruiters and sourcers from around the world.


A modern, massively distributed SQL query engine for Apache Hadoop. It allows you to analyze, transform and combine data from a variety of data sources. With Impala, you can query data, whether stored in HDFS or HBase, in real time. 

Learn more

First released 2013
Developed by Apache Software Foundation
Latest stable version Impala 4
Open-source Yes
Used by Cloudera, Amazon, Oracle, MapR

Interesting facts

Impala is a query engine which is integrated into the Hadoop environment and utilizes a number of standard Hadoop components (Metastore, HDFS, HBase, YARN, Sentry) in order to deliver an RDBMS-like experience. 


Development by Synergize.digital

Sign up for updates
straight to your inbox