Cloudera: Impala’s it for interactive SQL on Hadoop; everything else will move to Spark


Despite some speculation over the past few days about what it means that Cloudera wants to port the Hive SQL-on-Hadoop engine onto the Spark processing framework, Cloudera Co-founder and Chief Strategy Officer Mike Olson (pictured above) says nothing much has changed. Well, nothing has changed with regard to Cloudera’s Impala product, that is. There’s actually quite a bit happening elsewhere in the Hadoop and Spark ecosystems.

Simply put, Olson said Impala is the future of interactive SQL queries on top of Hadoop as far as Cloudera is concerned. “Impala is flat-out faster than the fastest thing Hortonworks or anyone else has ever done with Hive,” he said.

Cloudera — along with IBM, MapR and spark startup Databricks — is working to port Hive onto Spark as an acknowledgement that Hive workloads are still very important to the company’s customer base and that “running on MapReduce, Hive really, really sucks.” But, Olson added, Hive was built…

View original post 276 more words


Leave a Reply

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out /  Change )

Google+ photo

You are commenting using your Google+ account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )


Connecting to %s