By David Gruzman, on June 18th, 2014

Multi-engine data processing

There is a lot of criticism of HDFS – it is slow, it has SPOF, it is read only, etc. All of the above is true. Systems built on top of a local file system are more efficient than those built on top of HDFS (like Cassandra vs. HBase). That is also true. However, . . . → Read More: Multi-engine data processing

By Constantine Peresypkin, on September 8th, 2012

Hadoop on OpenStack Swift: experiments

Some time has passed since our initial post on Hadoop over OpenStack Swift implementation. A couple of things have changed (Rackspace finally implemented range requests in their Cloudfiles library) others remained the same (still no built-in support for Hadoop in OpenStack / CloudFiles).

We got a lot of feedback and questions regarding the integration . . . → Read More: Hadoop on OpenStack Swift: experiments

By Camuel Gilyadov, on September 4th, 2012

Progress on Apache Drill

We are continuing our efforts in contributing our OpenDremel code to Apache Drill project and look forward to be active with it right after that.

Right now the efforts are being put into our ANTLR-based parser, we want to make it work with the new grammar of BigQuery language. That should be done within . . . → Read More: Apache Drill Progress

By Camuel Gilyadov, on February 7th, 2012

OpenDremel update and Dremel vs. Tenzing

I wasn’t blogged for whole 2011 year… I’m not dead, quite on contrary, we were pretty active with OpenDremel project in 2011. First, we are renaming it to¬†Dazo to avoid using a trademarked name and second, we did a good job implementing a secure generic execution engine¬†and integrating it into OpenStack Swift. It also . . . → Read More: OpenDremel update and Dremel vs. Tenzing