Cask Blog

How we built it: designing a globally consistent transaction engine

At Cask, we are committed to contributing back to the open source community. One of our latest open-sourced projects is Tephra, a system that adds complete transaction support to Apache HBase™. As an XA-style transaction system, Tephra is designed to be agnostic to the underlying data stores, so its usage is not limited to HBase. … Read more


Strata + Hadoop World NYC 2014 Recap: Four Trends in Hadoop

The Cask team had a great and productive time at Strata + Hadoop World earlier this month in New York City! We are very optimistic about the robust growth in Hadoop adoption, increased participation from a broad range of developers and companies in many industries, and continued maturation in the early days of this technology. As … Read more


The Shift to Realtime Processing: An Easier Way to Build Hadoop Apps

I was inspired by the recent Google I/O talk on Cloud Dataflow, a data processing service used internally at Google, which evolved from a model based on MapReduce and successor stream processing technologies such as MillWheel and FlumeJava. Based on the premise of focusing on your application logic rather than the underlying infrastructure, I set … Read more


How we built it: Making Hadoop data exploration easier with Ad-hoc SQL Queries

Please note: Continuuity is now known as Cask, and Continuuity Reactor is now known as the Cask Data Application Platform (CDAP). We are excited to introduce a new feature added in the latest 2.3 release of Continuuity Reactor – ad-hoc querying of Datasets. Datasets are high-level abstractions over common data patterns. Reactor Datasets provide a … Read more


Meet Tephra, An Open Source Transaction Engine

Please note: Continuuity is now known as Cask, and Continuuity Reactor is now known as the Cask Data Application Platform (CDAP). Our platform, Continuuity Reactor, uses several open source technologies in the Apache HadoopTM ecosystem to enable any developer to build data applications. One of the major components of our platform is Apache HBase, a … Read more


Behind the scenes: Hacking our way to success

Please note: Continuuity is now known as Cask, and Continuuity Reactor is now known as the Cask Data Application Platform (CDAP). Building a platform that no one has created before is a big challenge. We break this huge effort into a continuous cadence of platform releases that are delivered to production frequently. Before every release … Read more



Continuuity & AT&T Labs to Open Source Real-Time Data Processing Framework

Please note: Continuuity is now known as Cask, and Continuuity Reactor is now known as the Cask Data Application Platform (CDAP). Also, jetStream is now known as Tigon. Why are we combining our technologies?Today we announced an exciting collaborative effort with AT&T Labs that will facilitate the integration of Continuuity BigFlow, our distributed framework for … Read more


HBaseCon: Moving Beyond the Core to Address Availability & Usability

Please note: Continuuity is now known as Cask, and Continuuity Reactor is now known as the Cask Data Application Platform (CDAP). We just wrapped HBaseCon 2014, the annual event for Apache HBase™ contributors, developers, and users. As in years past, this is one of the most technical conferences that we attend, and it’s really focused … Read more