Cask Blog

Building a Data Lake on Google Cloud Platform with CDAP

It is no secret that traditional platforms for data analysis, like data warehouses, are difficult and expensive to scale, to meet the current data demands for storage and compute. And purpose-built platforms designed to process big data often require significant up-front and on-going investment if deployed on-premise. Alternatively, cloud computing is the perfect vehicle to … Read more

Better collaboration and productivity: Cloud-based software development at Cask

Over the last few years, the popularity of cloud-based software development has risen dramatically, along with the need for sharing development assets and resources within and across organizations. Containers and open source have simplified the sharing and cloning of code and entire dev/test environments, taking efficiency, collaboration and productivity of product engineering organization to new … Read more

The Shift to Realtime Processing: An Easier Way to Build Hadoop Apps

I was inspired by the recent Google I/O talk on Cloud Dataflow, a data processing service used internally at Google, which evolved from a model based on MapReduce and successor stream processing technologies such as MillWheel and FlumeJava. Based on the premise of focusing on your application logic rather than the underlying infrastructure, I set … Read more

Meet Tephra, An Open Source Transaction Engine

Please note: Continuuity is now known as Cask, and Continuuity Reactor is now known as the Cask Data Application Platform (CDAP). Our platform, Continuuity Reactor, uses several open source technologies in the Apache HadoopTM ecosystem to enable any developer to build data applications. One of the major components of our platform is Apache HBase, a … Read more