Introducing Tigon: Real-time streaming for the real world

In collaboration with AT&T Labs, today we are releasing version 0.2.0 of the open source Tigon project, a real-time streaming analytics framework for Hadoop based on technology contributed by both companies. By combining AT&T’s low-latency and declarative language support with our durable, high-throughput computing capabilities and procedural language support, Tigon provides developers with a new … Read more

Meet Coopr Cloud: Clusters with a click (in the cloud)

Because testing against real clusters is such a critical and frequent component of the engineering development cycle, we originally built Coopr to address our own needs for a fast, self-service way to provision clusters on our own hardware and in public clouds. When we open sourced the technology earlier this year, our vision was to … Read more

The Shift to Realtime Processing: An Easier Way to Build Hadoop Apps

I was inspired by the recent Google I/O talk on Cloud Dataflow, a data processing service used internally at Google, which evolved from a model based on MapReduce and successor stream processing technologies such as MillWheel and FlumeJava. Based on the premise of focusing on your application logic rather than the underlying infrastructure, I set … Read more

We are now Cask

Today we officially announced that Continuuity is changing our name to Cask. Since founding the company in 2011, we have been focused on providing developers in the Hadoop ecosystem with simple access to powerful technology. While we began with a proprietary software development and licensing model, we understood from the beginning that reaching developers and … Read more

Why I joined Cask

Just a few months ago, I was seated next to Jon Gray, CEO of Cask, at a dinner I attended thanks to a very gracious invitation from Mike Olson at Cloudera. In my final two years at Intel, I helped create the Datacenter Software Division. That group was a startup inside Intel, and by the … Read more

How we built it: Making Hadoop data exploration easier with Ad-hoc SQL Queries

We are excited to introduce a new feature added in the latest 2.3 release of Continuuity Reactor – ad-hoc querying of Datasets. Datasets are high-level abstractions over common data patterns. Reactor Datasets provide a consistent view of your data whether processed in batch or real-time. In addition to scans and RPCs of Datasets, there is … Read more

Continuuity Reactor 2.3: SQL and Security Release

The Continuuity Reactor platform is designed to make it easy for developers to build and manage data applications on Apache Hadoop™ and Apache HBase™. Every day we’re passionately focused on delivering an awesome experience for all developers, with or without Hadoop expertise. And today, we’re excited to release the next version of our platform, Continuuity … Read more

Meet Tephra, An Open Source Transaction Engine

Our platform, Continuuity Reactor, uses several open source technologies in the Apache Hadoop™ ecosystem to enable any developer to build data applications. One of the major components of our platform is Apache HBase, a non-relational, massively scalable column-oriented database modeled after Google’s BigTable. We use HBase for a number of reasons, including the strong data … Read more

Hadoop Summit: Where is the value? Where are the apps?

Coming out of Hadoop Summit, one thing is clear to me – while there has been significant growth and success of the ecosystem, it is still early days and Hadoop is still exceptionally hard to consume for most organizations. As a result of this persistent issue, there weren’t many major announcements, nothing exceptionally new or … Read more