Cask Blog

Building a Data Lake on Google Cloud Platform with CDAP

It is no secret that traditional platforms for data analysis, like data warehouses, are difficult and expensive to scale, to meet the current data demands for storage and compute. And purpose-built platforms designed to process big data often require significant up-front and on-going investment if deployed on-premise. Alternatively, cloud computing is the perfect vehicle to … Read more



Better collaboration and productivity: Cloud-based software development at Cask

Over the last few years, the popularity of cloud-based software development has risen dramatically, along with the need for sharing development assets and resources within and across organizations. Containers and open source have simplified the sharing and cloning of code and entire dev/test environments, taking efficiency, collaboration and productivity of product engineering organization to new … Read more



Meet the Cask Team at Strata Data Conference in New York, Sept 26-28, 2017

Tatiana Staffaroni

For many years Strata Data Conference has been attracting thousands of people interested in big data tools and technologies, and each year our team looks forward to meeting with our customers, partners and collaborators at this great event. Once again, Cask will be an Exabyte sponsor and we look forward to showcasing our latest technology … Read more


Event Based Triggers for CDAP Pipelines

Bhooshan Mogal

Data Engineering groups in large enterprises are typically decentralized. Teams develop specialized skill sets in particular areas of data processing, and have specific charters. For example, a team may be responsible for data acquisition. Another may be responsible for cleansing, transforming, normalizing and analyzing data. Another team of data scientists may be responsible for consuming … Read more


Announcing GA Release of CDAP 4.3 – Use Cases, Features and Capabilities

We would like to thank all our users and customers for the great conversations we have had around use cases, the challenges you face with operationalizing a data lake and/or building data analytics solutions, and your candid feedback on CDAP usability. These interactions are invaluable and we always love hearing from you. You have offered … Read more


Overcoming Big Data Integration Challenges

Enterprise challenges Hadoop has emerged as the leading technology to solve a number of big data use cases. However, enterprises needing to solve their business problems often need to piece together different technologies to build a solution. Each component in the Hadoop technology stack is infrastructure focused and purpose-built to solve a unique set of … Read more


Introducing CDAP Cloud Sandbox for Microsoft Azure

Cask Data Application Platform (CDAP) is a platform-agnostic unified integration platform that allows users to run, manage and deploy big data applications independent of distros on-premises, in the cloud or in a hybrid environment. We recently announced the availability of Cloud Sandbox for AWS,  and in order to continue to give our customers and users … Read more