Cask Blog

Building a Data Lake on Google Cloud Platform with CDAP

It is no secret that traditional platforms for data analysis, like data warehouses, are difficult and expensive to scale to meet current demands for storage and compute. And purpose-built platforms designed to process big data often require significant up-front and ongoing investment if deployed on-premises. Alternatively, cloud computing is the perfect vehicle to … Read more


Overcoming Big Data Integration Challenges

Enterprise challenges: Hadoop has emerged as the leading technology to solve a number of big data use cases. However, enterprises needing to solve their business problems often need to piece together different technologies to build a solution. Each component in the Hadoop technology stack is infrastructure-focused and purpose-built to solve a unique set of … Read more


Integrating CDAP with Microsoft Azure HDInsight

We recently announced the integration of CDAP with the Microsoft Azure HDInsight platform. This post will give a behind-the-scenes look at this integration. First, a bit about the integration itself. Azure HDInsight is an Apache Hadoop and Spark distribution powered by the cloud. This means that it handles any amount of data, scaling from terabytes … Read more


Powering BI with ODBC Connectors for CDAP

Bhooshan Mogal

Open Database Connectivity (ODBC) is the de facto standard API for accessing data stored in relational databases. ODBC drivers allow applications across a variety of platforms (especially non-Java) to access relational databases in a manner independent of the implementation and the operating system. In this blog post, we will discuss the integration between CDAP Datasets and Tableau … Read more