Sumit Pal
Independent Consultant (Architect, Developer) Big Data & Data Science, USA
Title: SQL On Big Data – Technology, Architecture and Innovations
Biography
Abstract
With the rapid adoption of Hadoop in the enterprise, it has become increasingly important to build SQL engines on Hadoop for all kinds of workloads and for almost all kinds of end users and use cases. From low-latency analytic SQL, to ACID semantics on Hadoop for operational systems, to SQL for handling unstructured and streaming data, SQL is fast becoming the lingua franca of the big data world too. The talk focuses on the tools, technologies, and innovations in this space, their underlying architectures, and the road ahead. This is a fiercely competitive landscape, with vendors and innovators trying to capture mindshare and a piece of the pie through a whole suite of innovations, ranging from index-based SQL solutions on Hadoop, to OLAP with Apache Kylin and Tajo, to BlinkDB and MapD.
Topics:
- Why SQL on Hadoop
- Challenges of SQL on Hadoop
- SQL on Hadoop Architectures for Low-Latency Analytics (Drill, Impala, Presto, SparkSQL, JethroData)
- SQL on Hadoop Architecture for Semi-Structured Data
- SQL on Hadoop Architecture for Streaming Data and Operational Analytics
- Innovations (OLAP on Hadoop, Probabilistic SQL Engines, GPU-Based SQL Solutions)