Monday, June 27, 2016

Zeppelin


Apache Zeppelin is an open source GUI which creates interactive and collaborative notebooks for data exploration using Spark. You can use Scala, Python, SQL (using Spark SQL), or HiveQL to manipulate data and quickly visualize results.

Zeppelin notebooks can be shared among several users, and visualizations can be published to external dashboards. Zeppelin uses the Spark settings on your cluster and can use Spark’s dynamic allocation of executors to let YARN estimate the optimal resource consumption.

To run the prediction analysis, you need to create notebooks that generate prediction % and are scheduled to run daily. As part of the prediction analysis, we needed to connect to multiple data sources, like MySQL and Vertica for data ingestion and error rate generation. This enabled us to aggregate data across multiple dimensions, thus exposing underlying issues and anomalies at a glance.

Using Zeppelin, we applied many A/B models by replaying our raw data in AWS S3 to generate different prediction reports, which in turn helped us move in the right direction and provide better forecasting.

Zeppelin helps us to turn the huge amounts of raw data, often from across different data stores, into consumable information with useful insights.

Slide share reference is available at http://www.slideshare.net/prajods/big-data-visualization-with-apache-spark-and-zeppelin

7 comments:

  1. The Information Give You In The Blog Is Very Good. Thank You So Much For Sharing.
    DATA ANALYTICS COURSE NEAR ME
    MASTERS IN DATA SCIENCE AND ARTIFICIAL INTELLIGENCE IN CHENNAI
    MSC ARTIFICIAL INTELLIGENCE AND DATA SCIENCE ONLINE
    MSC ARTIFICIAL INTELLIGENCE AND DATA SCIENCE ONLINE
    ARTIFICIAL INTELLIGENCE POSTGRADUATE COURSES
    MASTER OF SCIENCE IN DATA SCIENCE AND ARTIFICIAL INTELLIGENCE
    MS IN AI AND DATA SCIENCE
    DATA VISUALIZATION USING POWER BI
    BEST DATA ANALYTICS COURSES ONLINE
    CERTIFICATION IN BUSINESS ANALYTICS NEAR ME
    PG COURSES IN ARTIFICIAL INTELLIGENCE NEAR ME
    POST GRADUATE PROGRAM IN ARTIFICIAL INTELLIGENCE IN CHENNAI
    TOP ARTIFICIAL INTELLIGENCE MASTERS PROGRAMS IN CHENNAI
    DATA VISUALISATION TOOLS POWER BI
    TABLEAU STEP BY STEP FOR BEGINNERS
    TABLEAU FOR DATA SCIENCE AND DATA VISUALIZATION
    PYTHON DATA SCIENCE PROJECTS FOR BEGINNERS IN CHENNAI
    BEST PYTHON FOR DATA SCIENCE COURSES
    PYTHON DATA SCIENCE ONLINE COURSE
    DATA ANALYTICS COURSE ONLINE WITH PLACEMENT
    BEST BUSINESS ANALYTICS CERTIFICATION COURSE IN CHENNAI
    BEST ARTIFICIAL INTELLIGENCE GRADUATE PROGRAMS IN CHENNAI
    DATA SCIENCE & PYTHON CLASSES IN CHENNAI
    PREDICTIVE ANALYTICS ONLINE COURSE
    ONLINE MASTERS DEGREE ARTIFICIAL INTELLIGENCE
    ARTIFICIAL INTELLIGENCE MASTERS DEGREE ONLINE
    DATA VISUALISATION IN POWER BI
    DATA VISUALIZATION TOOLS FOR DATA SCIENCE
    SIMPLE DATA SCIENCE PROJECT USING PYTHON IN CHENNAI
    PYTHON DATA SCIENCE REAL TIME PROJECTS IN CHENNAI
    DATA VISUALIZATION ONLINE COURSE
    PG COURSES IN ARTIFICIAL INTELLIGENCE IN CHENNAI
    BEST DATA ANALYTICS COURSE ONLINE
    ARTIFICIAL INTELLIGENCE COURSE IN CHENNAI
    ONLINE VISUAL TABLEAU
    HOW TO VISUALIZE DATA USING POWER BI?
    DATA VISUALIZATION WITH TABLEAU SPECIALIZATION
    DATA VISUALIZATION USING TABLEAU
    DATA VISUALIZATION IN TABLEAU

    ReplyDelete