
Big Data

Nov 15, 2015


navin

talent sprint
  • Programming for Big Data and Analytics

    Develop Real-World Big Data and Analytics Applications

    Earn High Value Certificates | Get 365 Days Placement Assistance

    Enter the Exciting World of Big Data

    In Partnership with

  • What is Big Data?

    Big Data consists of an exploding world of structured and unstructured data

    sets that are beyond the ability of conventional databases and programming

    tools to process and analyze. Big Data is characterized by far greater Volume,

    Variety and Velocity than traditional data.

    Big Data and Analytics refers to the process of collecting, organizing,

    transforming, analyzing, and presenting large sets of data to discover useful

    patterns, correlations and insights.

    Why Learn Big Data and Analytics?

    We live in a digital era, defined by information abundance and growing

    complexity. Every interaction with businesses is now digitized, waiting for a

    clever algorithm for analysis. The potential of analytics to increase efficiency

    or forecast future probabilities is tremendous. Businesses and governments

    are taking advantage of these new data-focused tools and techniques to

    improve organizational efficiency and gain a competitive advantage.

    With all this comes the demand for new talent for Big Data. Many companies

    are now looking for IT professionals with Big Data programming skills who

    are able to identify, collect, analyze, interpret and transform data to drive

    value and innovation for the organization. Programming for Big Data and

    Analytics is a highly innovative program designed to create IT professionals

    with the skills required to manage and analyze Big Data.

    Why Join this Particular Program?

    Big Data professionals earn twice as much as other IT professionals

    Co-designed by Big Data experts at TalentSprint and Gramener

    5 real-time case studies and 1 project from Gramener

    Joint certificate from TalentSprint and Gramener

    MOOC certificates on Big Data from top US universities

    365 days placement assistance after the program

    Program Details

    Duration: 300-hour program in a 3-month full-time format or a 6-month

    part-time format

    PROGRAM MENTORS

    Asokan Pichai

    Senior Vice President, TalentSprint

    Prabhu Ramachandran

    Professor, IIT Bombay

    S. Anand

    Chief Data Scientist, Gramener

  • Core Language and Libraries

    Use the standard library and development

    tools setup to write, execute, and

    troubleshoot Python programs for regular

    data processing tasks.

    Data Types and objects

    Control Flow

    Reading and writing data

    Modules and Namespaces

    Advanced constructs: OOP, FP

    Who Should take this Course?

    Professionals returning from a career break

    Professionals looking for high-growth careers

    Engineering graduates / students looking for a differentiated career

    Background Statistics

    Brush up on the core concepts of statistics

    and learn how to carry out statistical

    computations using Python.

    Mean, Median, Mode, ANOVA

    scipy
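
    As a flavour of the computations this module covers, the sketch below computes the mean, median and mode of a small sample. It is only an illustration: it uses Python's built-in statistics module rather than scipy, the order counts are made-up data, and ANOVA is not shown.

```python
import statistics

# Hypothetical sample: daily order counts for one week (illustrative data only).
orders = [12, 15, 12, 18, 20, 12, 16]

mean_orders = statistics.mean(orders)      # arithmetic average
median_orders = statistics.median(orders)  # middle value of the sorted sample
mode_orders = statistics.mode(orders)      # most frequent value

print("mean:", mean_orders)
print("median:", median_orders)
print("mode:", mode_orders)
```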

    Data Exploration and Analysis

    Learn how to perform statistical analysis

    using specialized tools and packages to

    explore data and extract meaningful

    insights.

    Using pandas

    Overview of R

    scipy.stats

    Data Transformation

    Gain in-depth experience in using pandas

    for data manipulation and transformation.

    Data munging with pandas

    Short case studies

    Normalization and outlier removal
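
    To give an idea of what normalization and outlier removal involve, here is a minimal sketch. It assumes two common choices that the brochure does not specify: Tukey's interquartile-range rule for outliers and min-max scaling for normalization. It uses only the standard library (Python 3.8 or later) instead of pandas, and the readings and function names are hypothetical.

```python
import statistics


def remove_outliers(values, k=1.5):
    """Drop values outside [Q1 - k*IQR, Q3 + k*IQR] (Tukey's rule)."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if low <= v <= high]


def min_max_normalize(values):
    """Rescale values linearly so the smallest maps to 0 and the largest to 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values are equal, so there is no spread to rescale.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


# Hypothetical sensor readings with one obviously bad value.
readings = [10, 12, 11, 13, 12, 11, 95]

cleaned = remove_outliers(readings)
scaled = min_max_normalize(cleaned)

print(cleaned)
print(scaled)
```

    Removing outliers before scaling matters here: a single extreme reading would otherwise squeeze all the ordinary values into a narrow band near 0.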

    Distributed Computing and Hadoop

    Learn how to deploy and extract data from

    Hadoop.

    subprocess and multiprocessing

    Hadoop installation/deployment

    Data extraction from Hadoop

    Using pig/hive

    Machine Learning

    Learn about the machine learning

    algorithms and how to use the scikit-learn

    library.

    Survey of Algorithms

    scikit-learn library

    Regression

    Parallel Computing with Spark

    Learn how to manipulate data sets using

    parallel processing with Apache Spark.

    Get an edX certificate from the University of

    California, Berkeley

    Data Visualization

    Learn how to present information and

    results of analysis graphically for best

    assimilation.

    matplotlib in depth

    survey of other tools

    Text Processing

    Learn about patterns and NLTK libraries

    and how to use them for Text Processing

    tasks.

    Course Project

    Work on a real-time project from

    Gramener's extensive library of live case

    studies and big data sets!

    Program Outline

    Get a MOOC Certificate!

    Setting up the Tool Chain

    Understand why Python is the toolbox of

    choice for data science and how each tool

    fits into the data science workflow. Set up a

    working Python-based environment on

    your machine and in the cloud.

    Version control: git, github

    IPython, notebook

    Editors and IDEs

    Data Collection and Cleansing

    Learn to load data from common sources,

    such as structured text files, web pages

    and SQL databases. Use some standard

    tools to clean and prepare data for

    analysis.

    Reading and Writing CSV files

    Reading and Writing SQL

    lxml, Beautiful Soup and requests
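
    The kind of load, clean and save cycle described in this module might look like the sketch below. It is an illustration under stated assumptions, not course material: the inline CSV text, the column names and the cleaning rules (trim whitespace, skip rows without a name, default a missing city to "Unknown") are all hypothetical, and only the standard csv module is used.

```python
import csv
import io

# Hypothetical raw data with stray whitespace and missing fields.
raw = "name,city,score\n Asha ,Hyderabad,82\nRavi,,91\n,Chennai,77\n"


def clean_rows(text):
    """Parse CSV text and return a list of cleaned row dictionaries."""
    reader = csv.DictReader(io.StringIO(text))
    cleaned = []
    for row in reader:
        # Trim surrounding whitespace from every field.
        row = {key: value.strip() for key, value in row.items()}
        if not row["name"]:
            continue  # a row without a name cannot be used
        row["city"] = row["city"] or "Unknown"  # fill a missing city
        row["score"] = int(row["score"])        # convert text to a number
        cleaned.append(row)
    return cleaned


rows = clean_rows(raw)

# Write the cleaned rows back out as CSV (to memory here, not to a file).
out = io.StringIO()
writer = csv.DictWriter(out, fieldnames=["name", "city", "score"])
writer.writeheader()
writer.writerows(rows)

print(out.getvalue())
```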

  • TalentSprint Awards

    Best Non-Corporate Performer 2012

    Best Performing Partner Award 2013

    FICCI LeapVault Skills Champion Roll of Honour 2012

    TalentSprint Professional Affiliations

    Corporate Office

    Hyderabad | Bangalore | Chennai | Coimbatore | Visakhapatnam

    www.talentsprint.com

    TalentSprint Pvt Ltd

    Block A, IIIT Campus, Gachibowli, Hyderabad - 500 032

    CIN: U80902TG2008PTC062284

    email : [email protected]

    About TalentSprint

    TalentSprint is a leader in professional skill development and integrated talent management for the

    Information Technology and Banking sectors. Funded by NSDC and Nexus Ventures, TalentSprint

    has embarked on an ambitious mission to skill ONE MILLION young people by 2020. The company

    partners with more than 250 employers and 150 colleges, and has skilled 50,000 young job-seekers

    since its inception. Our trainees are regularly recruited by major multinational IT firms that include

    Accenture, Genpact, Deloitte, Capgemini, Virtusa, ADP, Wells Fargo, Tech Mahindra, Cognizant, CSC,

    Value Labs, HSBC, Cyient, Broadridge, Renault Nissan, CA Technologies, Polaris and Invesco, to name a

    few. TalentSprint conducts industry-linked skill programs across multiple channels such as skill

    centers, online, and college campuses.

    About Gramener

    Gramener is a Data Visualization and Analytics company. Its proprietary platform, Gramex

    Visualization, handles large-scale data via programmatic analysis and visualizes it in real-time. The

    company helps clients unlock hidden insights from data and, using cutting-edge visualizations,

    develops foresight for critical business decisions. Gramener works with Global Fortune 100

    customers in a quick, non-intrusive manner to condense large amounts of data from heterogeneous

    sources and convert these findings into intuitive visual representations. Gramener customers are

    spread across various domains including Telecom, Manufacturing, Financial, Pharmaceuticals,

    Media, Utilities, Airlines, Retail, Education and Government sectors. The company has its offices in

    Hyderabad, Bangalore, and Coimbatore. For more information, please visit www.gramener.com.
