Programming for Big Data and Analytics Develop Real-World Big Data and Analytics Applications Earn High Value Certificates | Get 365 Days Placement Assistance Enter the Exciting World of Big Data In Partnership with
Programming for Big Data and Analytics
Develop Real-World Big Data and Analytics Applications
Earn High Value Certificates | Get 365 Days Placement Assistance
Enter the Exciting World of Big Data
In Partnership with
What is Big Data?
Big Data consists of an exploding world of structured and unstructured data
sets that are beyond the ability of conventional databases and programming
tools to process and analyze. Big Data is characterized by far greater Volume,
Variety and Velocity than traditional data.
Big Data and Analytics refers to the process of collecting, organizing,
transforming, analyzing, and presenting large sets of data to discover useful
patterns, correlations and insights.
Why Learn Big Data and Analytics?
We live in a digital era, defined by information abundance and growing
complexity. Every interaction with businesses is now digitized, waiting for a
clever algorithm for analysis. The potential of analytics to increase efficiency
or forecast future probabilities is tremendous. Businesses and governments
are taking advantage of these new data-focused tools and techniques to
improve organizational efficiency and gain a competitive advantage.
With all this comes the demand for new talent for Big Data. Many companies
are now looking for IT professionals with Big Data programming skills who
are able to identify, collect, analyze, interpret and transform data to drive
value and innovation for the organization. Programming for Big Data and
Analytics is a highly innovative program designed to create IT professionals
with the skills required to manage and analyze Big Data.
Why Join this Particular Program?
Big Data professionals earn twice as much as other IT professionals
Co-designed by Big Data experts at TalentSprint and Gramener
5 real-time case studies and 1 project from Gramener
Joint certificate from TalentSprint and Gramener
MOOC certificates on Big Data from top US universities
365 days placement assistance after the program
Program Details
Duration : 300 hour program in 3-month full time format or 6-month part
time format
PROGRAM MENTORS
Asokan Pichai
Senior Vice President, TalentSprint
Prabhu Ramachandran
Professor, IIT Bombay
S. Anand
Chief Data Scientist, Gramener
Core Language and Libraries
Use the standard library and development
tools setup to write, execute, and
troubleshoot Python programs for regular
data processing tasks.
Data Types and objects
Control Flow
Reading and writing data
Modules and Namespaces
Advanced constructs: OOP, FP
Who Should take this Course?
Professionals returning from a career break
Professionals looking for high-growth careers
Engineering graduates / students looking for a differentiated career
Background Statistics
Brush up on the core concepts of statistics
and learn how to carry out statistical
computations using Python.
Mean, Median, Mode, ANOVA
scipy
Data Exploration and Analysis
Learn how to perform statistical analysis
using specialized tools and packages to
explore data and extract meaningful
insights.
Using pandas
Overview of R
scipy.stats
Data Transformation
Gain in-depth experience in using pandas
for data manipulation and transformation.
Data munging with pandas
Short case studies
Normalization and outlier removal
Distr ibuted Computing and
Hadoop
Learn how to deploy and extract data from
Hadoop.
subprocess and multiprocessing
Hadoop installation/deployment
Data extraction from Hadoop
Using pig/hive
Machine Learning
Learn about the machine learning
algorithms and how to use the scikit-learn
library.
Survey of Algorithms
scikit-learn library
Regression
Parallel Computing with Spark
Learn how to manipulate data sets using
parallel processing with Apache Spark.
Get edX certificate University of
California, Berkeley
Data Visualization
Learn how to present information and
results of analysis graphically for best
assimilation.
matplotlib in depth
survey of other tools
Text Processing
Learn about patterns and NLTK libraries
and how to use them for Text Processing
tasks.
Course Project
Work on a real t ime project from
Gramener's extensive library of live case
studies and big data sets!
Program Outline
Get a MOOC Certificate!
Get a MOOC Certificate!
Setting up the Tool Chain
Understand why Python is the toolbox of
choice for data science and how each tool is
used for data science workflow. Set up a
functional Python based environment in
your machine and cloud for your use.
Version control: git, github
iPython, notebook
Editors and IDEs
Data Collection and Cleansing
Learn to load data from common sources,
such as structured text files, web pages
and SQL databases. Use some standard
tools to clean and prepare data for
analysis.
Reading and Writing CSV files
Reading and Writing SQL
lxml, beautiful soup and requests
Best Non-Corporate
Performer 2012
Best Performing
Partner Award 2013
FICCI LeapVault Skills
Champion Roll of
Honour 2012
TalentSprint Awards
TalentSprint Professional Affiliations
Corporate Office
Hyderabad | Bangalore | Chennai | Coimbatore | Visakhapatnam
www.talentsprint.com
TalentSprint Pvt Ltd
Block A, IIIT Campus, Gachibowli, Hyderabad - 500 032
CIN: U80902TG2008PTC062284
email : [email protected]
About TalentSprint
TalentSprint is a leader in professional skill development and integrated talent management for the
Information Technology and Banking sectors. Funded by NSDC and Nexus Ventures, TalentSprint
has embarked on an ambitious mission to skill ONE MILLION young people by 2020. The company
partners with more than 250 employers and 150 colleges, and has skilled 50,000 young job-seekers
since its inception. Our trainees are regularly recruited by major multinational IT firms that include
Accenture, Genpact, Deloitte, Capgemini, Virtusa, ADP, Wells Fargo, Tech Mahindra, Cognizant, CSC,
Value Labs, HSBC, Cyient, Broadridge, Renault Nissan, CA Technologies, Polaris, Invesco to name a
few. TalentSprint conducts industry-linked skill programs across multiple channels such as skill
centers, online, and college campuses.
About Gramener
Gramener is a Data Visualization and Analytics company. Its proprietary platform, Gramex
Visualization, handles large-scale data via programmatic analysis and visualizes it in real-time. The
company helps clients unlock hidden insights from data, and using cutting-edge visualizations
develops foresight for critical business decisions. Gramener works with Global Fortune 100
customers in a quick, non-intrusive manner to condense large amounts of data from heterogeneous
sources and convert these findings into intuitive visual representations. Gramener customers are
spread across various domains including Telecom, Manufacturing, Financial, Pharmaceuticals,
Media, Utilities, Airlines, Retail, Education and Government sectors. The company has its offices in
Hyderabad, Bangalore, and Coimbatore. For more information, please visit www.gramener.com.
Page 1Page 2Page 3Page 4