Professional Documents
Culture Documents
Lesson 1
Objective Slide
After completing
this course, you will
be able to:
Analytics
Analytics is the science of analysis whereby
statistics, data mining, computer
technology, etc. is used in doing analysis
Analysis
Analysis is the process of breaking down a
complex object into its simpler forms
What is Analytics?
Its the science of wisely acquiring meaningful results from given data using various methods and
technologies.
Aims at discovering pattern of variation from the given data.
It helps to understand the future from past data and the uncertainty related to business.
Its a sophisticated process that uses statistics, mathematics and economics models to predict the
future and prescribe strategies.
Gather Data
Organize Data
Analyze Data
Analytics Stages
How many
students
dropped out last
year?
Descriptive
Which students
are most likely to
drop out?
Which students
should I target to
keep from
dropping out?
Diagnostic
Predictive
Prescriptive
Information
Decision
Insights
Popular Tools
R
Revolution R
R Studio
Tableau
SAP HANA
Weka
KXEN
SAS
Copyright 2014, Simplilearn, All rights reserved.
DATA PREPARING
DELIVER RESULTS
MODEL PLANNING
MODEL BUILDING
Problem Definition
WHAT IS IT NOT ?
Types of Data
Quantitative Data
Summarizing Data
Marital Status
Frequency
Single
203
Married
2,580
Widowed
334
Divorced
367
Separated
46
Total
3,530
Summarizing Data
Numeric - Descriptive
Mean
Median
Mode
Categorical - Descriptive
Numeric - Graphical
Box plot
Categorical - Graphical
Bar charts
Histograms
Data Collection
Experiment
Census
Questionnaire
Survey
Reporting
Registration
Data Sources
Data Dictionary
A Data Dictionary is a file that describes the structure of the database itself.
Number of records
Name of each field
Characteristic of each field
Description of each field
Relationships between different fields
It helps in analyzing different data variables and their relationships between each other.
Retention
Exclusion
Other treatment methods
Mark (Percentage)
Outlier Treatment
Outlier!
Summary
Here is a quick
recap of what we
have learned in this
lesson
What is analytics and analysis, and what are the differences between them
Quiz
QUIZ
a.
Surveys
b. Interviews
c.
Data Sources
d.
Experiments
QUIZ
a.
Surveys
b. Interviews
c.
Data Sources
d.
Experiments
Answer: c.
Explanation: Surveys, Interviews and Experiments are personally conducted by the
researchers, and hence belong to primary data collection methods. Data sources are already
existing sources of data thus belongs to secondary methods.
Copyright 2014, Simplilearn, All rights reserved.
QUIZ
2
a.
Number of records
b. Characteristic of fields
c.
Type of fields
d.
Actual records
QUIZ
2
a.
Number of records
b. Characteristic of fields
c.
Type of fields
d.
Actual records
Answer: d.
Explanation: Data dictionary refers to the meta data, i.e., defining the attributes of the data.
It does not contain the actual data.
Copyright 2014, Simplilearn, All rights reserved.
QUIZ
3
a.
Mean
b. Frequency distribution
c.
Median
d.
Mode
QUIZ
3
a.
Mean
b. Frequency distribution
c.
Median
d.
Mode
Answer: b.
Explanation: Mean, median and mode are mathematical summaries of numeric or
quantitative data. Frequency distribution is used to summarize categorical or qualitative
data.
Copyright 2014, Simplilearn, All rights reserved.
QUIZ
4
a.
Discovery
b. Deliver results
c.
Model building
d.
Re-checking
QUIZ
4
a.
Discovery
b. Deliver results
c.
Model building
d.
Re-checking
Answer: d.
Explanation: Re-checking is not a step in data analytics methodology.
QUIZ
5
a.
d.
QUIZ
5
a.
d.
Answer: a.
Explanation: Data collection methods are classified into primary and secondary
QUIZ
6
a.
Prescriptive
b. Predictive
c.
Descriptive
d.
Productive
QUIZ
6
a.
Prescriptive
b. Predictive
c.
Descriptive
d.
Productive
Answer: d.
Explanation: Productive is not a step in analytics.
QUIZ
7
Which of the following is FALSE with reference to the role of a data scientist?
a.
d.
QUIZ
7
Which of the following is FALSE with reference to the role of a data scientist?
a.
d.
Answer: c.
Explanation: Data scientist needs to consider statistical algorithm working process.
Thank You