Professional Documents
Culture Documents
Business Statistics
Business Process
Value Addition
Input
Input
Input
Output
Mission
Fact-based Decision-Making
at Google
What Data to Use?
{Do managers actually matter?}-HR Case
performance reviews (top down review of managers) & employee survey
(bottom up review of managers).
Review of Managers
6
5
4
3
2
1
0
Category 1
Category 2
Category 3
Category 4
Analytics
Top Quartile
Bottom Quartile
Team productivity-Employee happiness-Employee turnover
5
4.5
4
3.5
3
2.5
2
1.5
1
0.5
0
Category 1
Category 2
Category 3
Category 4
Insights
Top 8 behaviors of a high scoring manager:
Is a good coach
Empowers the team and does not micromanage
Expresses interest / concern for team members success and personal wellbeing
Is productive and results-orientated
Is a good communicator listens and shares information
Helps with career development
Has a clear vision / strategy for the team
Has important technical skills that help him / her advice the team
Insights
In addition to the eight behaviors they have identified for a good
manager, they also narrowed down on the top 3 causes why
managers are struggling in their role:
Has a tough transition (e.g. suddenly promoted, hired from outside with little
training)
Lacks a consistent philosophy / approach to performance management and
career development
Spends too little time on managing and communicating.
Good Decision-Making
Good Data and Facts
1. Defining the objectives and information needs: Do managers matter? and
What makes a good manager within Google?
2. Collecting the right data: using existing data from performance reviews
and employee surveys and creating new data sets from the award
nominations and managers interviews.
3. Analyzing the data and turning it into insights: simply plotting of the
results, regression analysis and text analysis.
4. Presenting the Information: new communications to the managers
5. Making evidence-based decisions: revising the training, measuring
performance in line with the findings, introducing new feedback
mechanisms
Understanding customers
Optimize processes
Create opportunities
Customer Relationship
Management (CRM)
Improve security
Introduction To Statistical
Science
Statistics is the science that relates data to specific questions of interest.
This includes devising methods to gather data relevant to the question
Methods to summarize and display the data to shed light on the question
Methods that enable us to draw answers to the question that are supported
by the data.
Data
Data almost always contain uncertainty.
This uncertainty may arise from selection of the items to be measured.
It may arise from variability of the measurement process.
Statistical Inferences
Methods and tools for taking decision despite the uncertainty in the data.
Is the basis for increasing knowledge about the world.
Is the basis for all rational scientific inquiry.
Z
Our first goal is to determine which of the possible reasons for the association
holds. If we conclude that it is due to a causal effect, then our next goal is to
determine the size of the effect. If we conclude that the association is due to
causal effect confounded with the effect of a lurking variable, then our next
goal becomes determining the sizes of both the effects.
Data
Information
Visualization Tools
Reports
Data Analysis
Tools
Use appropriate
Information Delivery
Insight
Application of
Business Drivers
Business Value
t$
$
Empowered
Decision-Making
Constant Optimization
BW/Operational Reporting
Looking back
Looking forward
Discover & simulate
Understand
What happened?
Analyze
Key Performance Indicators
(KPIs)
Present
Business Goals
Your competitive edge is a data-driven culture.
Sales Growth
SalesKPI
Sales Opportunities
Product Performance
Rank products based on revenue
performance.
SIGN IN
TRY IT NOW
HOW IT WORKS
INTEGRATIONS
BLOG
PRICING
MORE
Univariate Analysis
Univariate analysis involves the examination across cases of one
variable at a time. There are three major characteristics of a single
variable that we tend to look at:
the distribution
the central tendency
the dispersion
The Distribution
Central Tendency.
The
Dispersion
Dispersion refers to the spread of the values around the central tendency. There
are two common measures of dispersion, the range and the standard deviation.
Therange
approximately 68% of the scores in the sample fall within one standard deviation
of the mean
approximately 95% of the scores in the sample fall within two standard deviations
of the mean
approximately 99% of the scores in the sample fall within three standard
deviations of the mean
Descriptive Statistics
15,20,21,20,36,15,25,15
Mean
20.8750
Median
20.0000
Mode
Std.
Deviation
Variance
Range
15.00
7.0799
50.1250
21.00
Bivariate Analysis
Correlation
The correlation is one of the most common and most useful statistics. A correlation is
a single number that describes the degree of relationship between two variables.
Person
Height
Self Esteem
68
4.1
71
4.6
62
3.8
75
4.4
58
3.2
60
3.1
67
3.8
68
4.1
71
4.3
10
69
3.7
11
68
3.5
12
67
3.2
13
63
3.7
14
62
3.3
15
60
3.4
16
63
4.0
17
65
4.1
18
67
3.8
Histogram
Histogram
smoking party.
Smoking parties
have a lot more
variability in the
tips that they
give. Males tend
to pay the (few)
higher bills, and
female nonsmokers tend to
be very
consistent
tippers (with the
exception of
three women).