in Large-Scale Systems
Haroon Malik
Postdoctoral fellow
Ian John Davis
Michael Godfrey
Serge Mankovskii
Research Staff
Douglas Neuse
Infrastructure Management
CA Technologies
USA
Datacenters Require
Forecasting Steps
1. Determine purpose
2. Select technique
3. Prepare data
4. Prepare forecast
5. Monitor forecast
Challenges in the "Prepare data" step:
o Discontinuities
o Anomalies
[Figure: magnitude over time (days), with a discontinuity marked]
Discontinuities
Reasons:
1. Company merger
2. Hardware upgrade
3. Software change (new release)
4. Workload change
5. Promotional customers
Symptoms:
[Figure: panels (a) to (d), with time points T1, T2, T3 and a transition period marked]
We propose an automated approach to help analysts identify discontinuities in performance data.
Approach
1. Data preparation
2. Metric selection
3. Anomaly detection
4. Discontinuity identification
Output: Report (discontinuities)
1. Data Preparation
The performance logs from production contain noise:
o Missing counters
o Empty counters
o Different numerical ranges
We used statistical techniques to filter noise in the data.
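The slides do not say which statistical techniques were used, so the following is only a minimal sketch of one common way to handle the three kinds of noise listed above. The function name `prepare`, the use of `None` for a missing sample, mean imputation, and z-score scaling are all assumptions made for illustration.

```python
import math

def prepare(counters):
    """Clean a dict of counter name -> list of samples (None = missing sample).

    Hypothetical helper: drops empty counters, fills missing samples with the
    counter's mean, and rescales every counter to mean 0 and std 1 so that
    counters with different numerical ranges become comparable.
    """
    cleaned = {}
    for name, values in counters.items():
        present = [v for v in values if v is not None]
        if not present:
            continue  # empty counter: nothing was ever recorded
        mean = sum(present) / len(present)
        filled = [mean if v is None else v for v in values]
        std = math.sqrt(sum((v - mean) ** 2 for v in filled) / len(filled))
        if std == 0:
            cleaned[name] = [0.0] * len(filled)  # constant counter
        else:
            cleaned[name] = [(v - mean) / std for v in filled]
    return cleaned

logs = {
    "cpu": [10.0, None, 30.0],            # one missing sample
    "empty": [None, None, None],          # never recorded
    "mem": [1000.0, 2000.0, 3000.0],      # much larger numerical range
}
cleaned = prepare(logs)
```

After scaling, `cpu` and `mem` occupy the same range even though their raw values differ by two orders of magnitude.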
2. Metric Selection
Redundant counters:
o Highly correlated
o Invariants
o Configuration constants
We used Principal Component Analysis (PCA) to select important metrics.
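How PCA is turned into a metric ranking is not described in the slides. As a hedged illustration, the sketch below keeps one counter per principal component: the counter with the largest absolute loading. Constants are dropped first, and highly correlated counters collapse onto a single component, so only one of them is kept. The names `standardize`, `dominant_eigenpair`, and `select_metrics`, and the use of power iteration with deflation in place of a linear algebra library, are assumptions.

```python
import math

def standardize(series):
    """Scale a series to mean 0 and std 1; return None if it is constant."""
    mean = sum(series) / len(series)
    std = math.sqrt(sum((v - mean) ** 2 for v in series) / len(series))
    if std == 0:
        return None
    return [(v - mean) / std for v in series]

def dominant_eigenpair(matrix, iterations=500):
    """Power iteration for the largest eigenvalue of a symmetric PSD matrix."""
    size = len(matrix)
    vec = [1.0 + 0.1 * i for i in range(size)]  # non-uniform start vector
    for _ in range(iterations):
        nxt = [sum(m * v for m, v in zip(row, vec)) for row in matrix]
        norm = math.sqrt(sum(x * x for x in nxt))
        if norm == 0:
            break  # nothing left to extract
        vec = [x / norm for x in nxt]
    value = sum(vec[i] * sum(m * v for m, v in zip(matrix[i], vec))
                for i in range(size))
    return value, vec

def select_metrics(counters, keep):
    """Pick up to `keep` counters, one per leading principal component."""
    names, columns = [], []
    for name, series in counters.items():
        scaled = standardize(series)
        if scaled is not None:  # invariants and constants carry no variance
            names.append(name)
            columns.append(scaled)
    if not columns:
        return []
    n, size = len(columns[0]), len(columns)
    corr = [[sum(a * b for a, b in zip(ci, cj)) / n for cj in columns]
            for ci in columns]
    chosen = []
    for _ in range(min(keep, size)):
        value, vec = dominant_eigenpair(corr)
        order = sorted(range(size), key=lambda i: abs(vec[i]), reverse=True)
        pick = next(i for i in order if names[i] not in chosen)
        chosen.append(names[pick])
        # Deflate: remove the component just used, exposing the next one.
        corr = [[corr[i][j] - value * vec[i] * vec[j] for j in range(size)]
                for i in range(size)]
    return chosen

counters = {
    "cpu": [1, 2, 3, 4, 5, 6],
    "requests": [2, 4, 6, 8, 10, 12.5],   # nearly a multiple of cpu
    "disk": [5, 1, 4, 2, 6, 3],
    "max_conn": [100] * 6,                # configuration constant
}
selected = select_metrics(counters, keep=2)
```

With these made-up counters the selection contains `disk` plus one of the two correlated counters, and never `max_conn`.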
3. Anomaly Detection
Quadratic Modelling
o A greedy algorithm to replace performance counter time series data
[Figure: 3. Anomaly Detection (Quadratic Model), showing counter value and cost]
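The slides name a greedy algorithm and a quadratic model with a cost, but give no further detail. The sketch below is one plausible reading, not the authors' algorithm: a segment is grown point by point for as long as a least-squares quadratic fits it within a cost budget, and a new segment is started when the budget is exceeded. The names `quadratic_cost`, `greedy_segments`, and `max_cost`, and the choice of the sum of squared residuals as the cost, are assumptions.

```python
def quadratic_cost(ys):
    """Sum of squared residuals of the least-squares quadratic through ys.

    Expects at least three samples, taken at times 0, 1, 2, ...
    """
    ts = range(len(ys))
    s = [sum(t ** k for t in ts) for k in range(5)]
    b = [sum(y * t ** k for t, y in zip(ts, ys)) for k in range(3)]
    # Augmented normal equations for coefficients c0, c1, c2.
    a = [[s[i + j] for j in range(3)] + [b[i]] for i in range(3)]
    for col in range(3):  # Gauss-Jordan elimination with partial pivoting
        pivot = max(range(col, 3), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(3):
            if r != col:
                f = a[r][col] / a[col][col]
                a[r] = [x - f * p for x, p in zip(a[r], a[col])]
    c0, c1, c2 = (a[i][3] / a[i][i] for i in range(3))
    return sum((y - (c0 + c1 * t + c2 * t * t)) ** 2 for t, y in zip(ts, ys))

def greedy_segments(values, max_cost):
    """Greedily split values into (start, end) ranges, end exclusive."""
    segments = []
    start = 0
    while start < len(values):
        end = min(start + 3, len(values))  # three points always fit exactly
        while (end < len(values)
               and quadratic_cost(values[start:end + 1]) <= max_cost):
            end += 1  # the next point still fits the current quadratic
        segments.append((start, end))
        start = end
    return segments

# Made-up counter: a quadratic trend followed by a shifted linear trend.
series = [float(t * t) for t in range(10)] + [200.0 + 2 * t for t in range(10)]
segments = greedy_segments(series, max_cost=1e-3)
```

Points that a segment's quadratic cannot absorb end that segment, so isolated spikes and abrupt shifts show up as segment boundaries that an analyst can inspect.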
4. Discontinuity Identification
Distribution comparison
o Difference of mean between two populations
o Quantify the difference of mean between two populations
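The slides do not name the test used to compare the two populations. As an illustration only, the sketch below computes Welch's t statistic for the counter samples before and after a candidate change point. The function name `welch_t` and the sample values are assumptions.

```python
import math
from statistics import mean, variance

def welch_t(before, after):
    """Welch's t statistic for the difference between two sample means.

    Each sample needs at least two values, and at least one of them must
    have non-zero variance, otherwise the standard error is zero.
    """
    standard_error = math.sqrt(variance(before) / len(before)
                               + variance(after) / len(after))
    return (mean(after) - mean(before)) / standard_error

# Made-up counter values on either side of a candidate change point.
before = [10, 11, 9, 10, 10]
after = [20, 21, 19, 20, 20]
t_statistic = welch_t(before, after)
```

A large absolute value of the statistic suggests that the mean shifted between the two windows. It does not say how large the shift is in practical terms, which is what an effect size measures.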
[Figure: a discontinuity with the transition periods on either side, and the associated cost]
[Figure: Cohen's d threshold, with effect-size bands labelled trivial, small, and medium]
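Cohen's d expresses the difference between two means in units of their pooled standard deviation. The sketch below computes it for two windows of counter samples and maps it to an effect-size band. The cut-offs 0.2, 0.5, and 0.8 are Cohen's conventional values and are an assumption here, since the slides do not state the thresholds that were used. The function names are also hypothetical.

```python
import math
from statistics import mean, variance

def cohens_d(before, after):
    """Cohen's d using the pooled sample standard deviation."""
    n1, n2 = len(before), len(after)
    pooled = math.sqrt(((n1 - 1) * variance(before)
                        + (n2 - 1) * variance(after)) / (n1 + n2 - 2))
    return (mean(after) - mean(before)) / pooled

def effect_size(d):
    """Map d to a band; cut-offs are conventional, not from the slides."""
    magnitude = abs(d)
    if magnitude < 0.2:
        return "trivial"
    if magnitude < 0.5:
        return "small"
    if magnitude < 0.8:
        return "medium"
    return "large"

# Made-up windows before and after a candidate discontinuity.
before = [10, 11, 9, 10, 10]
after = [10.5, 11.5, 9.5, 10.5, 10.5]
band = effect_size(cohens_d(before, after))
```

Reporting only changes at or above a chosen band is one way to apply a threshold to d.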
Subjects of Study
DVD Store
System: Simulation
Domain: Cloud Computing
Type of Data: Synthetic Data
Fault Injection
Category: Types of Faults
Anomalies: CPU Stress, Memory Stress, Interfering Workload
Discontinuities: Workload as Multiplicative Factor, Change in Transaction Pattern, Hardware & Software Upgrade
Results
[Figure: bar chart of F-measure on a 0 to 1 axis; bar labels 0.92, 0.83, 0.72]
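The slides report an F-measure without defining it. Assuming the usual balanced form (the harmonic mean of precision and recall), it can be computed from detection counts as in the sketch below. The function name and the counts are made up for illustration and are not the study's data.

```python
def f_measure(true_positives, false_positives, false_negatives):
    """Balanced F-measure (F1): harmonic mean of precision and recall."""
    if true_positives == 0:
        return 0.0  # no correct detections at all
    precision = true_positives / (true_positives + false_positives)
    recall = true_positives / (true_positives + false_negatives)
    return 2 * precision * recall / (precision + recall)

# Hypothetical run: 6 injected discontinuities found, 2 false alarms, 2 missed.
score = f_measure(true_positives=6, false_positives=2, false_negatives=2)
```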
o Using a large effect size reduces false alarms, but it may cause an analyst to overlook significant discontinuities.
o Analysts have to conduct multiple experiments
QUESTIONS
Haroon Malik
malikh@uwaterloo.ca