Course in Data Science

Contact: +917095167689
About the Course:
This course introduces the main tools and ideas required of a Data Scientist, Business Analyst, or Data
Analyst. It gives an overview of the data, questions, and tools that data analysts and data scientists
work with. The course has two components. The first is a conceptual introduction to the ideas behind
turning data into actionable knowledge. The second is a practical introduction to the tools used in the
program, such as R, SAS, Minitab, and Excel.

Course features:

140+ hours of teaching

Exam every weekend

Exclusive doubt-clarification session every weekend

Real-time, case-study-driven approach

Live Project

Placement assistance

Qualification

Any graduate. No prior programming or statistics knowledge is required.

Duration of the course:

3 months (2 hours of teaching per day).

Classes on weekdays.

Mode of course delivery

Online and classroom training are available.

Classroom Training Venue:

KPHB. For more details, call +917095167689.

Faculty Details:

A team of faculty with an average of 20+ years of experience in data analysis and training across
various industries.

Module 1 - Descriptive & Inferential Statistics (20 hrs)

1. Turning Data into Information
Data Visualization
Measures of Central Tendency
Measures of Variability
Measures of Shape
Covariance, Correlation
Using Software - Real-Time Problems

2. Probability Distributions
Probability Distributions: Discrete Random Variables
Mean, Expected Value
Binomial Random Variable
Poisson Random Variable
Continuous Random Variable
Normal Distribution
Using Software - Real-Time Problems

3. Sampling Distributions
Central Limit Theorem
Sampling Distributions for Sample Proportion, p-hat
Sampling Distribution of the Sample Mean, x-bar
Using Software - Real-Time Problems

4. Confidence Intervals
Statistical Inference
Constructing Confidence Intervals to Estimate a Population Mean, Variance, Proportion
Using Software - Real-Time Problems

5. Hypothesis Testing
Hypothesis Testing
Type I and Type II Errors
Decision Making in Hypothesis Testing
Hypothesis Testing for a Mean, Variance, Proportion
Power in Hypothesis Testing
Using Software - Real-Time Problems

6. Comparing Two Groups
Comparing Two Groups
Comparing Two Independent Means, Proportions
Pairwise Testing for Means
Two Variances Test (F-Test)
Using Software - Real-Time Problems

7. Analysis of Variance (ANOVA)
One-Way and Two-Way ANOVA
ANOVA Assumptions
Multiple Comparisons (Tukey, Dunnett)
Using Software - Real-Time Problems

8. Association Between Categorical Variables
Two Categorical Variables Relation
Statistical Significance of Observed Relationship / Chi-Square Test
Calculating the Chi-Square Test Statistic
Contingency Table
Using Software - Real-Time Problems

Module 2 - Applied Regression Methods (20 hrs)

1. Simple Linear Regression (SLR)
Prerequisite Mathematics
The Simple Linear Regression Model
What Is the Common Error Variance?
The Coefficient of Determination
Hypothesis Test for the Population Correlation Coefficient
Using Software - Real-Time Problems

2. SLR Model Evaluation
Inference for the Population Intercept and Slope
The Analysis of Variance (ANOVA) Table and the F-test
Equivalent Linear Relationship Tests
Decomposing the Error
The Lack of Fit F-test
Using Software - Real-Time Problems

3. SLR Estimation & Prediction
Confidence Interval for the Mean Response
Prediction Interval for a New Response
Using Software - Real-Time Problems

4. SLR Model Assumptions
Model Assumptions Diagnostics
Using Software - Real-Time Problems

5. Multiple Linear Regression (MLR)
The Multiple Linear Regression Model
Using Software - Real-Time Problems

6. MLR Model Evaluation
The General Linear Test
Sequential (or Extra) Sums of Squares
The Hypothesis Tests for the Slopes
Partial R-squared
Lack of Fit Testing in the Multiple Regression Setting
Using Software - Real-Time Problems

7. MLR Estimation, Prediction & Model Assumptions
Confidence Interval for the Mean Response
Prediction Interval for a New Response
Model Assumptions Diagnostics
Using Software - Real-Time Problems

8. Categorical Predictors
Coding Qualitative Variables
Additive Effects
Interaction Effects
Using Software - Real-Time Problems

9. Data Transformations
Using Software - Real-Time Problems

10. Model Building
Forward Selection / Backward Elimination
Stepwise Regression
Adjusted R-squared, Mallows' Cp, PRESS, AIC, BIC, SBC, AICc
Outliers and Influential Data Points
Cook's Distance / DFBETAS / DFFITS
Using Software - Real-Time Problems

Module 3 - Applied Time Series Analysis (10 hrs)

1. Time Series Basics
Overview
ACF and AR(1) Model

2. MA Models, PACF
Moving Average Models (MA Models)
PACF
Using Software - Real-Time Problems

3. ARIMA Models
Non-seasonal ARIMA
Diagnostics
Forecasting
Using Software - Real-Time Problems

4. Seasonal Models
Seasonal ARIMA
Identifying Seasonal Models
Using Software - Real-Time Problems

5. Smoothing and Decomposition Methods
Decomposition Models
Smoothing Time Series
Using Software - Real-Time Problems

6. Periodogram
Periodogram
Using Software - Real-Time Problems

7. Regression with ARIMA Errors; CCF; 2 Time Series
Linear Regression Models with Autoregressive Errors
CCF and Lagged Regressions
Using Software - Real-Time Problems

Module 4 - Applied Multivariate Analysis (20 hrs)

1. Measures of Central Tendency, Dispersion and Association
Measures of Central Tendency / Measures of Dispersion
Using Software - Real-Time Problems

2. Graphical Display of Multivariate Data
Graphical Methods
Using Software - Real-Time Problems

3. Multivariate Normal Distribution
Exponent of Multivariate Normal Distribution
Multivariate Normality and Outliers
Eigenvalues and Eigenvectors
Spectral Decomposition
Singular Value Decomposition
Using Software - Real-Time Problems

4. Sample Mean Vector and Sample Correlation
Distribution of Sample Mean Vector
Interval Estimate of Population Mean
Inferences for Correlations
Using Software - Real-Time Problems

5. Principal Components Analysis (PCA)
Principal Component Analysis (PCA) Procedure
Using Software - Real-Time Problems

6. Factor Analysis
Principal Component Method
Communalities
Factor Rotations
Varimax Rotation
Using Software - Real-Time Problems

7. Discriminant Analysis
Bayes Rule and Classification Problem
Discriminant Analysis (Linear/Quadratic)
Estimating Misclassification Probabilities
Using Software - Real-Time Problems

8. Cluster Analysis
Agglomerative Hierarchical Clustering
K-Means Procedure
Medoid Cluster Analysis
Using Software - Real-Time Problems

9. MANOVA
MANOVA
Test Statistics for MANOVA
Hypothesis Tests
MANOVA Table
Using Software - Real-Time Problems

Module 5 - Machine Learning (30 hrs)

1. Introduction
Application Examples
Supervised Learning
Unsupervised Learning

2. Regression Shrinkage Methods
Ridge Regression
Lasso
Using Software - Real-Time Problems

3. Classification
Logistic Regression
Discriminant Analysis
Nearest-Neighbor Methods
Using Software - Real-Time Problems

4. Tree-based Methods
The Basics of Decision Trees
Regression Trees
Classification Trees
Ensemble Methods
Bagging, Boosting, Bootstrap, Random Forests
Using Software - Real-Time Problems

5. Neural Networks
Introduction
Single-Layer Perceptron
Multi-Layer Perceptron
Feedforward and Backpropagation
Using Software - Real-Time Problems

6. Support Vector Machine
Support Vector Classifier
Support Vector Machine
SVMs with More than Two Classes
Using Software - Real-Time Problems

7. Dimension Reduction Methods
Principal Components Regression (PCR)
Partial Least Squares (PLS)
Using Software - Real-Time Problems

8. Association Rules
Market Basket Analysis
Using Software - Real-Time Problems

Module 6 - SAS/R Programming (20 hrs)

1. Base SAS
Working with SAS Program Syntax
Examining SAS Data Sets
Accessing SAS Libraries
Producing Detail Reports
Sorting and Grouping Report Data
Enhancing Reports
Formatting Data Values
Creating User-Defined Formats
Reading SAS Data Sets
Customizing a SAS Data Set
Handling Missing Data
Manipulating Data
Combining SAS Data Sets
Creating Summary Reports
Controlling Input and Output
Summarizing Data
Reading Raw Data Files
Data Transformations
Debugging Techniques
Using the PUTLOG Statement
Processing Data Iteratively
Restructuring a Data Set
Creating and Maintaining Permanent Formats

2. SAS SQL
Basic Queries
Sub-Queries
Joins (SQL)
Operators
Creating Tables and Views
Managing Tables

3. SAS Macros
Macro Variables
Definitions
Data Step and SQL Interfaces

4. R Programming
Rcmdr Package
Rattle Package