Professional Documents
Culture Documents
Iterative Operations & Functions in Python This is where we move beyond simple data
containers and learn about amazing possibilities
Writing for loops in Python
and functionalities hidden in various associated
List & Dictionary Comprehension
operators. We get introduced to wonderful world
While loops and conditional blocks
of loops, list and dictionary comprehensions. In
List/Dictionary comprehensions with loops
addition to already existing functions and classes
Writing your own functions in Python we learn to write our own custom functions and
Writing your own classes and functions as class objects classes. This module sets the stage for handling
Practice assignment data and ML algorithm implementation in python.
Data Summary; Numerical and Visual in Python
Data summary is extremely important to
Need for data summary understand what the data is saying and gain
Summarising numeric data in pandas insights in just one glance. Visualization of data is a
Summarising categorical data strong point of the Python software using the latest
Group wise summary of mixed data ggplot package using much celebrated grammar of
Need for visual summary graphics. We also introduce you another powerful
Introduction to ggplot & Seaborn package seaborn in additional material section.
Visual summary of different data combinations
Practice Exercise
Data Handling in Python using NumPy & Pandas Python is a very versatile language and in this
Introduction to NumPy arrays, functions &properties module we expand on its capabilities related to
Introduction to pandas data handling. Focusing on packages numpy and
Dataframe functions and properties pandas we learn how to manipulate data which will
Reading and writing external data be eventually useful in converting raw data suitable
Manipulating Data Columns for machine learning algorithms.
Tree Models using Python In this module we will learn a very popular class of
Introduction to decision trees machine learning models, rule based tree
Tuning tree size with cross validation structures also known as Decision Trees. We'll
examine their biased nature and learn how to use
Introduction to bagging algorithm
bagging methodologies to arrive at a new
Random Forests
technique known as Random Forest to analyse
Grid search and randomized grid search
data. Well further extend the idea of randomness
ExtraTrees (Extremely Randomised Trees)
to decrease bias in ExtraTrees algorithm.
Partial Dependence Plots
Case Studies In addition, we learn about powerful tools used
Home exercises with all kind of machine learning algorithms,
gridSearchCV and RandomizedSearchCV.
Case Studies:
In the class we continue with the case studies taken
in previous module of simple linear models and see
how the tree based models compare in terms of
performance in comparison to the linear models.
Case Studies:
1. Understanding impact of cash assistance
programs in New York: To understand PCA,
we take up data of cash assistance programs in
New York. This has more than 60 variables.
Well see how can we reduce the size of the
data.