Professional Documents
Culture Documents
To
1
Topics of Discussion
3
Imbalanced Data Sets
Significance
5
Significance
• At Algorithmic Level:
- Adjusting the misclassification costs
- Adjusting the decision threshold at the
tree leaf
7
Quiz-1
Which of these is not a type of imbalance scenario?
a) 95:5 b) 80:20
b) 75:25 d) 60:40
Ans.: b
Data Level techniques? (multiple possibilities)
a) Cost Sensitive b) Under Sampling
c) Over Sampling d) Classifier based
Ans.: b and c
Real-world extreme imbalanced data set example is?
a) Fraud detection b) Birth rate ratio
Ans.: a
Research Challenges
10
Research Challenges
11
Research Challenges
12
Research Challenges
15
Research Challenges
16
Research Challenges
17
Research Challenges
18
Research Challenges
19
Research Challenges
- Need of an in-depth analysis of the structure of minority
class and its examples
- Analyse the appearance of new types of examples or
changes in properties of already described types
- To address complex scenarios requiring local analysis of
each difficult region and their individual solutions
20
Research Challenges
- Need of an in-depth analysis of the structure of minority
class and its examples
- Analyse the appearance of new types of examples or
changes in properties of already described types
- To address complex scenarios requiring local analysis of
each difficult region and their individual solutions
21
Quiz-2
Can a sample may overlapped with more than two classes?
a) false b) true
Ans.: b
Dose the streaming data progresses skewness?
a) May be b) May not
c) Yes d) No
Ans.: a and b (mostly)
23