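- Confidence: the likelihood that a rule holds for a new transaction that contains the items on the LHS of the rule:
> $\text{Confidence}(A \rightarrow B) = \dfrac{\text{Support}(A \cup B)}{\text{Support}(A)} = P(B \mid A)$
> Here $\text{Support}(A \cup B)$ is the share of transactions that contain every item in both A and B, i.e. $P(A \cap B)$.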
> For all lift values greater than 1, actual lift = lift value - 1
> % increase in those cases = (lift value - 1) × 100, e.g. a lift of 1.25 means B is 25% more likely when A is present
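> Lift:
$$
\begin{aligned}
\text{Lift}(A \rightarrow B) &= \frac{\text{Confidence}(A \rightarrow B)}{\text{Support}(B)} = \frac{P(B \mid A)}{P(B)} = \frac{P(A \cap B) / P(A)}{P(B)} \\
&= \frac{\text{Support}(A \cup B)}{\text{Support}(A)\,\text{Support}(B)} = \frac{P(A \cap B)}{P(A)\,P(B)}
\end{aligned}
$$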
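The lift ratio and its percentage reading can be checked numerically. The following is a minimal sketch; the function names and the three support values are made up for illustration and do not come from the notes.

```python
def lift(support_a: float, support_b: float, support_ab: float) -> float:
    """Lift of the rule A -> B, computed from the three support values."""
    confidence = support_ab / support_a  # Confidence(A -> B) = P(B | A)
    return confidence / support_b        # divide by Support(B) = P(B)


def percent_increase(lift_value: float) -> float:
    """Percentage change in the chance of B when A is present: (lift - 1) * 100."""
    return (lift_value - 1) * 100


# Hypothetical supports, chosen only for illustration.
example_lift = lift(support_a=0.4, support_b=0.25, support_ab=0.2)
print(round(example_lift, 2))                    # 2.0
print(round(percent_increase(example_lift), 1))  # 100.0
```

With these assumed numbers the lift is 2, so B would be 100% more likely in transactions that contain A than in transactions overall.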
Basket Data
ID    Item
001   Apple
001   Orange
002   Apple
003   Orange
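As a rough sketch, support, confidence and lift can be computed directly from basket data in this ID/Item layout. It assumes that rows sharing an ID belong to the same transaction; the helper names `support`, `confidence` and `lift` are illustrative only.

```python
from collections import defaultdict

# Rows of the basket data: (transaction ID, item).
rows = [("001", "Apple"), ("001", "Orange"), ("002", "Apple"), ("003", "Orange")]

# Group items by transaction ID (assumption: one ID = one transaction).
baskets = defaultdict(set)
for tid, item in rows:
    baskets[tid].add(item)


def support(items):
    """Fraction of transactions that contain every item in `items`."""
    items = set(items)
    return sum(items <= basket for basket in baskets.values()) / len(baskets)


def confidence(lhs, rhs):
    """Confidence(lhs -> rhs) = Support(lhs and rhs together) / Support(lhs)."""
    return support(set(lhs) | set(rhs)) / support(lhs)


def lift(lhs, rhs):
    """Lift(lhs -> rhs) = Confidence(lhs -> rhs) / Support(rhs)."""
    return confidence(lhs, rhs) / support(rhs)


print(round(support({"Apple"}), 3))                 # 0.667
print(round(confidence({"Apple"}, {"Orange"}), 3))  # 0.5
print(round(lift({"Apple"}, {"Orange"}), 3))        # 0.75
```

In this tiny data set the lift of Apple -> Orange is below 1, so Orange appears less often in baskets containing Apple than it does overall.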
Types of rules
- Actionable rules: the gems; they contain high-quality, actionable information that was previously not known or not common knowledge
- Trivial rules: Information already well known within the business
- Inexplicable rules: no explanation available and non-actionable
Apriori Algorithm
- Used to reduce the number of item combinations (candidate item sets) that have to be evaluated
- Based on the Apriori principle: if the support of an item set is large, then the support of all of its subsets must also be large; likewise, if the support of an item set is small, the support of all of its supersets must also be small.
- Support of an item set will never exceed the support of its subsets.
- By using the Apriori algorithm, we progressively identify large item sets and eliminate low-support item sets early in the analysis, which reduces the amount of computation required to get to insightful, actionable rules (high support, high confidence)
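The level-wise search described in these bullets can be sketched as follows. This is a simplified illustration, not an optimized implementation; the function name `apriori`, the `min_support` threshold and the sample transactions (including the Milk item) are assumptions made up for the example.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: support} for every item set meeting min_support."""
    transactions = [frozenset(t) for t in transactions]
    n = len(transactions)

    def support(itemset):
        return sum(itemset <= t for t in transactions) / n

    # Level 1: keep only single items with enough support.
    items = {item for t in transactions for item in t}
    current = {frozenset([i]) for i in items}
    current = {s for s in current if support(s) >= min_support}
    frequent = {s: support(s) for s in current}

    k = 2
    while current:
        # Join step: combine frequent (k-1)-item sets into k-item candidates.
        candidates = {a | b for a in current for b in current if len(a | b) == k}
        # Prune step: a candidate with any low-support (k-1)-subset cannot
        # itself have enough support, so drop it without counting.
        candidates = {
            c for c in candidates
            if all(frozenset(sub) in current for sub in combinations(c, k - 1))
        }
        current = {c for c in candidates if support(c) >= min_support}
        frequent.update({c: support(c) for c in current})
        k += 1
    return frequent


# Hypothetical transactions, for illustration only.
sample = [{"Apple", "Orange", "Milk"}, {"Apple", "Orange"}, {"Apple", "Milk"}, {"Orange"}]
result = apriori(sample, min_support=0.5)
for itemset, sup in sorted(result.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), sup)
```

In this sample, {Orange, Milk} falls below the threshold at level 2, so the three-item set {Apple, Orange, Milk} is discarded in the prune step without its support ever being counted.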