Professional Documents
Culture Documents
(e) Given three rules {p} {q}, {p} {q, r}, & {p, r} {q}, can you identify rule
that has lowest confidence? Can you identify the rule with the highest confidence?
Q.3. Suppose that Apriori algorithm is applied to the data set shown in table below with
minimum support = 30%. Answer the questions given below [2 + 3 = 5]
Transaction
ID Items Bought
1 {apple, banana, dates, guava}
2 {banana, coconut, dates}
3 {apple, banana, dates, guava}
4 {apple, coconut, dates, guava}
5 {banana, coconut, dates, guava}
6 {banana, dates, guava}
7 {coconut, dates}
8 {apple, banana, coconut}
9 {apple, dates, guava}
10 {banana, dates}
a. What is the percentage of frequent itemsets (with respect to all possible itemsets)?
b. What is the false alarm rate (i.e., percentage of candidate itemsets that are found to be
infrequent after performing support counting)?
Q.4. Draw single link dendrogram for the following distance data among 5 points. If we need
to restrict inter-cluster distance to at least 0.20, what is maximum count of clusters?
[3 + 2 = 5]
p1 p2 p3 p4 p5
p1 0 0.95 0.69 0.48 0.65
p2 0.95 0 0.36 0.53 0.17
p3 0.69 0.36 0 0.56 0.21
p4 0.48 0.53 0.56 0 0.24
p5 0.65 0.17 0.21 0.24 0
Q.5. Use k-means algorithm to cluster the following points into three clusters. Use Manhattan
distance to compute distance between points. Consider A, C, and G as initial centroids for
the three clusters. [5]
A B C D E F G H
(2,10) (2,5) (8,4,) (5,8) (7,5) (6,4) (1,2) (4,9)
Q.6. The demand for a product for 5 consecutive months is given in the table below:
Month 1 2 3 4 5
Demand (in thousands) 20 21 23 24 25
Use exponential smoothing with a smoothing factors (α) of 0.9 and 0.1 to forecast
demand for 6th month. Compare the results obtained in both cases. [3 + 2 = 5]
************