1. : What is the primary purpose of the Apriori algorithm in data mining?
(A) Classification
(B) Regression
(C) Association rule mining
(D) Clustering
2. : Which of the following is the first step in the Apriori algorithm?
(A) Generate candidate itemsets
(B) Calculate confidence
(C) Prune non-frequent itemsets
(D) Generate frequent itemsets
3. : In the context of the Apriori algorithm, what is ‘support’?
(A) The ratio of the number of transactions that contain an itemset to the total number of transactions
(B) The probability of an itemset occurring given another itemset
(C) The strength of an association rule
(D) The total number of items in a transaction
4. : How does the Apriori algorithm generate candidate itemsets?
(A) By randomly selecting items
(B) By using frequent itemsets from the previous iteration
(C) By clustering similar items
(D) By sorting items based on their frequency
5. : What is the purpose of the pruning step in the Apriori algorithm?
(A) To generate new candidate itemsets
(B) To remove infrequent itemsets
(C) To calculate the confidence of rules
(D) To sort the itemsets
6. : In the Apriori algorithm, which property helps reduce the number of candidate itemsets?
(A) Monotonicity property
(B) Anti-monotonicity property
(C) Transitivity property
(D) Symmetry property
7. : What does the ‘confidence’ measure in the context of association rules?
(A) The frequency of the rule in the dataset
(B) The accuracy of the rule
(C) The probability that the rule’s consequent is true given the antecedent
(D) The number of items in the rule
8. : If an itemset is frequent, what can be said about its subsets in the context of the Apriori algorithm?
(A) They must also be frequent
(B) They can be infrequent
(C) They are irrelevant
(D) They are less important
9. : What does the term ‘lift’ indicate in association rule mining?
(A) The number of items in a transaction
(B) The strength of a rule over random occurrence
(C) The total number of transactions
(D) The sum of item frequencies
10. : Which of the following is a limitation of the Apriori algorithm?
(A) It cannot handle large datasets
(B) It requires multiple database scans
(C) It does not support numeric data
(D) It cannot generate rules with more than two items