Hierarchical Clustering MCQs

By: Prof. Dr. Fazal Rehman Shamil | Last updated: August 7, 2024

1. What is the primary objective of Hierarchical Clustering in data mining?
a) To classify data points into predefined clusters
b) To find the optimal number of clusters automatically
c) To reduce the dimensionality of the data
d) To perform regression analysis

Answer: b) To find the optimal number of clusters automatically

2. How does Hierarchical Clustering initially treat each data point?
a) As a separate cluster
b) By assigning it to the nearest centroid
c) By randomly assigning it to a cluster
d) By calculating its distance to all other points

Answer: a) As a separate cluster

3. What is the main difference between agglomerative and divisive hierarchical clustering?
a) Agglomerative starts with all data points in one cluster, while divisive starts with each point as a separate cluster
b) Agglomerative merges clusters, while divisive splits clusters
c) Agglomerative uses Euclidean distance, while divisive uses Manhattan distance
d) Agglomerative is faster than divisive

Answer: b) Agglomerative merges clusters, while divisive splits clusters

4. Which linkage method in hierarchical clustering merges clusters based on the minimum distance between points in each cluster?
a) Single linkage
b) Complete linkage
c) Average linkage
d) Ward’s linkage

Answer: a) Single linkage

5. What is the dendrogram used for in hierarchical clustering?
a) To visualize the data points
b) To display the distance between clusters at each merge step
c) To select the optimal number of clusters
d) To measure the silhouette coefficient

Answer: b) To display the distance between clusters at each merge step

6. Which method is used to determine the number of clusters in hierarchical clustering?
a) Silhouette coefficient
b) Elbow method
c) Dendrogram
d) F-measure

Answer: c) Dendrogram

7. What does the “linkage distance” measure in hierarchical clustering?
a) The distance between data points within a cluster
b) The distance between centroids of clusters
c) The distance between clusters at each merge step
d) The number of iterations until convergence

Answer: c) The distance between clusters at each merge step

8. In hierarchical clustering, what does “Ward’s method” prioritize during cluster merging?
a) Minimizing the maximum variance within clusters
b) Minimizing the sum of squared differences within clusters
c) Minimizing the sum of squared differences between clusters
d) Maximizing the silhouette coefficient

Answer: b) Minimizing the sum of squared differences within clusters

9. Which of the following is an advantage of hierarchical clustering?
a) It requires less computational resources than k-means clustering
b) It guarantees convergence to the global optimum
c) It is scalable to large datasets
d) It does not require specifying the number of clusters in advance

Answer: d) It does not require specifying the number of clusters in advance

10. What is a potential drawback of hierarchical clustering?
a) It is computationally expensive for large datasets
b) It cannot handle categorical data
c) It is sensitive to the initial placement of centroids
d) It requires normalization of data

Answer: a) It is computationally expensive for large datasets

More Next Data Mining MCQs

  1. Repeated Data Mining MCQs
  2. Classification in Data mining MCQs
  3. Clustering in Data mining MCQs
  4. Data Analysis and Experimental Design MCQs
  5. Basics of Data Science MCQs
  6. Big Data MCQs
  7. Caret Data Science MCQs 
  8. Binary and Count Outcomes MCQs
  9. CLI and Git Workflow

 

  1. Data Preprocessing MCQs
  2. Data Warehousing and OLAP MCQs
  3. Association Rule Learning MCQs
  4. Classification
  5. Clustering
  6. Regression MCQs
  7. Anomaly Detection MCQs
  8. Text Mining and Natural Language Processing (NLP) MCQs
  9. Web Mining MCQs
  10. Sequential Pattern Mining MCQs
  11. Time Series Analysis MCQs

Data Mining Algorithms and Techniques MCQs

  1. Frequent Itemset Mining MCQs
  2. Dimensionality Reduction MCQs
  3. Ensemble Methods MCQs
  4. Data Mining Tools and Software MCQs
  5. Python  Programming for Data Mining MCQs (Pandas, NumPy, Scikit-Learn)
  6. R Programming for Data Mining(dplyr, ggplot2, caret) MCQs
  7. SQL Programming for Data Mining for Data Mining MCQs
  8. Big Data Technologies MCQs