Text Preprocessing MCQs

By: Prof. Dr. Fazal Rehman Shamil | Last updated: August 7, 2024

1. What is the primary goal of text preprocessing in data mining?
a) To remove stop words
b) To convert text into lowercase
c) To prepare text data for analysis
d) To tokenize sentences

Answer: c) To prepare text data for analysis

2. Which of the following is NOT a common step in text preprocessing?
a) Lemmatization
b) Part-of-speech tagging
c) Tokenization
d) Feature scaling

Answer: d) Feature scaling

3. What does tokenization involve in text preprocessing?
a) Removing punctuation marks from text
b) Converting words to their base forms
c) Splitting text into words or tokens
d) Normalizing text data

Answer: c) Splitting text into words or tokens

4. Which technique is used to handle different forms of a word during text preprocessing?
a) Lemmatization
b) Stemming
c) Tokenization
d) Stop word removal

Answer: a) Lemmatization

5. What is the purpose of stop word removal in text preprocessing?
a) To remove special characters from text
b) To remove words that do not contribute much to the meaning of a sentence
c) To convert words to lowercase
d) To split text into sentences

Answer: b) To remove words that do not contribute much to the meaning of a sentence

6. What is stemming in text preprocessing?
a) Removing duplicate words from text
b) Reducing words to their base or root form
c) Removing punctuation marks from text
d) Converting words to their plural forms

Answer: b) Reducing words to their base or root form

7. Which of the following is an example of normalization in text preprocessing?
a) Removing punctuation marks
b) Converting all text to uppercase
c) Converting numbers to words
d) Removing stop words

Answer: c) Converting numbers to words

8. What is the purpose of part-of-speech tagging in text preprocessing?
a) To identify the main topic of a document
b) To identify the grammatical parts of speech for each word in a sentence
c) To remove irrelevant words from text
d) To convert words into their base forms

Answer: b) To identify the grammatical parts of speech for each word in a sentence

9. Which step in text preprocessing involves removing HTML tags and special characters?
a) Tokenization
b) Lemmatization
c) Cleaning
d) Normalization

Answer: c) Cleaning

10. What is an essential consideration when performing text preprocessing?
a) Speed of text processing algorithms
b) Retaining all words in their original forms
c) Preserving sentence boundaries
d) Removing all punctuation marks

Answer: c) Preserving sentence boundaries

More Next Data Mining MCQs

  1. Repeated Data Mining MCQs
  2. Classification in Data mining MCQs
  3. Clustering in Data mining MCQs
  4. Data Analysis and Experimental Design MCQs
  5. Basics of Data Science MCQs
  6. Big Data MCQs
  7. Caret Data Science MCQs 
  8. Binary and Count Outcomes MCQs
  9. CLI and Git Workflow

 

  1. Data Preprocessing MCQs
  2. Data Warehousing and OLAP MCQs
  3. Association Rule Learning MCQs
  4. Classification
  5. Clustering
  6. Regression MCQs
  7. Anomaly Detection MCQs
  8. Text Mining and Natural Language Processing (NLP) MCQs
  9. Web Mining MCQs
  10. Sequential Pattern Mining MCQs
  11. Time Series Analysis MCQs

Data Mining Algorithms and Techniques MCQs

  1. Frequent Itemset Mining MCQs
  2. Dimensionality Reduction MCQs
  3. Ensemble Methods MCQs
  4. Data Mining Tools and Software MCQs
  5. Python  Programming for Data Mining MCQs (Pandas, NumPy, Scikit-Learn)
  6. R Programming for Data Mining(dplyr, ggplot2, caret) MCQs
  7. SQL Programming for Data Mining for Data Mining MCQs
  8. Big Data Technologies MCQs