Caret Data Science MCQs Questions Answers

1. Which is helpful to generate balanced cross-validation groupings from a set of data?
a) createResample
b) createSample
c) createFolds
d) none of these
MCQ Answer: c
2. Which of the following is the wrong statement?
a) Three parameters are helpful to time series splitting
b) Simple random sampling of time series is possibly the greatest way to resample times series data.
c) Horizon parameter is the number of consecutive values in test set sample
d) All of these
MCQ Answer: b
3. Which of the following function can be helpful to maximize the minimum dissimilarities?
a) sumDiss
b) avgDiss
c) minDiss
d) All of these
MCQ Answer: d
4. Which function can create the indices for the time series type of splitting?
a) createTimeSlices
b) newTimeSlices
c) binTimeSlices
d) none of these
MCQ Answer: a
5. Which of the following is the correct statement?
a) Caret includes several functions to pre-process the predictor data
b) Asymptotics are helpful to inference typically
c) The function dummyVars can be helpful to generate a complete set of dummy variables from one or more factors
d) All of these
MCQ Answer: d
6. Which is helpful to create sub-samples using a maximum dissimilarity approach?
a) minDissim
b) inmaxDissim
c) maxDissim
d) All of these
MCQ Answer: c
7. caret does not use the proxy package.
a) True
b) False
MCQ Answer: b
8. Which function can be helpful to create balanced splits of the data?
a) newDataPartition
b) renameDataPartition
c) createDataPartition
d) none of these
MCQ Answer: c
9. Which package tools are present in caret?
a) model tuning
b) feature selection
c) pre-processing
d) All of these
MCQ Answer: d
10. caret stands for classification and regression training.
a) True
b) False
MCQ Answer: a
11. Which of the following function is a wrapper for dissimilar lattice plots to visualize the data?
a) featurePlot
b) levelplot
c) plotsample
d) None of these
MCQ Answer: a
12. Which of the following is the wrong statement?
a) In every situation, the data generating mechanism can create predictors that only have a single unique value of a matrix to enumerate sets of linear combinations
b) Predictors might have only a handful of unique values that occur with very low frequencies
c) The function findLinearCombos uses the QR decomposition
d) All of these
MCQ Answer: c
13. Which function can be helpful to identify near zero-variance variables?
a) nearZeroVar
b) nearVar
c) zeroVar
d) All of these
MCQ Answer: a
14. Which function can be helpful to flag predictors for removal?
a) searchCorrelation
b) findCorrelation
c) findCausation
d) none of these
MCQ Answer: b
15. Which of the following is the correct statement?
a) findLinearColumns will also return a vector of column positions that can be removed to eliminate the linear dependencies
b) the function findLinearRows can be helpful to generate a complete set of row variables from one factor
c) findLinearCombos will return a list that enumerates dependencies
d) None of these
MCQ Answer: c
16. Which can be helpful to impute data sets based only on information in the training set?
a) preProcess
b) postProcess
c) process
d) All of these
MCQ Answer: a
17. The function preProcess guesses the needed parameters for each operation.
a) True
b) False
MCQ Answer: a
18. Which of the following can also be helpful to find new variables that are linear combinations of the original set with independent components?
a) PCA
b) SCA
c) ICA
d) None of these
MCQ Answer: c
19. Which function is helpful to generate the class distances?
a) predict.classDist
b) preprocess.classDist
c) predict.classDistance
d) All of these
MCQ Answer: a
20. The preProcess class can be helpful to many operations on predictors.
a) True
b) False
MCQ Answer: a
21. varImp is a wrapper around the evimp function in which of the following package?
a) numpy
b) plot
c) earth
d) none of these
MCQ Answer: c
22. Which of the following is the wrong statement?
a) An argument, para, is helpful to choice the model fitting technique
b) For regression, the relationship between each predictor and the outcome is evaluated
c) The trapezoidal rule is helpful to compute the area under the ROC curve
d) All of these
MCQ Answer: a

23. Which of the following curve analysis is conducted on each predictor for classification?
a) NOC
b) COC
c) ROC
d) All of these
MCQ Answer: c
24. Which of the following function tracks the changes in model statistics?
a) findTrack
b) varImpTrack
c) varImp
d) none of these
MCQ Answer: c
25. Which of the following is the correct statement?
a) Boosted Trees uses a dissimilar approach as a single tree
b) The Bagged Trees output holds variable usage statistics
c) The difference between the class centroids and the overall centroid is helpful to measure the variable influence
d) None of these
MCQ Answer: c
26. What model includes a backward elimination feature selection routine?
a) MCV
b) MCRS
c) MARS
d) All of these
MCQ Answer: c
26. The benefit of using a model-based method is that is more closely tied to the model performance.
a) True
b) False
MCQ Answer: a
27. What model sums the importance over each boosting iteration?
a) Partial least squares
b) Bagged trees
c) Boosted trees
d) None of these
MCQ Answer: c
28. What argument is helpful to set important values?
a) set
b) scale
c) value
d) All of these
MCQ Answer: b
29. For most classification models, each predictor will have separate variable importance for each class.
a) True
b) False
MCQ Answer: a

More Next Data Mining MCQs

  1. Repeated Data Mining MCQs
  2. Classification in Data mining MCQs
  3. Clustering in Data mining MCQs
  4. Data Analysis and Experimental Design MCQs
  5. Basics of Data Science MCQs
  6. Big Data MCQs
  7. Caret Data Science MCQs 
  8. Binary and Count Outcomes MCQs
  9. CLI and Git Workflow

 

  1. Data Preprocessing MCQs
  2. Data Warehousing and OLAP MCQs
  3. Association Rule Learning MCQs
  4. Classification
  5. Clustering
  6. Regression MCQs
  7. Anomaly Detection MCQs
  8. Text Mining and Natural Language Processing (NLP) MCQs
  9. Web Mining MCQs
  10. Sequential Pattern Mining MCQs
  11. Time Series Analysis MCQs

Data Mining Algorithms and Techniques MCQs

  1. Frequent Itemset Mining MCQs
  2. Dimensionality Reduction MCQs
  3. Ensemble Methods MCQs
  4. Data Mining Tools and Software MCQs
  5. Python  Programming for Data Mining MCQs (Pandas, NumPy, Scikit-Learn)
  6. R Programming for Data Mining(dplyr, ggplot2, caret) MCQs
  7. SQL Programming for Data Mining for Data Mining MCQs
  8. Big Data Technologies MCQs

Add a Comment