Machine Learning (ML) MCQ Set 02 | Machine Learning (ML) - Online Exam Test Papers | Machine Learning (ML) - MCQs[multiple choice questions and answers ] | Machine Learning (ML) - Mock Test Papers | Machine Learning (ML) - Practice Papers | Machine Learning (ML)

Question:

 How does the bias-variance decomposition of a ridge regression estimator compare with that of ordinaryleast squares regression?

1.ridge has larger bias, larger variance

2.ridge has smaller bias, larger variance

3.ridge has larger bias, smaller variance

4. ridge has smaller bias, smaller variance

Posted Date:-2022-06-21 11:33:46

Question:

 If TP=9 FP=6 FN=26 TN=70 then Error rate will be

1.45 percentage

2. 99 percentage

3.28 percentage

4.20 perentage

Posted Date:-2022-06-21 12:00:44

Question:

 In SVM which has quadratic kernel function of polynomial degree 2 that has slack variable C as one hyper paramenter. What would happen if we use very large value for C

1.we can still classify the data correctly for given setting of hyper parameter c

2.we can not classify the data correctly for given setting of hyper parameter c

3.we can not classify the data at all

4. data can be classified correctly without any impact of c

Posted Date:-2022-06-21 12:40:25

Question:

 Linear Regression is a _______ machine learning algorithm.

1.supervised

2.unsupervised

3.semi-supervised

4.can say

Posted Date:-2022-06-21 12:53:20

Question:

 Suppose we train a hard-margin linear SVM on n > 100 data points in R2, yielding a hyperplane with exactly 2 support vectors. If we add one more data point and retrain the classifier, what is the maximum possible number of support vectors for the new hyperplane (assuming the n + 1 points are linearly separable)?

1.1

2.2

3. n

4.n+1

Posted Date:-2022-06-21 11:58:11

Question:

 Suppose your model is demonstrating high variance across the different training sets. Which of the following is NOT valid way to try and reduce the variance?

1. increase the amount of traning data in each traning set

2.improve the optimization algorithm being used for error minimization.

3.decrease the model complexity

4.reduce the noise in the training data

Posted Date:-2022-06-21 11:49:44

Question:

 The difference between the actual Y value and the predicted Y value found using a regression equation is called the

1.slope

2.residual

3.outlier

4.scatter plot

Posted Date:-2022-06-21 12:51:31

Question:

 What is the purpose of the Kernel Trick?

1.to transform the data from nonlinearly separable to linearly separable

2.to transform the problem from regression to classification

3.to transform the problem from supervised to unsupervised learning

4.all of the above

Posted Date:-2022-06-21 11:27:21

Question:

 Which of the following are components of generalization Error?

1.bias

2.vaiance

3.both of them

4.none of them

Posted Date:-2022-06-21 12:11:47

Question:

A machine learning problem involves four attributes plus a class. The attributes have 3, 2, 2, and 2 possible values each. The class has 3 possible values. How many maximum possible different examples are there?

1.12

2.24

3.48

4.72

Posted Date:-2022-06-21 12:46:42

Question:

A perceptron adds up all the weighted inputs it receives, and if it exceeds a certain value, it outputs a 1, otherwise it just outputs a 0.

1. true

2.false

3.sometimes â€“ it can also output intermediate values as well

4.canâ€™t say

Posted Date:-2022-06-21 11:26:12

Question:

Based on survey , it was found that the probability that person like to watch serials is 0.25 and the probability that person like to watch netflix series is 0.43. Also the probability that person like to watch serials and netflix sereis is 0.12. what is the probability that a person doesn't like to watch either?

1.0.32

2.0.2

3.0.44

4.0.56

Posted Date:-2022-06-21 12:45:56

Question:

During the treatement of cancer patients , the doctor needs to be very careful about which patients need to be given chemotherapy.Which metric should we use in order to decide the patients who should given chemotherapy?

1.precision

2.recall

3.call

4.score

Posted Date:-2022-06-21 12:34:47

Question:

he minimum time complexity for training an SVM is O(n2). According to this fact, what sizes of datasets are not best suited for SVMâ€™s?

1. large datasets

2.small datasets

3.medium sized datasets

4.size does not matter

Posted Date:-2022-06-21 12:02:57

Question:

How can SVM be classified?

1. it is a model trained using unsupervised learning. it can be used for classification and regression.

2.it is a model trained using unsupervised learning. it can be used for classification but not for regression.

3.it is a model trained using supervised learning. it can be used for classification and regression.

4.t is a model trained using unsupervised learning. it can be used for classification but not for regression.

Posted Date:-2022-06-21 11:44:58

Question:

Imagine, you are solving a classification problems with highly imbalanced class. The majority class is observed 99% of times in the training data. Your model has 99% accuracy after taking the predictions on test data. Which of the following is true in such a case?
1. Accuracy metric is not a good idea for imbalanced class problems.
2.Accuracy metric is a good idea for imbalanced class problems.
3.Precision and recall metrics are good for imbalanced class problems.
4.Precision and recall metrics arenâ€™t good for imbalanced class problems.

1.1 and 3

2.1 and 4

3.2 and 3

4.2 and 4

Posted Date:-2022-06-21 12:01:54

Question:

In SVM, RBF kernel with appropriate parameters to perform binary classification where the data is non-linearly seperable. In this scenario

1. the decision boundry in the transformed feature space in non-linear

2. the decision boundry in the transformed feature space in linear

3.. the decision boundry in the original feature space in not considered

4.the decision boundry in the original feature space in linear

Posted Date:-2022-06-21 12:41:25

Question:

Let S1 and S2 be the set of support vectors and w1 and w2 be the learnt weight vectors for a linearly
separable problem using hard and soft margin linear SVMs respectively. Which of the following are correct?

1.s1 âš‚ s2

2. s1 may not be a subset of s2

3.w1 = w2

4.all of the above

Posted Date:-2022-06-21 11:59:04

Question:

MLE estimates are often undesirable because

1.they are biased

2.they have high variance

3.they are not consistent estimators

4. none of the above

Posted Date:-2022-06-21 12:47:51

Question:

Neural networks

1.optimize a convex cost function

2.always output values between 0 and 1

3. can be used for regression as well as classification

4.all of the above

Posted Date:-2022-06-21 12:52:25

Question:

Perceptron Classifier is

1.unsupervised learning algorithm

2. semi-supervised learning algorithm

3.supervised learning algorithm

4.soft margin classifier

Posted Date:-2022-06-21 12:05:06

Question:

Suppose you are using RBF kernel in SVM with high Gamma value. What does this signify?

1.the model would consider even far away points from hyperplane for modeling

2. the model would consider only the points close to the hyperplane for modeling

3.the model would not be affected by distance of points from hyperplane for modeling

4.none of the above

Posted Date:-2022-06-21 11:50:27

Question:

Suppose you have trained an SVM with linear decision boundary after training SVM, you correctly infer that your SVM model is under fitting. Which of the following is best option would you more likely to consider iterating SVM next time?

1.you want to increase your data points

2.you want to decrease your data points

3.you will try to calculate more variables

4. you will try to reduce the features

Posted Date:-2022-06-21 11:46:42

Question:

The cost parameter in the SVM means:

1.the number of cross-validations to be made

2. the kernel to be used

3. the tradeoff between misclassification and simplicity of the model

4. none of the above

Posted Date:-2022-06-21 11:31:47

Question:

The firing rate of a neuron

1.determines how strongly the dendrites of theneuron stimulate axons of neighboring neurons

2.is more analogous to the output of a unit in aneural net than the output voltage of the neuron

3.only changes very slowly, taking a period ofseveral seconds to make large adjustments

4.can sometimes exceed 30,000 action potentialsper second

Posted Date:-2022-06-21 11:29:25

Question:

The kernel trick

1.can be applied to every classification algorithm

2. is commonly used for dimensionality reduction

3.changes ridge regression so we solve a d ?? dlinear system instead of an n ?? n system, given nsample points with d features

4.exploits the fact that in many learning algorithms, the weights can be written as a linearcombination of input points

Posted Date:-2022-06-21 11:32:47

Question:

The soft margin SVM is more preferred than the hard-margin SVM when-

1.the data is linearly seperable

2.the data is noisy and contains overlapping points

3.the data is not noisy and linearly seperable

4.weight of a person

Posted Date:-2022-06-21 12:37:43

Question:

The SVMs are less effective when

1.the data is linearly separable

2.the data is clean and ready to use

3.the data is noisy and contains overlapping points

4.option 1 and option 2

Posted Date:-2022-06-21 12:09:35

Question:

Type of dataset available in Supervised Learning is

1.unlabeled dataset

2.labeled dataset

3.csv file

4.excel file

Posted Date:-2022-06-21 12:05:53

Question:

We usually use feature normalization before using the Gaussian kernel in SVM. What is true about feature normalization? 1. We do feature normalization so that new feature will dominate other
2. Some times, feature normalization is not feasible in case of categorical variables
3. Feature normalization always helps when we use Gaussian kernel in SVM

1.1

2.1 and 2

3.1 and 3

4.2 and 3

Posted Date:-2022-06-21 11:53:59

Question:

What are support vectors?

1. all the examples that have a non-zero weight ??k in a svm

2.the only examples necessary to compute f(x) in an svm.

3. all of the above

4. none of the above

Posted Date:-2022-06-21 11:25:13

Question:

What is the Accuracy in percentage based on following confusion matrix of three class classification.
Confusion Matrix C=[14 0 0]
[ 1 15 0]
[ 0 0 6]

1.0.75

2.0.97

3.0.95

4.0.85

Posted Date:-2022-06-21 12:43:12

Question:

What is the precision value for following confusion matrix of binary classification?

1.0.91

2.0.09

3.0.9

4.0.95

Posted Date:-2022-06-21 12:10:58

Question:

What is/are true about kernel in SVM? 1. Kernel function map low dimensional data to high dimensional space2. Itâ€™s a similarity function

1.1

2.2

3.1 and 2

4.none of these

Posted Date:-2022-06-21 11:47:22

Question:

which among the following is the most appropriate kernel that can be used with SVM to separate the classes

1.linear kernel

2. gaussian rbf kernel

3.polynomial kernel

4. option 1 and option 3

Posted Date:-2022-06-21 12:06:43

Question:

Which of the following are real world applications of the SVM?

1.text and hypertext categorization

2.image classification

3.clustering of news articles

4.all of the above

Posted Date:-2022-06-21 11:34:58

Question:

Which of the following can help to reduce overfitting in an SVM classifier?

1.use of slack variables

2.high-degree polynomial features

3.normalizing the data

4.setting a very low learning rate

Posted Date:-2022-06-21 11:45:47

Question:

Which of the following can only be used when training data are linearlyseparable?

1.linear hard-margin svm

2.linear logistic regression

3.linear soft margin svm

4.parzen windows

Posted Date:-2022-06-21 11:28:17

Question:

Which of the following evaluation metrics can not be applied in case of logistic regression output to compare with target?

1. auc-roc

2.accuracy

3.mean-squared-error

4.mean-squared-error

Posted Date:-2022-06-21 11:30:29

Question:

Which of the following is a categorical data?

1.branch of bank

2.expenditure in rupees

3.prize of house

4.weight of a person

Posted Date:-2022-06-21 12:36:50

Question:

Which of the following is not a kernel method in SVM?

1.linear kernel

2.polynomial kernel

3.rbf kernel

4. nonlinear kernel

Posted Date:-2022-06-21 12:12:39

Question:

Which of the following is true about SVM? 1. Kernel function map low dimensional data to high dimensional space. 2. It is a similarity Function

1. 1 is true, 2 is false

2.1 is false, 2 is true

3.1 is true, 2 is true

4.1 is false, 2 is false

Posted Date:-2022-06-21 12:42:16

Question:

Which of the following method is used for multiclass classification?

1.one vs rest

2.loocv

3.all vs one

4.one vs another

Posted Date:-2022-06-21 12:44:18

Question:

Which of the following methods can not achieve zero training error on any linearly separable dataset?

1.decision tree

2.15-nearest neighbors

3. hard-margin svm

4.perceptron

Posted Date:-2022-06-21 11:55:46

Question:

Which of the following methods do we use to best fit the data in Logistic Regression?

1.least square error

2.maximum likelihood

3.jaccard distance

4.both a and b

Posted Date:-2022-06-21 12:55:35

Question:

Which of the following methods/methods do we use to find the best fit line for data in Linear Regression?

1.least square error

2.maximum likelihood

3.logarithmic loss

4.both a and b

Posted Date:-2022-06-21 12:54:22

Question:

Which one of the following is suitable? 1. When the hypothsis space is richer, overfitting is more likely. 2. when the feature space is larger , overfitting is more likely.

1.true, false

2.false, true

3.true,true

4.false,false

Posted Date:-2022-06-21 12:35:38

Question:

Which statement about outliers is true?

1.outliers should be part of the training dataset but should not be present in the test data

2.outliers should be identified and removed from a dataset

3.the nature of the problem determines how outliers are used

4.outliers should be part of the test dataset but should not be present in the training data

Posted Date:-2022-06-21 11:59:54

Question:

Wrapper methods are hyper-parameter selection methods that

1.should be used whenever possible because they are computationally efficient

2. should be avoided unless there are no other options because they are always prone to overfitting.

3. are useful mainly when the learning machines are â€œblack boxesâ€

4.should be avoided altogether.

Posted Date:-2022-06-21 11:54:50

Question:

You trained a binary classifier model which gives very high accuracy on the training data, but much lower accuracy on validation data. Which is false.

1. this is an instance of overfitting

2.this is an instance of underfitting

3.the training was not well regularized

4. the training and testing examples are sampled from different distributions

Posted Date:-2022-06-21 11:48:39

Posted on by R4R Team

More MCQS

Search

R4R Team

R4Rin Top Tutorials are Core Java,Hibernate ,Spring,Sturts.The content on R4R.in website is done by expert team not only with the help of books but along with the strong professional knowledge in all context like coding,designing, marketing,etc!

Machine Learning (ML)/Machine Learning (ML) MCQ Set 02 Sample Test,Sample questions

Question: How does the bias-variance decomposition of a ridge regression estimator compare with that of ordinaryleast squares regression?

Question: If TP=9 FP=6 FN=26 TN=70 then Error rate will be

Question: In SVM which has quadratic kernel function of polynomial degree 2 that has slack variable C as one hyper paramenter. What would happen if we use very large value for C

Question: Linear Regression is a _______ machine learning algorithm.

Question: Suppose your model is demonstrating high variance across the different training sets. Which of the following is NOT valid way to try and reduce the variance?

Question: The difference between the actual Y value and the predicted Y value found using a regression equation is called the

Question: What is the purpose of the Kernel Trick?

Question: Which of the following are components of generalization Error?

Question: A machine learning problem involves four attributes plus a class. The attributes have 3, 2, 2, and 2 possible values each. The class has 3 possible values. How many maximum possible different examples are there?

Question: A perceptron adds up all the weighted inputs it receives, and if it exceeds a certain value, it outputs a 1, otherwise it just outputs a 0.

Question: During the treatement of cancer patients , the doctor needs to be very careful about which patients need to be given chemotherapy.Which metric should we use in order to decide the patients who should given chemotherapy?

Question: he minimum time complexity for training an SVM is O(n2). According to this fact, what sizes of datasets are not best suited for SVMâ€™s?

Question: How can SVM be classified?

Question: In SVM, RBF kernel with appropriate parameters to perform binary classification where the data is non-linearly seperable. In this scenario

Question: Let S1 and S2 be the set of support vectors and w1 and w2 be the learnt weight vectors for a linearly separable problem using hard and soft margin linear SVMs respectively. Which of the following are correct?

Question: MLE estimates are often undesirable because

Question: Neural networks

Question: Perceptron Classifier is

Question: Suppose you are using RBF kernel in SVM with high Gamma value. What does this signify?

Question: Suppose you have trained an SVM with linear decision boundary after training SVM, you correctly infer that your SVM model is under fitting. Which of the following is best option would you more likely to consider iterating SVM next time?

Question: The cost parameter in the SVM means:

Question: The firing rate of a neuron

Question: The kernel trick

Question: The soft margin SVM is more preferred than the hard-margin SVM when-

Question: The SVMs are less effective when

Question: Type of dataset available in Supervised Learning is

Question: What are support vectors?

Question: What is the Accuracy in percentage based on following confusion matrix of three class classification. Confusion Matrix C=[14 0 0] [ 1 15 0] [ 0 0 6]

Question: What is the precision value for following confusion matrix of binary classification?

Question: What is/are true about kernel in SVM? 1. Kernel function map low dimensional data to high dimensional space2. Itâ€™s a similarity function

Question: which among the following is the most appropriate kernel that can be used with SVM to separate the classes

Question: Which of the following are real world applications of the SVM?

Question: Which of the following can help to reduce overfitting in an SVM classifier?

Question: Which of the following can only be used when training data are linearlyseparable?

Question: Which of the following evaluation metrics can not be applied in case of logistic regression output to compare with target?

Question: Which of the following is a categorical data?

Question: Which of the following is not a kernel method in SVM?

Question: Which of the following is true about SVM? 1. Kernel function map low dimensional data to high dimensional space. 2. It is a similarity Function

Question: Which of the following method is used for multiclass classification?

Question: Which of the following methods can not achieve zero training error on any linearly separable dataset?

Question: Which of the following methods do we use to best fit the data in Logistic Regression?

Question: Which of the following methods/methods do we use to find the best fit line for data in Linear Regression?

Question: Which one of the following is suitable? 1. When the hypothsis space is richer, overfitting is more likely. 2. when the feature space is larger , overfitting is more likely.

Question: Which statement about outliers is true?

Question: Wrapper methods are hyper-parameter selection methods that

Question: You trained a binary classifier model which gives very high accuracy on the training data, but much lower accuracy on validation data. Which is false.

More MCQS

Search

R4R Team

Question:
How does the bias-variance decomposition of a ridge regression estimator compare with that of ordinaryleast squares regression?

Question:
If TP=9 FP=6 FN=26 TN=70 then Error rate will be

Question:
In SVM which has quadratic kernel function of polynomial degree 2 that has slack variable C as one hyper paramenter. What would happen if we use very large value for C

Question:
Linear Regression is a _______ machine learning algorithm.

Question:
Suppose your model is demonstrating high variance across the different training sets. Which of the following is NOT valid way to try and reduce the variance?

Question:
The difference between the actual Y value and the predicted Y value found using a regression equation is called the

Question:
What is the purpose of the Kernel Trick?

Question:
Which of the following are components of generalization Error?

Question:
A machine learning problem involves four attributes plus a class. The attributes have 3, 2, 2, and 2 possible values each. The class has 3 possible values. How many maximum possible different examples are there?

Question:
A perceptron adds up all the weighted inputs it receives, and if it exceeds a certain value, it outputs a 1, otherwise it just outputs a 0.

Question:
During the treatement of cancer patients , the doctor needs to be very careful about which patients need to be given chemotherapy.Which metric should we use in order to decide the patients who should given chemotherapy?

Question:
he minimum time complexity for training an SVM is O(n2). According to this fact, what sizes of datasets are not best suited for SVMâ€™s?

Question:
How can SVM be classified?

Question:
In SVM, RBF kernel with appropriate parameters to perform binary classification where the data is non-linearly seperable. In this scenario

Question:
Let S1 and S2 be the set of support vectors and w1 and w2 be the learnt weight vectors for a linearly separable problem using hard and soft margin linear SVMs respectively. Which of the following are correct?

Question:
MLE estimates are often undesirable because

Question:
Neural networks

Question:
Perceptron Classifier is

Question:
Suppose you are using RBF kernel in SVM with high Gamma value. What does this signify?

Question:
Suppose you have trained an SVM with linear decision boundary after training SVM, you correctly infer that your SVM model is under fitting. Which of the following is best option would you more likely to consider iterating SVM next time?

Question:
The cost parameter in the SVM means:

Question:
The firing rate of a neuron

Question:
The kernel trick

Question:
The soft margin SVM is more preferred than the hard-margin SVM when-

Question:
The SVMs are less effective when

Question:
Type of dataset available in Supervised Learning is

Question:
What are support vectors?

Question:
What is the Accuracy in percentage based on following confusion matrix of three class classification. Confusion Matrix C=[14 0 0] [ 1 15 0] [ 0 0 6]

Question:
What is the precision value for following confusion matrix of binary classification?

Question:
What is/are true about kernel in SVM? 1. Kernel function map low dimensional data to high dimensional space2. Itâ€™s a similarity function

Question:
which among the following is the most appropriate kernel that can be used with SVM to separate the classes

Question:
Which of the following are real world applications of the SVM?

Question:
Which of the following can help to reduce overfitting in an SVM classifier?

Question:
Which of the following can only be used when training data are linearlyseparable?

Question:
Which of the following evaluation metrics can not be applied in case of logistic regression output to compare with target?

Question:
Which of the following is a categorical data?

Question:
Which of the following is not a kernel method in SVM?

Question:
Which of the following is true about SVM? 1. Kernel function map low dimensional data to high dimensional space. 2. It is a similarity Function

Question:
Which of the following method is used for multiclass classification?

Question:
Which of the following methods can not achieve zero training error on any linearly separable dataset?

Question:
Which of the following methods do we use to best fit the data in Logistic Regression?

Question:
Which of the following methods/methods do we use to find the best fit line for data in Linear Regression?

Question:
Which one of the following is suitable? 1. When the hypothsis space is richer, overfitting is more likely. 2. when the feature space is larger , overfitting is more likely.

Question:
Which statement about outliers is true?

Question:
Wrapper methods are hyper-parameter selection methods that

Question:
You trained a binary classifier model which gives very high accuracy on the training data, but much lower accuracy on validation data. Which is false.