Introduction to Machine Learning Nptel Week 6 Answers

Last updated: March 2, 2025

Are you looking for Introduction to Machine Learning Nptel Week 6 Answers? You’ve come to the right place! Access the latest and most accurate solutions for your Week 6 assignment in the Introduction to Machine Learning course.

Course Link: Click Here

Introduction to Machine Learning Nptel Week 6 Answers (Jan-Apr 2025)

1) Statement: Decision Tree is an unsupervised learning algorithm.
Reason: The splitting criterion uses only the features of the data to calculate their respective measures.

a) Statement is True. Reason is True.
b) Statement is True. Reason is False.
c) Statement is False. Reason is True.
d) Statement is False. Reason is False.

View Answer

2) Increasing the pruning strength in a decision tree by reducing the maximum depth:

a) Will always result in improved validation accuracy.
b) Will lead to more overfitting.
c) Might lead to underfitting if set too aggressively.
d) Will have no impact on the tree’s performance.
e) Will eliminate the need for validation data.

View Answer

3) What is a common indicator of overfitting in a decision tree?

a) The training accuracy is high while the validation accuracy is low.
b) The tree is shallow.
c) The tree has only a few leaf nodes.
d) The tree’s depth matches the number of attributes in the dataset.
e) The tree’s predictions are consistently biased.

View Answer

4) Consider the following statements:
Statement 1: Decision Trees are linear non-parametric models.
Statement 2: A decision tree may be used to explain the complex function learned by a neural network.

a) Both the statements are True.
b) Statement 1 is True, but Statement 2 is False.
c) Statement 1 is False, but Statement 2 is True.
d) Both the statements are False.

View Answer

5) Entropy for a 50-50 split between two classes is:

a) 0
b) 0.5
c) 1
d) None of the above

View Answer

6) Consider a dataset with only one attribute (categorical). Suppose there are 10 unordered values in this attribute. How many possible combinations are needed to find the best split-point for building the decision tree classifier?

a) 1024
b) 511
c) 1023
d) 512

View Answer

7) What is the initial entropy of Malignant?

a) 0.543
b) 0.9798
c) 0.8732
d) 1

View Answer

8) For the same dataset, what is the info gain of Vaccination?

a) 0.4763
b) 0.2102
c) 0.1134
d) 0.9355

View Answer

Introduction to Machine Learning Nptel Week 6 Answers (July-Dec 2024)

1. Entropy for a 90-10 split between two classes is:

A) 0.469
B) 0.195
C) 0.204
D) None of the above

Answer: A) 0.469

2. Consider a dataset with only one attribute (categorical). Suppose, there are 8 unordered values in this attribute, how many possible combinations are needed to find the best split-point for building the decision tree classifier?

A) 511
B) 1023
C) 512
D) 127

Answer: A) 511

3. Having built a decision tree, we are using reduced error pruning to reduce the size of the tree. We select a node to collapse. For this particular node, on the left branch, there are three training data points with the following outputs: 5, 7, 9.6, and for the right branch, there are four training data points with the following outputs: 8.7, 9.8, 10.5, 11. The average value of the outputs of data points denotes the response of a branch. The original responses for data points along the two branches (left & right respectively) were response−left and response−right and the new response after collapsing the node is response−new. What are the values for response−left, response−right, and response−new (numbers in the option are given in the same order)?

A) 9.6, 11, 10.4
B) 7.2; 10; 8.8
C) 5, 10.5, 15
D) Depends on the tree height

Answer: B) 7.2; 10; 8.8

4. Which of the following is a good strategy for reducing the variance in a decision tree?

A) If improvement of taking any split is very small, don’t make a split. (Early Stopping)
B) Stop splitting a leaf when the number of points is less than a set threshold K.
C) Stop splitting all leaves in the decision tree when any one leaf has less than a set threshold K points.
D) None of the Above.

Answer: B) Stop splitting a leaf when the number of points is less than a set threshold K

These are Introduction to Machine Learning Nptel Week 6 Answers

5. Which of the following statements about multiway splits in decision trees with categorical features is correct?

A) They always result in deeper trees compared to binary splits
B) They always provide better interpretability than binary splits
C) They can lead to overfitting when dealing with high-cardinality categorical features
D) They are computationally less expensive than binary splits for all categorical features

Answer: C) They can lead to overfitting when dealing with high-cardinality categorical features

6. Which of the following statements about imputation in data preprocessing is most accurate?

A) Mean imputation is always the best method for handling missing numerical data
B) Imputation should always be performed after splitting the data into training and test sets
C) Missing data is best handled by simply removing all rows with any missing values
D) Multiple imputation typically produces less biased estimates than single imputation methods

Answer: D) Multiple imputation typically produces less biased estimates than single imputation methods

7. Consider the following dataset:

Which among the following split-points for feature2 would give the best split according to the misclassification error?

A) 186.5
B) 188.6
C) 189.2
D) 198.1

Answer: C) 189.2

These are Introduction to Machine Learning Nptel Week 6 Answers

All weeks of Introduction to Machine Learning: Click Here

For answers to additional Nptel courses, please refer to this link: NPTEL Assignment Answers