# Introduction to Machine Learning NPTEL Week 6 Answers

Are you looking for Introduction to Machine Learning NPTEL Week 6 answers? You've come to the right place! Below are the latest solutions for the Week 6 assignment of the Introduction to Machine Learning course.

## Introduction to Machine Learning NPTEL Week 6 Answers (July-Dec 2024)

1. Entropy for a 90-10 split between two classes is:

A) 0.469
B) 0.195
C) 0.204
D) None of the above
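The binary entropy for a given class split can be checked directly. A minimal sketch, using the base-2 logarithm convention common in decision tree literature:

```python
import math

def binary_entropy(p: float) -> float:
    """Entropy (base 2) of a two-class split where one class has probability p."""
    if p in (0.0, 1.0):
        return 0.0  # a pure node has zero entropy
    return -p * math.log2(p) - (1 - p) * math.log2(1 - p)

# A 90-10 split between two classes:
print(round(binary_entropy(0.9), 3))  # → 0.469
```
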

2. Consider a dataset with only one categorical attribute. Suppose there are 8 unordered values for this attribute. How many possible combinations are needed to find the best split point for building the decision tree classifier?

A) 511
B) 1023
C) 512
D) 127
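For an unordered categorical attribute with k distinct values, each binary split assigns every value to one of two sides; dividing by two for symmetry and discarding the empty partition gives 2^(k-1) − 1 candidate splits. A quick sketch of that count:

```python
def num_binary_splits(k: int) -> int:
    """Number of distinct binary partitions of k unordered categorical values."""
    # 2^k assignments of values to two sides, halved for symmetry,
    # minus 1 for the trivial split that leaves one side empty.
    return 2 ** (k - 1) - 1

print(num_binary_splits(8))  # → 127
```
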

3. Having built a decision tree, we are using reduced error pruning to reduce the size of the tree. We select a node to collapse. For this node, the left branch holds three training data points with outputs 5, 7, 9.6, and the right branch holds four training data points with outputs 8.7, 9.8, 10.5, 11. The response of a branch is the average output of its data points. Let response_left and response_right denote the original responses along the two branches, and response_new the response after collapsing the node. What are the values of response_left, response_right, and response_new (the numbers in each option are given in that order)?

A) 9.6, 11, 10.4
B) 7.2, 10, 8.8
C) 5, 10.5, 15
D) Depends on the tree height
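The branch responses and the collapsed-node response are just means over the relevant training outputs. A small sketch of that computation:

```python
def branch_responses(left, right):
    """Mean response of each branch, and of the node after collapsing both."""
    resp_left = sum(left) / len(left)
    resp_right = sum(right) / len(right)
    # Collapsing the node pools all points into a single mean.
    resp_new = (sum(left) + sum(right)) / (len(left) + len(right))
    return resp_left, resp_right, resp_new

left = [5, 7, 9.6]
right = [8.7, 9.8, 10.5, 11]
print(branch_responses(left, right))  # → approximately (7.2, 10.0, 8.8)
```
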

4. Which of the following is a good strategy for reducing the variance in a decision tree?

A) If improvement of taking any split is very small, don’t make a split. (Early Stopping)
B) Stop splitting a leaf when the number of points is less than a set threshold K.
C) Stop splitting all leaves in the decision tree when any one leaf has less than a set threshold K points.
D) None of the Above.

Answer: B) Stop splitting a leaf when the number of points is less than a set threshold K

These are Introduction to Machine Learning NPTEL Week 6 Answers.

5. Which of the following statements about multiway splits in decision trees with categorical features is correct?

A) They always result in deeper trees compared to binary splits
B) They always provide better interpretability than binary splits
C) They can lead to overfitting when dealing with high-cardinality categorical features
D) They are computationally less expensive than binary splits for all categorical features

Answer: C) They can lead to overfitting when dealing with high-cardinality categorical features

6. Which of the following statements about imputation in data preprocessing is most accurate?

A) Mean imputation is always the best method for handling missing numerical data
B) Imputation should always be performed after splitting the data into training and test sets
C) Missing data is best handled by simply removing all rows with any missing values
D) Multiple imputation typically produces less biased estimates than single imputation methods

Answer: D) Multiple imputation typically produces less biased estimates than single imputation methods

7. Consider the following dataset:

Which among the following split-points for feature2 would give the best split according to the misclassification error?

A) 186.5
B) 188.6
C) 189.2
D) 198.1


## Introduction to Machine Learning NPTEL Week 6 Answers (Jan-Apr 2024)

Course name: Introduction to Machine Learning


Q1. From the given dataset, choose the optimal decision tree learned by a greedy approach:
a)
b)
c)
d) None of the above.

Q2. Which of the following properties are characteristic of decision trees?
High bias
High variance
Lack of smoothness of prediction surfaces
Unbounded parameter set


Q3. Entropy for a 50-50 split between two classes is:
0
0.5
1
None of the above

Q4. Having built a decision tree, we are using reduced error pruning to reduce the size of the tree. We select a node to collapse. For this node, the left branch holds three training data points with outputs 5, 7, 9.6, and the right branch holds four training data points with outputs 8.7, 9.8, 10.5, 11. What were the original responses for the data points along the two branches (left and right, respectively), and what is the new response after collapsing the node?
10.8, 13.33, 14.48
10.8, 13.33, 12.06
7.2, 10, 8.8
7.2, 10, 8.6


Q5. Given that the same feature can be selected multiple times during the recursive partitioning of the input space, is it always possible to achieve 100% accuracy on the training data (assuming trees are allowed to grow to their maximum size) when building decision trees?
Yes
No

Q6. Suppose on performing reduced error pruning, we collapsed a node and observed an improvement in the prediction accuracy on the validation set. Which among the following statements are possible in light of the performance improvement observed?
The collapsed node helped overcome the effect of one or more noise affected data points in the training set
The validation set had one or more noise affected data points in the region corresponding to the collapsed node
The validation set did not have any data points along at least one of the collapsed branches
The validation set did not contain data points which were adversely affected by the collapsed node.


Q7. Consider the following data set:
Considering ‘profitable’ as the binary-valued attribute we are trying to predict, which of the attributes would you select as the root of a decision tree with multi-way splits using the cross-entropy impurity measure?

price
maintenance
capacity
airbag

Q8. For the same data set, suppose we decide to construct a decision tree using binary splits and the Gini index impurity measure. Which among the following feature and split point combinations would be the best to use as the root node assuming that we consider each of the input features to be unordered?
price – {low, med}|{high}
maintenance – {high}|{med, low}
maintenance – {high, med}|{low}
capacity – {2}|{4, 5}
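The course dataset is not reproduced here, so the candidate splits cannot be evaluated directly, but the size-weighted Gini index used to compare them can be sketched generically. The labels below are hypothetical, purely for illustration:

```python
from collections import Counter

def gini(labels):
    """Gini impurity of a list of class labels: 1 - sum of squared class frequencies."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def weighted_gini(left_labels, right_labels):
    """Impurity of a binary split, weighting each branch by its share of the data.
    The best root split is the one minimizing this value."""
    n = len(left_labels) + len(right_labels)
    return (len(left_labels) / n) * gini(left_labels) \
         + (len(right_labels) / n) * gini(right_labels)

# Hypothetical 'profitable' labels after a split such as price {low, med} | {high}
# (illustrative only; not the actual course dataset):
left = ['yes', 'yes', 'no']
right = ['no', 'no']
print(weighted_gini(left, right))
```

In practice, one would compute this weighted impurity for every feature-and-split-point combination listed in the options and pick the combination with the lowest value.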