Data Analytics with Python Nptel Week 11 Assignment Answers

Looking for Data Analytics with Python Nptel Week 11 quiz answers? You are in the right place for the Week 11 assignment answers.



Data Analytics with Python Nptel Week 11 Quiz Answers

Data Analytics with Python Nptel Week 11 Quiz Answers (Jan-Apr 2025)

Course link: Click here


1) Which library is used for calculating distance measures in clustering using Python?

a) distance_matrix
b) scipy.spatial
c) scipy_spatial
d) distance.matrix
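
As a minimal sketch of how pairwise distances are usually computed in Python, the snippet below calls `distance_matrix` from the `scipy.spatial` module. The three sample points are made up for illustration and are not part of the assignment.

```python
import numpy as np
from scipy.spatial import distance_matrix

# Hypothetical 2-D observations, chosen so the distances are easy to check by hand.
points = np.array([[0.0, 0.0], [3.0, 4.0], [6.0, 8.0]])

# Pairwise Minkowski distances; the default p=2 gives Euclidean distance.
dist = distance_matrix(points, points)
print(dist)
```

The result is a symmetric 3 x 3 array with zeros on the diagonal, which is the form most clustering routines expect for a precomputed distance input.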



2) The formula for computing dissimilarity between two objects described by categorical variables is:

Here p is the total number of categorical variables, and m is the number of variables on which the two objects match.

a) D(i, j) = (p - m) / p
b) D(i, j) = (p - m) / m
c) D(i, j) = (m - p) / p
d) D(i, j) = (m - p) / m
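
The simple matching approach for categorical data counts the mismatching variables and divides by the total number of variables. A minimal sketch follows; the function name `categorical_dissimilarity` and the two example objects are hypothetical and only serve to illustrate the ratio.

```python
def categorical_dissimilarity(obj_i, obj_j):
    """Simple matching dissimilarity: share of variables on which two objects differ."""
    if len(obj_i) != len(obj_j):
        raise ValueError("both objects must be described by the same variables")
    p = len(obj_i)                                   # total number of categorical variables
    m = sum(a == b for a, b in zip(obj_i, obj_j))    # number of matches
    return (p - m) / p

# Hypothetical objects described by three categorical variables (colour, size, shape).
obj_i = ("red", "small", "round")
obj_j = ("red", "large", "round")
d_ij = categorical_dissimilarity(obj_i, obj_j)       # p = 3, m = 2
print(d_ij)
```

Identical objects give 0 and objects that differ on every variable give 1, so the value always lies between 0 and 1.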



3) Select the correct option for a dataset with 7 objects and an interval-scaled variable ‘f’ with the following measurements:

f = (1, 2, 3, 4, 5, 8, 50), containing one outlying value.

a) Standard deviation (std_f) and mean absolute deviation (s_f) are equally affected by the outlier.
b) Mean absolute deviation (s_f) is more affected by the outlier.
c) Standard deviation (std_f) is less affected by the outlier.
d) Standard deviation (std_f) is more affected by the outlier.
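
One way to check the effect of the outlier is to compute both measures with and without the value 50. The sketch below assumes the population form of the standard deviation (dividing by n) and takes the mean absolute deviation around the mean; the helper name `spread` is hypothetical.

```python
import numpy as np

f = np.array([1, 2, 3, 4, 5, 8, 50], dtype=float)

def spread(x):
    """Return (mean absolute deviation, population standard deviation) of x."""
    mean = x.mean()
    s = np.abs(x - mean).mean()                  # deviations enter linearly
    std = np.sqrt(((x - mean) ** 2).mean())      # deviations are squared first
    return s, std

s_f, std_f = spread(f)                  # with the outlier
s_clean, std_clean = spread(f[:-1])     # same data without the value 50

print("with outlier:    s_f = %.3f, std_f = %.3f" % (s_f, std_f))
print("without outlier: s_f = %.3f, std_f = %.3f" % (s_clean, std_clean))
print("growth factor:   s_f x%.2f, std_f x%.2f" % (s_f / s_clean, std_f / std_clean))
```

Because the standard deviation squares each deviation, a single large deviation contributes disproportionately to it, while the mean absolute deviation weights every deviation linearly.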



4) Select the correct statement about standardization from the following options:

a) Standardizing the data always gives inefficient results while making clusters.
b) Standardizing the data is always beneficial during clustering analysis.
c) The variables having an absolute value may not be efficient after standardization during clustering analysis.
d) Outliers cannot be detected by standardized data.



5) Which of the following can act as possible termination conditions in K-Means?

  1. For a fixed number of iterations.
  2. Assignment of observations to clusters does not change between iterations, except in cases with a bad local minimum.
  3. Centroids do not change between successive iterations.
  4. Terminate when RSS falls below a threshold.

a) 1, 3, and 4
b) 1, 2, 3, and 4
c) 2 and 3
d) None of these





6) In the figure below, if you draw a horizontal line on the y-axis at y=2, how many clusters will be formed?

[Figure not reproduced here]

a) 1
b) 2
c) 3
d) 4
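
Drawing a horizontal line across a dendrogram corresponds to cutting the tree at that height and counting the branches the line crosses. Since the quiz figure is not available here, the sketch below uses made-up one-dimensional data and single linkage (both assumptions), so its result does not answer the question above; it only shows the mechanics with SciPy's `linkage` and `fcluster`.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

# Hypothetical 1-D observations (one column), not the data behind the quiz figure.
X = np.array([[0.0], [1.0], [5.0], [6.0], [20.0]])

# Build the merge tree; "single" linkage is an arbitrary choice for this sketch.
Z = linkage(X, method="single")

# Cut the tree at height y = 2: merges above that height are undone.
flat_labels = fcluster(Z, t=2, criterion="distance")
n_clusters = len(np.unique(flat_labels))
print(flat_labels, n_clusters)
```

Here the pairs (0, 1) and (5, 6) merge at height 1, below the cut, while every later merge happens above height 2, so the cut leaves three groups.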



7) Which of the following clustering techniques requires a merging approach?

a) Partitional
b) Naive Bayes
c) Hierarchical
d) None of the above



8) State True or False: Hierarchical clustering should primarily be used for data exploration.

a) True
b) False



9) State True or False: For finding dissimilarity between two clusters in hierarchical clustering, average-link is the only metric used.

a) True
b) False
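
Several inter-cluster dissimilarity rules are in common use, and SciPy's `linkage` function exposes them through its `method` argument. The sketch below runs four of them on the same made-up one-dimensional data (an assumption for illustration) and compares the height of the final merge.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage

# Hypothetical 1-D observations used only to compare linkage rules.
X = np.array([[0.0], [1.0], [5.0], [6.0], [20.0]])

final_heights = {}
for method in ("single", "complete", "average", "ward"):
    Z = linkage(X, method=method)
    # The last row of Z describes the final merge; column 2 holds its height.
    final_heights[method] = Z[-1, 2]

for method, height in final_heights.items():
    print(method, round(float(height), 3))
```

Single link uses the closest pair of points between two clusters, complete link the farthest pair, and average link the mean over all pairs, which is why the three heights differ on the same data.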



10) If two variables V1 and V2 are used for clustering, which of the following are true for K-means clustering with k = 3?

  1. If V1 and V2 have a correlation of 1, the cluster centroids will be in a straight line.
  2. If V1 and V2 have a correlation of 0, the cluster centroids will be in a straight line.

a) 1 only
b) 2 only
c) 1 and 2
d) None of the above
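
A correlation of exactly 1 means every observation lies on one straight line, and a centroid is just the mean of the points assigned to it. The sketch below illustrates this with SciPy's `kmeans2`; the line V2 = 2 * V1 + 3 and the sample size are arbitrary assumptions.

```python
import numpy as np
from scipy.cluster.vq import kmeans2

rng = np.random.default_rng(0)

# Hypothetical data with correlation exactly 1: every point lies on V2 = 2 * V1 + 3.
v1 = rng.uniform(0.0, 10.0, size=60)
v2 = 2.0 * v1 + 3.0
data = np.column_stack([v1, v2])

# k = 3 clusters, initial centroids drawn from the data points themselves.
centroids, labels = kmeans2(data, 3, minit="points")

# Each centroid is a mean of points on the line, so it should satisfy the same equation.
on_line = np.allclose(centroids[:, 1], 2.0 * centroids[:, 0] + 3.0)
print(centroids)
print("centroids collinear:", on_line)
```

When the correlation is 0 the points are not confined to a line, so nothing forces the three centroids to be collinear.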



For answers to additional Nptel courses, please refer to this link: NPTEL Assignment
