Natural Language Processing Nptel Week 2 Quiz Answers
Are you looking for Natural Language Processing Nptel Week 2 Quiz Answers? You’ve come to the right place! Access the most accurate answers at Progiez.
Table of Contents
Natural Language Processing Nptel Week 2 Quiz Answers (Jan-Apr 2025)
Course Link: Click Here
1. According to Zipf’s law which statements) is/are correct?
(i) A small number of words occur with high frequency.
(ii) A large number of words occur with low frequency.
a. Both (i) and (ii) are correct
b. Only (ii) is correct
c. Only (i) is correct
d. Neither (i) nor (ii) is correct
2. Consider the following corpus Ci of 4 sentences. What is the total count of unique bi-grams for which the likelihood will be estimated? Assume we do not perform any pre-processing.
tomorrow is Sachin’s birthday
He loves cream chocolates
he is also fond of sweet cake
we will celebrate his birthday with sweet chocolate cake
today is Sneha’s birthday
she likes ice cream
she is also fond of cream cake
we will celebrate her birthday with ice cream cake
a. 24
b. 28
c. 27
d. 23
3. A 4-gram model is a______________order Markov Model.
a. Two
b. Five
c. Four
d. Three
4. Which of these is/are – valid Markov assumptions?
a. The probability of a word depends only on the current word.
b. The probability of a word depends only on the previous word.
c. The probability of a word depends only on the next word.
d. The probability of a word depends only on the current and the previous word.
5. For the string ‘mash’, identify which of the following set of strings has a Levenshtein distance of
1.
a. smash, mas, lash, mushy, hash
b. bash, stash, lush, flash, dash
c. smash, mas, lash, mush, ash
d. None of the above
6. Assume that we modify the costs incurred for operations in calculating Levenshtein distance,
such that both the insertion and deletion operations incur a cost of 1 each, while substitution
incurs a cost of 2. Now, for the string ‘clash’ which of the following set of strings will have an
edit distance of 1?
a. ash, slash, clash, flush
b. flash, stash, lush, blush,
c. slash, last, bash, ash
d. None of the above
7. Given a corpus C2, the Maximum Likelihood Estimation (MLE) for the bigram “dried berries” is
0.45 and the count of occurrence of the word “dried” is 720. For the same corpus C the likelihood
of “dried berries” after applying add-one smoothing is 0.05. What is the vocabulary size of C2?
a. 4780
b. 3795
c. 4955
d. 5780
8. Calculate P(they play in a big garden) assuming a bi-gram language model.
a. 1/8
b. 1/12
c. 1/24
d. None of the above
9. Considering the same model as in Question 7, calculate the perplexity of they play in a big
garden < |s>.
a. 2.289
b. 1.426
C. 1.574
d. 2.178
10. Assume that you are using a bi-gram language model with add one smoothing. Calculate P(they play in a beautiful garden).
a. 4.472 × 101-6
b. 2.236 × 10^-6
c. 3.135 × 10^-6
d. None of the above
Natural Language Processing Nptel Week 2 Quiz Answers
For answers to others Nptel courses, please refer to this link: NPTEL Assignment