Natural Language Processing Nptel Week 2 Quiz Answers

Are you looking for Natural Language Processing Nptel Week 2 Quiz Answers? You’ve come to the right place! Access the most accurate answers at Progiez.

Natural Language Processing Nptel Week 2 Quiz Answers (Jan-Apr 2025)


1. According to Zipf’s law, which statement(s) is/are correct?
(i) A small number of words occur with high frequency.
(ii) A large number of words occur with low frequency.

a. Both (i) and (ii) are correct
b. Only (ii) is correct
c. Only (i) is correct
d. Neither (i) nor (ii) is correct


2. Consider the following corpus C1 of 4 sentences. What is the total count of unique bi-grams for which the likelihood will be estimated? Assume we do not perform any pre-processing.

tomorrow is Sachin’s birthday

He loves cream chocolates

he is also fond of sweet cake

we will celebrate his birthday with sweet chocolate cake

today is Sneha’s birthday

she likes ice cream

she is also fond of cream cake

we will celebrate her birthday with ice cream cake

a. 24

b. 28

c. 27

d. 23
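
A count like this can be checked by enumerating the bigrams directly. The following is a minimal sketch, not the course's reference solution. It assumes whitespace tokenization, no lowercasing (so "He" and "he" stay distinct), and that the corpus is the first four sentences listed above. Whether sentence-boundary markers `<s>` and `</s>` are counted is also an assumption, so the hypothetical `pad` flag lets you try both conventions.

```python
def unique_bigrams(sentences, pad=True):
    """Return the set of distinct bigrams over whitespace-tokenized sentences."""
    bigrams = set()
    for sentence in sentences:
        tokens = sentence.split()  # no pre-processing: case and punctuation are kept
        if pad:
            # Assumption: each sentence is wrapped in boundary markers.
            tokens = ["<s>"] + tokens + ["</s>"]
        bigrams.update(zip(tokens, tokens[1:]))
    return bigrams


# Assumption: the 4-sentence corpus is the first group of sentences in the question.
corpus = [
    "tomorrow is Sachin’s birthday",
    "He loves cream chocolates",
    "he is also fond of sweet cake",
    "we will celebrate his birthday with sweet chocolate cake",
]

print(len(unique_bigrams(corpus, pad=True)))   # 27 with boundary markers
print(len(unique_bigrams(corpus, pad=False)))  # 20 without boundary markers
```

With padding, the only repeated bigram is (`cake`, `</s>`), which is why the padded count is one less than the total number of bigram tokens.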


3. A 4-gram model is a ______ order Markov Model.

a. Two
b. Five
c. Four
d. Three


4. Which of these is/are valid Markov assumptions?

a. The probability of a word depends only on the current word.
b. The probability of a word depends only on the previous word.
c. The probability of a word depends only on the next word.
d. The probability of a word depends only on the current and the previous word.


5. For the string ‘mash’, identify which of the following sets of strings has a Levenshtein distance of 1.

a. smash, mas, lash, mushy, hash
b. bash, stash, lush, flash, dash
c. smash, mas, lash, mush, ash
d. None of the above
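
Each option can be checked mechanically with the standard dynamic-programming edit distance. This is a minimal sketch, assuming the usual unit costs for insertion, deletion, and substitution. The function name `levenshtein` and the `options` dictionary are illustrative choices, not part of the quiz.

```python
def levenshtein(a, b):
    """Edit distance where insertion, deletion, and substitution each cost 1."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,               # delete ca
                curr[j - 1] + 1,           # insert cb
                prev[j - 1] + (ca != cb),  # substitute, or free match
            ))
        prev = curr
    return prev[-1]


options = {
    "a": ["smash", "mas", "lash", "mushy", "hash"],
    "b": ["bash", "stash", "lush", "flash", "dash"],
    "c": ["smash", "mas", "lash", "mush", "ash"],
}

for label, words in options.items():
    print(label, {w: levenshtein("mash", w) for w in words})
```

An option qualifies only if every string in it is at distance 1. For example, "mushy" needs a substitution and an insertion, so it is at distance 2.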


6. Assume that we modify the costs incurred for operations in calculating Levenshtein distance, such that both the insertion and deletion operations incur a cost of 1 each, while substitution incurs a cost of 2. Now, for the string ‘clash’, which of the following sets of strings will have an edit distance of 1?

a. ash, slash, clash, flush
b. flash, stash, lush, blush
c. slash, last, bash, ash
d. None of the above
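
The modified costs only change the substitution term of the usual recurrence. Here is a minimal sketch with the costs as parameters, where the function and parameter names are illustrative. With substitution at 2, replacing a character costs the same as one deletion plus one insertion.

```python
def weighted_edit_distance(a, b, ins_cost=1, del_cost=1, sub_cost=2):
    """Edit distance with configurable costs; defaults follow the question above."""
    prev = [j * ins_cost for j in range(len(b) + 1)]
    for i, ca in enumerate(a, start=1):
        curr = [i * del_cost]
        for j, cb in enumerate(b, start=1):
            curr.append(min(
                prev[j] + del_cost,                          # delete ca
                curr[j - 1] + ins_cost,                      # insert cb
                prev[j - 1] + (0 if ca == cb else sub_cost), # match or substitute
            ))
        prev = curr
    return prev[-1]


for word in ["ash", "slash", "clash", "flush", "flash", "last", "bash"]:
    print(word, weighted_edit_distance("clash", word))
```

Under these costs a string is at distance 1 from ‘clash’ only if it differs by a single insertion or deletion. Same-length strings that differ from ‘clash’, such as "slash" or "flash", are therefore at distance 2 or more.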


7. Given a corpus C2, the Maximum Likelihood Estimation (MLE) for the bigram “dried berries” is 0.45 and the count of occurrence of the word “dried” is 720. For the same corpus C2, the likelihood of “dried berries” after applying add-one smoothing is 0.05. What is the vocabulary size of C2?

a. 4780
b. 3795
c. 4955
d. 5780
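
The arithmetic follows from the two standard estimates: MLE gives P = c(dried berries) / c(dried), and add-one smoothing gives P = (c(dried berries) + 1) / (c(dried) + V). A short sketch of solving for V, with illustrative variable names:

```python
mle = 0.45          # unsmoothed P(berries | dried)
count_dried = 720   # c(dried)
smoothed = 0.05     # add-one smoothed P(berries | dried)

# Recover the bigram count from the MLE: c(dried berries) = 0.45 * 720.
count_bigram = round(mle * count_dried)

# Solve (count_bigram + 1) / (count_dried + V) = smoothed for V.
vocab_size = round((count_bigram + 1) / smoothed - count_dried)

print(count_bigram)  # 324
print(vocab_size)    # 5780
```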


8. Calculate P(they play in a big garden) assuming a bi-gram language model.

a. 1/8
b. 1/12
c. 1/24
d. None of the above


9. Considering the same model as in Question 7, calculate the perplexity of they play in a big garden </s>.

a. 2.289
b. 1.426
c. 1.574
d. 2.178
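
Perplexity is the inverse probability of the sequence, normalized by the number of predicted tokens N: PP = P(w1 ... wN)^(-1/N). The training corpus for this question is not reproduced on this page, so the sketch below uses hypothetical bigram probabilities purely to show the computation. Counting `</s>` as one of the N predicted tokens is also an assumption.

```python
import math


def perplexity(token_probs):
    """Perplexity from the conditional probability of each predicted token."""
    n = len(token_probs)
    # Sum logs instead of multiplying to avoid underflow on long sequences.
    log_prob = sum(math.log(p) for p in token_probs)
    return math.exp(-log_prob / n)


# Hypothetical values, NOT taken from the quiz corpus:
# sequence probability = 0.5 * 0.5 * 0.25 * 1.0 = 1/16, with N = 4.
example_probs = [0.5, 0.5, 0.25, 1.0]
print(perplexity(example_probs))  # approximately 2.0, since 16 ** (1/4) = 2
```

To apply this to the question, substitute the bigram probabilities estimated from the quiz corpus, one per predicted token.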


10. Assume that you are using a bi-gram language model with add one smoothing. Calculate P(they play in a beautiful garden).

a. 4.472 × 10^-6
b. 2.236 × 10^-6
c. 3.135 × 10^-6
d. None of the above

