Python for Data Science NPTEL | Week 3

Session: JAN-APR 2023

Course Name: Python for Data Science

Course Link: Click Here

These are NPTEL Python for Data Science Assignment 3 Answers


Q1. Which of the following is the correct approach to fill missing values in case of categorical variable?
a. Mean
b. Median
c. Mode
d. None of the above

Answer: c. Mode


Assume a pandas dataframe df_cars which when printed is as shown below. Based on this information, answer questions 2 and 3.

image 24

Q2. Of the following set of statements, which of them can be used to extract the column Type as a separate dataframe?
a. df cars[[’Type’]]
b. df cars.iloc[[:, 1]
c. df cars.loc[:, [’Type’]]
d. None of the above

Answer: a, c


These are NPTEL Python for Data Science Assignment 3 Answers


Q3. The method df_cars.describe() will give description of which of the following column?
a. Car name
b. Brand
c. Price (in lakhs)
d. All of the above

Answer: c. Price (in lakhs)


Q4. Which pandas function is used to stack the dataframes vertically?
a. pd.merge()
b. pd.concat()
c. join()
d. None of the above

Answer: b. pd.concat()


These are NPTEL Python for Data Science Assignment 3 Answers


Q5. Which of the following are liabraries in Python?
a. Pandas
b. Matplotlib
c. NumPy
d. All of the above

Answer: d. All of the above


Read the comma-separated values file hotel bookings.csv as a dataframe data hotel and answer questions 6 – 8. Please refer to Hotel Bookings Data Description.pdf for data and variable description.

Q6. Choose the appropriate command(s) to filter those booking details whose reservation-status are a No-show?

a.

image 26

b.

image 27

c.

image 28

d.

image 29

Answer: b, d


These are NPTEL Python for Data Science Assignment 3 Answers


Q7. From the same data, find how many bookings were not canceled in the year 2017?
a. 9064
b. 6231
c. 9046
d. None of the above

Answer: a. 9064


Q9. From the total bookings that were made in 2017 and not canceled, which month had the highest number of repeated guests?
a. July
b. February
c. January
d. None of the above

Answer: c. January


These are NPTEL Python for Data Science Assignment 3 Answers


Q9. What will be the output of the following code?

image 30

a. [bool, int, float, float, str]
b. [str, int, float, float, str]
c. [bool, int, float, int, str]
d. [bool, int, int, float, str]

Answer: a. [bool, int, float, float, str]


Q10. Which command is used to generate the plot shown below?

image 31

a. plt.plot(x, linestyle = “-”)
b. plt.plot(x, linestyle = “–”)
c. plt.plot(x, linestyle = “-.”)
d. plt.plot(x, linestyle = “:”)

Answer: a. plt.plot(x, linestyle = “-”)


These are NPTEL Python for Data Science Assignment 3 Answers

More Weeks of Python for Data Science NPTEL: Click here

More NPTEL courses: https://progiez.com/nptel


Session: JULY-DEC 2022

Course name: Python for Data Science

Link to Enroll: Click Here

These are NPTEL Python for Data Science Assignment 3 Answers


Q1. Choose the appropriate command(s) to filter those booking details whose reservation_status are a No-show?
a. data_hotel_ns datahotel. loc[data_hotel.reservation_status=’No-Show’]
b. data_hotel_ns = data_hotel[ data _hotel.reservation_status = “No-Show’]
c. data hotel_ns = data_hotel.reservation_status.loc[data_hotel.isin([‘No-Show’])]
d. data_hotel_ns = data_hotel.loc [data hotel.reservation_status.isin([ No-Show’])]

Answer:b, d


Q2. From the same data, find how many bookings were not cancelled in the year 2017?

a. 9064
b. 6231
c. 9046
d. None of the above

Answer: a. 9064


Q3. From the total bookings that were made in 2017 and not cancelled, which month had the highest number of repeated guests?
a. July
b. February
c. January
d. None of the above

Answer: c. January


These are NPTEL Python for Data Science Assignment 3 Answers


Q4. Which of the following commands can be used to create a variable Flag, and set the values as Premium when the rating is equal to or greater than 3.25, and otherwise as Regular?
a. dt_cocoa[°Flag’] = [“Premium” if x 3.25 else “Regular” for x in dt_cocoa[‘Rating’ ]]
b. dt_cocoa[“Flag’] = [“Premium” if x 3.25 else “Regular” for x in dt_cocoa[ _` Rating ‘]]
c. dt_cocoa[“Flag”] = np.where(dt_cocoa[ “Rating”] < 3.25, “Regular”, “Premium”)
d. None of the above

Answer: b, c


Q5. Which instruction can be used to impute the missing values in the column Review Data from the dataframe dt_cocoa by grouping the records company–wise?

Answer: a


Q6. After checking the data summary, which feature requires a data conversion considering the data values held?
a. Rating
b. Review Date
c. Company
d. None of the above

Answer: b. Review Date


These are NPTEL Python for Data Science Assignment 3 Answers


Q7What is the maximum average rating for the cocoa companies based out of Guatemala?
a. 43.
b. 53.
c. 42.
d. None of the above

Answer: c. 42.


Q8. Which pandas function is used to stack the dataframes vertically?
a. pd.merge()
b. pd.concat()
c. join()
d. None of the above

Answer: b. pd.concat()


These are NPTEL Python for Data Science Assignment 3 Answers


Q9. Of the following set of statements, which of them can be used to extract the column Direction as a separate dataframe?

a. df_weather[[_`Direction ‘ ]]
b. df_weather.iloc[:,0]
c. df_weather.loc[:.[ ‘Direction ‘]]
d. None of the above

Answer: a, b


These are NPTEL Python for Data Science Assignment 3 Answers


Q10. Which one of these students’ average score across all subjects was the lowest? Which subject has the highest average score across students?

a. Harini, Maths
b. Sathi, Maths
c. Harini, Physics
d. Rekha, Maths

Answer: b. Sathi, Maths


These are NPTEL Python for Data Science Assignment 3 Answers

More NPTEL course answers: https://progiez.com/nptel


* The material and content uploaded on this website are for general information and reference purposes only. Please do it by your own first. COPYING MATERIALS IS STRICTLY PROHIBITED.


More from PROGIEZ

These are NPTEL Python for Data Science Assignment 3 Answers