MCQs on Advanced Data Manipulation | R

Enhance your data manipulation skills in R with complex joins, efficient use of data.table, and advanced reshaping techniques with tidyr. Master these tools for efficient data processing.

Complex Joins and Aggregations (dplyr)

Which function in dplyr is used to combine two data frames by a common column?
a) merge()
b) join()
c) left_join()
d) bind_rows()
How would you perform a full outer join using dplyr?
a) left_join()
b) right_join()
c) full_join()
d) inner_join()
Which of the following is used in a dplyr join to match data frames on multiple columns?
a) multi_join()
b) full_join()
c) inner_join()
d) The by argument, e.g. by = c("col1", "col2")
What does the summarise() function do in dplyr?
a) Creates new columns in a data frame
b) Filters data based on conditions
c) Summarizes data by calculating aggregates
d) Groups data based on specific columns
Which function is used to group data by one or more variables before summarizing it in dplyr?
a) arrange()
b) group_by()
c) select()
d) filter()
How can you calculate the sum of a grouped column in dplyr?
a) summarize(sum())
b) group_by() %>% sum()
c) summarize(total())
d) group_by() %>% summarize(sum(column))
In dplyr, which function can be used to combine data frames vertically?
a) bind_rows()
b) full_join()
c) left_join()
d) merge()
Which of the following dplyr joins returns all rows from the left data frame and matching rows from the right data frame?
a) full_join()
b) left_join()
c) right_join()
d) inner_join()
To join two data frames whose key columns have different names, which argument of dplyr's join functions do you use?
a) by
b) on
c) column_names
d) matching
What function can be used to calculate the mean of a grouped variable in dplyr?
a) mean_by()
b) summarize(mean())
c) mutate(mean())
d) group_by() %>% mean()
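
The joins and grouped aggregations covered in this section can be sketched as follows. This is a minimal illustration, not a reference implementation: the `orders` and `customers` data frames, their column names, and their values are all made up for the example. It assumes the dplyr package is installed.

```r
library(dplyr)

# Hypothetical example data; names and values are illustrative only.
orders <- data.frame(
  order_id    = 1:5,
  customer_id = c(1, 2, 2, 3, 5),
  amount      = c(10, 20, 30, 40, 50)
)
customers <- data.frame(
  id   = c(1, 2, 3, 4),
  name = c("Ana", "Ben", "Cy", "Dee")
)

# The key columns are named differently, so `by` maps left name -> right name.
left  <- left_join(orders, customers, by = c("customer_id" = "id"))   # all orders
full  <- full_join(orders, customers, by = c("customer_id" = "id"))   # all rows from both
inner <- inner_join(orders, customers, by = c("customer_id" = "id"))  # matches only

# Grouped aggregation: one row per customer with a sum and a mean.
totals <- orders %>%
  group_by(customer_id) %>%
  summarise(total = sum(amount), avg = mean(amount))

# bind_rows() stacks data frames vertically (row-wise).
stacked <- bind_rows(orders, orders)
```

Comparing `nrow(left)`, `nrow(full)`, and `nrow(inner)` is a quick way to see how each join type treats unmatched rows.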

Working with data.table for Efficiency

Which package is primarily used for handling large data efficiently in R?
a) tidyverse
b) data.table
c) dplyr
d) ggplot2
How can you convert a data frame to a data.table object?
a) as.data.table()
b) data.table()
c) convert()
d) to.data.table()
Which data.table expression selects a column by its unquoted name inside [ ] and returns it as a vector?
a) dt[, "column_name"]
b) dt[, column_name]
c) dt$column_name
d) dt["column_name"]
Which of the following is the correct syntax to filter rows based on a condition in data.table?
a) dt[column_name > value]
b) filter(dt, column_name > value)
c) subset(dt, column_name > value)
d) dt[filter(column_name > value)]
How do you update a column in data.table by reference?
a) dt[, column_name := new_value]
b) dt$column_name <- new_value
c) update(dt, column_name, new_value)
d) dt[column_name] <- new_value
What does the setkey() function in data.table do?
a) Sorts the data by a specific column
b) Creates a key for indexing
c) Joins data tables
d) Selects rows by column values
In data.table, how would you calculate the sum of a column grouped by another column?
a) dt[, sum(column), by = group_column]
b) dt$sum(column) %>% group_by(group_column)
c) group_by(dt, group_column) %>% sum(column)
d) aggregate(dt, by = group_column, FUN = sum)
Which of the following is used to perform an inner join in data.table?
a) merge()
b) inner_join()
c) setkey()
d) join()
What is the main advantage of using data.table over a regular data.frame?
a) Smaller memory usage and faster computation
b) Better visualizations
c) Simpler syntax
d) Supports only smaller data sets
How can you perform an efficient merge operation between two data.table objects?
a) merge()
b) left_join()
c) setkey()
d) merge.data.table()
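
The data.table operations from this section can be sketched in a few lines. The table contents below (`region`, `sales`, `manager`) are hypothetical and exist only for illustration. The sketch assumes the data.table package is installed.

```r
library(data.table)

# Hypothetical example data; names and values are illustrative only.
df <- data.frame(
  region = c("N", "S", "N", "S", "E"),
  sales  = c(100, 200, 150, 50, 300)
)
dt <- as.data.table(df)            # convert a data.frame to a data.table

big <- dt[sales > 100]             # filter rows in the first argument of [ ]
v   <- dt[, sales]                 # unquoted column name returns a plain vector

dt[, sales_k := sales / 1000]      # := adds or updates a column by reference

# Grouped sum: j is evaluated once per value of `region`.
by_region <- dt[, .(total = sum(sales)), by = region]

# setkey() sorts the table by `region` and marks that column as the key.
setkey(dt, region)

managers <- data.table(
  region  = c("N", "S", "W"),
  manager = c("Kim", "Lee", "Max")
)
# merge() on data.table objects performs an inner join by default.
joined <- merge(dt, managers, by = "region")
```

Because `:=` modifies `dt` in place, no reassignment such as `dt <- ...` is needed after the update.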

Advanced Reshaping (tidyr)

Which function in tidyr is used to convert a wide-format data frame into long format?
a) spread()
b) gather()
c) pivot_wider()
d) pivot_longer()
What does the pivot_wider() function do in tidyr?
a) Converts long-format data into wide format
b) Converts wide-format data into long format
c) Filters data based on conditions
d) Summarizes data by groups
How do you separate a single column into multiple columns based on a delimiter in tidyr?
a) separate()
b) split()
c) extract()
d) subseparate()
Which tidyr function replaces missing values (NA) in a column with a value you specify?
a) fill()
b) na.fill()
c) replace_na()
d) complete()
Which superseded tidyr function, the predecessor of pivot_longer(), stacks multiple columns into key-value pairs in a single column?
a) gather()
b) spread()
c) pivot_wider()
d) pivot_longer()
Which tidyr function is used to convert a data frame into a more complete form by filling missing combinations of data?
a) expand()
b) complete()
c) fill()
d) expand_grid()
What is the purpose of the unnest() function in tidyr?
a) Unwraps nested data frames or lists into separate columns
b) Removes NA values
c) Reshapes data from long to wide format
d) Converts categorical data into numeric
Which function in the current tidyr API converts a long-format data frame to wide format?
a) spread()
b) pivot_wider()
c) gather()
d) separate()
Which function in tidyr is used to make a data frame with all possible combinations of a set of columns?
a) expand()
b) complete()
c) spread()
d) nest()
What does the separate() function do in tidyr?
a) Combines two columns into one
b) Converts wide-format data into long format
c) Splits a single column into multiple columns
d) Removes missing values from a column
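
The reshaping functions from this section can be tried on a small table. The data frames below (`wide`, `codes`, `sparse`) and their columns are hypothetical, chosen only to keep the output easy to inspect. The sketch assumes the tidyr package is installed.

```r
library(tidyr)

# Hypothetical example data; names and values are illustrative only.
wide <- data.frame(
  id = c(1, 2),
  q1 = c(10, 20),
  q2 = c(30, NA)
)

# Wide -> long: the q1/q2 columns become (quarter, score) pairs.
long <- pivot_longer(wide, cols = c(q1, q2),
                     names_to = "quarter", values_to = "score")

# Long -> wide: the inverse reshaping.
back <- pivot_wider(long, names_from = quarter, values_from = score)

# Replace NA in `score` with a chosen value.
filled <- replace_na(long, list(score = 0))

# Split one column into several on a delimiter.
codes <- data.frame(code = c("A-1", "B-2"))
parts <- separate(codes, code, into = c("letter", "number"), sep = "-")

# complete() adds rows for missing combinations (other columns become NA);
# expand() returns only the combinations themselves.
sparse     <- data.frame(id = c(1, 2), quarter = c("q1", "q2"), score = c(10, 40))
all_rows   <- complete(sparse, id, quarter)
all_combos <- expand(sparse, id, quarter)
```

Printing `all_rows` next to `all_combos` shows the difference between completing a data frame and merely enumerating combinations.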

Answer Key

QNo	Answer (Option with text)
1	c) `left_join()`
2	c) `full_join()`
3	d) `by` (an argument of the join functions, e.g. `by = c("col1", "col2")`)
4	c) Summarizes data by calculating aggregates
5	b) `group_by()`
6	d) `group_by() %>% summarize(sum(column))`
7	a) `bind_rows()`
8	b) `left_join()`
9	a) `by` (e.g. `by = c("left_col" = "right_col")`)
10	b) `summarize(mean())`
11	b) `data.table`
12	a) `as.data.table()`
13	b) `dt[, column_name]`
14	a) `dt[column_name > value]`
15	a) `dt[, column_name := new_value]`
16	b) Creates a key for indexing (it also sorts the table by the key columns)
17	a) `dt[, sum(column), by = group_column]`
18	a) `merge()`
19	a) Smaller memory usage and faster computation
20	d) `merge.data.table()` (the method that `merge()` dispatches to for data.table objects)
21	d) `pivot_longer()`
22	a) Converts long-format data into wide format
23	a) `separate()`
24	c) `replace_na()`
25	a) `gather()`
26	b) `complete()`
27	a) Unwraps nested data frames or lists into separate columns
28	b) `pivot_wider()`
29	a) `expand()`
30	c) Splits a single column into multiple columns
