Home How to subset a dataframe based on values in a list column

Questions

How to subset a dataframe based on values in a list column

August 22, 2022

I’m having an issue where I’m pulling down information from an API, and there are nested values within specific columns. I need to filter on those values in order to return the information I need. Here’s an example:

library(dplyr)

# Make Data
problem <- list(list("thing 1", "thing 2"), list("thing 1", "thing 2", "thing 3"), list("thing 1"))
name <- list("joe", "sue", "nancy")

df<-data.frame(name=c("joe", "sue", "nancy"),problem=I(problem))

# How can I find subset rows where the problem column contains "thing 3"
filter(df, name == "sue") # this works fine
filter(df, "thing 3" %in% problem) # this doesn't

It’s obvious to me that it’s because the list is nested and filter() isn’t "seeing" the data, but it’s less clear to me how to get around it. Additionally, the data that I’m returning is fairly large, and has an arbitrary number of items per list within the column, so I don’t want to unnest the column if I can avoid it.

#EDIT: I’m not married to a dplyr solution, and in fact if there is a data.table solution, I’d be especially interested to hear it, but I’m great with base or whatever!

Any help would be appreciated.

>Solution :

df %>%
  filter(map_lgl(problem, ~any('thing 3' == .x)))

  name      problem
1  sue thing 1,....

subset

byMR

Published August 22, 2022

Add a comment

Accessing templatized static constexpr member of templatized class with template parameter

byMR

August 22, 2022

Questions

Arranging several ggplots in same window

byMR

August 22, 2022

Questions

Python shift data on every second row from one column to a new column

byMR

August 22, 2022

Questions

Why does os.system recognise space in a path when a filename exists but need spaces escaped to give a correct error when a file doesn't exist?

byMR

August 22, 2022

Questions

Canvas does not capture entire frame of video

byMR

August 22, 2022

Questions

An async method cannot return any type that has an accessible GetAwaiter method

byMR

August 22, 2022

How to subset a dataframe based on values in a list column

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

Accessing templatized static constexpr member of templatized class with template parameter

Arranging several ggplots in same window

Python shift data on every second row from one column to a new column

Why does os.system recognise space in a path when a filename exists but need spaces escaped to give a correct error when a file doesn't exist?

Canvas does not capture entire frame of video

An async method cannot return any type that has an accessible GetAwaiter method

Keep Up to Date with the Most Important News

How to subset a dataframe based on values in a list column

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

Accessing templatized static constexpr member of templatized class with template parameter

Arranging several ggplots in same window

Python shift data on every second row from one column to a new column

Why does os.system recognise space in a path when a filename exists but need spaces escaped to give a correct error when a file doesn't exist?

Canvas does not capture entire frame of video

An async method cannot return any type that has an accessible GetAwaiter method

Discover more from Dev solutions