Home R: How to remove NAs based on column data in data frames in a list?

Questions

R: How to remove NAs based on column data in data frames in a list?

June 20, 2022

I have a list (my.list) that looks like this:

$S1
  Study_ID   B   C         D
1      100  NA  C1 0.9124000
2      100 1.5 PTA        NA
3      200 1.8  C1 0.5571429
4      200 2.1 PTA 0.7849462
5      300 3.2  C1 0.3271900
6      300 1.4 PTA        NA
7      400  NA  C1 0.8248200
8      400 9.3 PTA 0.2847020

$S2
  Study_ID    B   C         D
1      100   NA  C1 0.9124000
2      100 0.70 PTA        NA
3      200   NA  C1 0.5571429
4      200 0.45 PTA 0.7849462
5      300 0.91  C1 0.3271900
6      300 0.78 PTA 0.6492000
7      400 0.65  C1 0.8248200
8      400   NA PTA        NA

If a patient has ‘NA’ in column D, I would like to remove the entire patient from the list – that is, remove them based on Study_ID.

In other words, if there is an NA in Column D, I would like to remove the two rows that have the same Study_ID.

My desired output would look like this:

$S1
  Study_ID   B   C         D
1      200 1.8  C1 0.5571429
2      200 2.1 PTA 0.7849462
3      400  NA  C1 0.8248200
4      400 9.3 PTA 0.2847020

$S2
  Study_ID    B   C         D
1      200   NA  C1 0.5571429
2      200 0.45 PTA 0.7849462
3      300 0.91  C1 0.3271900
4      300 0.78 PTA 0.6492000

How can I go about doing this?

Reproducible Data:

my.list <- structure(list(S1 = structure(list(Study_ID = c(100, 100, 200, 
200, 300,300,400,400), B = c(NA, 1.5, 1.8, 2.1, 3.2, 1.4, NA, 9.3), C = c("C1", "PTA", "C1", "PTA", "C1", "PTA","C1", "PTA"), D = c(0.9124, NA, 0.5571429, 0.7849462, 0.32719, NA, 0.82482, 0.284702
)), .Names = c("Study_ID", "B", "C", "D"), class = "data.frame", row.names = c("1", 
"2", "3", "4", "5", "6", "7", "8")), S2 = structure(list(Study_ID = c(100, 100, 200, 
200, 300,300,400,400), B = c(NA, 0.7, NA, 0.45, 
0.91, 0.78, 0.65, NA), C = c("C1", "PTA", "C1", "PTA", "C1", "PTA", "C1", "PTA"), D = c(0.9124, NA, 0.5571429, 0.7849462, 0.32719,0.6492, 0.82482, NA
)), .Names = c("Study_ID", "B", "C", 
"D"), class = "data.frame", row.names = c("1", "2", "3", "4", 
"5", "6", "7", "8"))), .Names = c("S1", "S2"))

>Solution :

Small alternative to @Yuriy answer:

library(dplyr)
library(purrr)

map(my.list, function(x) {
  x %>% 
    group_by(Study_ID) %>% 
    filter(all(!is.na(D))) %>% 
    ungroup()
})

In base R:

lapply(my.list, function(x) {
  to_remove <- unique(x[which(is.na(x$D)), "Study_ID"])
  x[!x$Study_ID %in% to_remove, ]
})

byMR

Published June 20, 2022

Add a comment

How to transform dot to comma in an amount table?

byMR

June 20, 2022

Questions

Get the innerText of a p tag when clicking on a button sitting next to it (no Jquery)

byMR

June 20, 2022

Questions

Is there a non-unifying alternative to member/2 in SWI-Prolog?

byMR

June 20, 2022

Questions

Assigning names to a vector retrieved by get()

byMR

June 20, 2022

Questions

Requirement already satisfied: pynput

byMR

June 20, 2022

Questions

C# – Refresh WPF RichTextBox during a loop

byMR

June 20, 2022

R: How to remove NAs based on column data in data frames in a list?

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

How to transform dot to comma in an amount table?

Get the innerText of a p tag when clicking on a button sitting next to it (no Jquery)

Is there a non-unifying alternative to member/2 in SWI-Prolog?

Assigning names to a vector retrieved by get()

Requirement already satisfied: pynput

C# – Refresh WPF RichTextBox during a loop

Keep Up to Date with the Most Important News

R: How to remove NAs based on column data in data frames in a list?

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

How to transform dot to comma in an amount table?

Get the innerText of a p tag when clicking on a button sitting next to it (no Jquery)

Is there a non-unifying alternative to member/2 in SWI-Prolog?

Assigning names to a vector retrieved by get()

Requirement already satisfied: pynput

C# – Refresh WPF RichTextBox during a loop

Discover more from Dev solutions