Home How to select columns with equal or more than 2 unique values while ignoring NA and blank?

Questions

How to select columns with equal or more than 2 unique values while ignoring NA and blank?

May 9, 2022

My dataframe looks similar to this:

 df <- data.frame(ID = c(1, 2, 3, 4, 5),
               color = c(NA, "black", "black", NA, "brown"),
              animal = c("dog", "", "", "", "")
               owner = c("YES", "NO", "NO", "YES", NA))

ID	color	animal	owner
1	NA	dog	YES
2	black		NO
3	black		NO
4	NA		YES
5	brown		NA

I would like to retrieve the column names of all columns with more than 2 unique values while ignoring NA and blanks/empty strings in this calculation.

My solution so far:

df_col <- df %>% 
        select_if(function(col) length(unique(na.omit(col)))>1)

df_col <- colnames(df_col)

But I have noticed that na.omit() won’t help, since it deletes the whole row.

Any help would be appreciated. Thank you in advance!

>Solution :

Use n_distinct, which also have na.rm argument, The _if/_at/_all are deprecated in favor of across/where. The empty strings ('') can be checked with nzchar which returns a TRUE only if it is non-empty, thus subset the elements of the columns with nzchar and then apply n_distinct column wise and create the condition to select only those columns and then get the names

library(dplyr)
df %>%
    select(where(~ n_distinct(.x[nzchar(.x)], na.rm = TRUE) > 1)) %>%
     names

-output

[1] "ID"    "color" "owner"

An option is also to convert the "" to NA (na_if), perhaps it may be slightly compact

df %>% 
  select(where(~ n_distinct(na_if(.x, ""), na.rm = TRUE) > 1)) %>% 
  names

shiny

byMR

Published May 09, 2022

Add a comment

mypy – How to mark line as unreachable

byMR

May 9, 2022

Questions

Dynamic client registration with Google's OAUTH2 (RFC7591)

byMR

May 9, 2022

Questions

How to use AutoMapper to convert Child Object list to either list of strings or list of Guids

byMR

May 9, 2022

Questions

How to set env variable in Heroku with Node.js?

byMR

May 9, 2022

Questions

Compare values from multiple dictionaries

byMR

May 9, 2022

Questions

How to filter json data while Fetching in Vuejs

byMR

May 9, 2022

How to select columns with equal or more than 2 unique values while ignoring NA and blank?

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

mypy – How to mark line as unreachable

Dynamic client registration with Google's OAUTH2 (RFC7591)

How to use AutoMapper to convert Child Object list to either list of strings or list of Guids

How to set env variable in Heroku with Node.js?

Compare values from multiple dictionaries

How to filter json data while Fetching in Vuejs

Keep Up to Date with the Most Important News

How to select columns with equal or more than 2 unique values while ignoring NA and blank?

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

mypy – How to mark line as unreachable

Dynamic client registration with Google's OAUTH2 (RFC7591)

How to use AutoMapper to convert Child Object list to either list of strings or list of Guids

How to set env variable in Heroku with Node.js?

Compare values from multiple dictionaries

How to filter json data while Fetching in Vuejs

Discover more from Dev solutions