Home select top n values and group the rest of the remaining ones (Rstudio)

Questions

select top n values and group the rest of the remaining ones (Rstudio)

December 10, 2021

How can I rank the first 4 group of my dataframe associated to the highest value in the count column and create a 5th group summing up the remaining groups and their associated values ?

What I did so far:

dummy_dataframe <- data.frame(group = c("A", "B", "A", "A", "C", "C", "D", "E", "F", "D","G")) 

df_aggregate <- aggregate(cbind(count = group) ~ group, 
                         data = dumy_dataframe, 
                         FUN = function(x){NROW(x)})

df_sliced <- df_aggregate %>%
       arrange(desc(count)) %>% 
      slice(1:4)

With the code above I get a dataframe with the 4 groups associated to the highest value but how I could have a fith group summing up the value of the missing group (E, F and G) ? For instance something like this:

   group     count
1     A        3
2     B        1
3     C        2
4     D        2
5   others     3

>Solution :

You can run some tidyverse operations directly on your original dataframe:

library(tidyverse)
dummy_dataframe %>%
  count(group) %>%
  mutate(id = if_else(row_number() < 5, 1L, 2L)) %>%
  group_by(id) %>%
  arrange(id, -n) %>%
  mutate(group = if_else(id == 2L, "others", group),
         n = if_else(group == "others", sum(n), n)) %>%
  ungroup() %>%
  distinct() %>%
  select(-id)

which gives:

# A tibble: 5 x 2
  group      n
  <chr>  <int>
1 A          3
2 C          2
3 D          2
4 B          1
5 others     3

rstudio

byMR

Published December 10, 2021

Add a comment

Making 2+ Text areas reflect each other

byMR

December 10, 2021

Questions

MySQL regex not replacing second word of the string

byMR

December 10, 2021

Questions

How this loop can be optimized?

byMR

December 10, 2021

Questions

how to insert a line break after every second loop in t-sql stuff function

byMR

December 10, 2021

Questions

How can I pass a function as a parameter in an angular HTML template to update a variable in the parent component?

byMR

December 10, 2021

Questions

Regex table of contents

byMR

December 10, 2021

select top n values and group the rest of the remaining ones (Rstudio)

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

Making 2+ Text areas reflect each other

MySQL regex not replacing second word of the string

How this loop can be optimized?

how to insert a line break after every second loop in t-sql stuff function

How can I pass a function as a parameter in an angular HTML template to update a variable in the parent component?

Regex table of contents

Keep Up to Date with the Most Important News

select top n values and group the rest of the remaining ones (Rstudio)

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

Making 2+ Text areas reflect each other

MySQL regex not replacing second word of the string

How this loop can be optimized?

how to insert a line break after every second loop in t-sql stuff function

How can I pass a function as a parameter in an angular HTML template to update a variable in the parent component?

Regex table of contents

Discover more from Dev solutions