Create cross-tabulation of most frequent value of string variable and sort by frequency

June 9, 2022

I have a sample dataset:

df <- data.frame(category = c("A", "A", "B", "C", "C", "D", "E", "C", "E", "A", "B", "C", "B", "B", "B", "D", "D", "D", "D", "B"), year = c(1, 2, 1, 2, 3, 2, 3, 1, 3, 2, 1, 1, 2, 1, 2, 3, 1, 2, 3, 1))

and would like to create a cross-tabulation of year and category such that only the 3 most frequent categories are in the table, sorted by total number of occurrences.

Using something like

df %>% 
  add_count(category) %>% 
  filter(n %in% tail(sort(unique(n)),3)) %>% 
  arrange(desc(n)) %>% {table(.$category, .$year)}

will filter for the three most frequent categories but leave the table unsorted.

>Solution :

This should give you what you want.

# Make a table
df.t <- table(df)
# Order by top occurrences (sum over margin 1)
df.t <- df.t[order(apply(df.t, 1, sum), decreasing=TRUE),]
# Keep top 3 results
df.t <- df.t[1:3,]

Output:

        year
category 1 2 3
       B 4 2 0
       D 1 2 2
       C 2 1 1
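
The attempt in the question stays unsorted because `table()` orders its rows by factor levels, and a character column is converted to a factor with alphabetically sorted levels, so `arrange()` has no effect on the result. As a minimal sketch of an alternative (the names `top` and `sub` are only illustrative), you can set the level order explicitly before tabulating:

```r
# Sample data from the question
df <- data.frame(
  category = c("A", "A", "B", "C", "C", "D", "E", "C", "E", "A",
               "B", "C", "B", "B", "B", "D", "D", "D", "D", "B"),
  year = c(1, 2, 1, 2, 3, 2, 3, 1, 3, 2, 1, 1, 2, 1, 2, 3, 1, 2, 3, 1)
)

# Names of the (up to) 3 most frequent categories, most frequent first
top <- head(names(sort(table(df$category), decreasing = TRUE)), 3)

# Keep only those rows and fix the level order so table() respects it
sub <- df[df$category %in% top, ]
sub$category <- factor(sub$category, levels = top)

# Cross-tabulate; rows follow the factor levels, not alphabetical order
tab <- table(sub)
tab
```

Note that `head(..., 3)` keeps exactly three categories, whereas the `filter(n %in% ...)` approach in the question keeps every category tied at one of the three highest counts, so the two can differ when counts are tied.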

byMR
