Home DataFrame groupby on each item within a column of lists

Questions

DataFrame groupby on each item within a column of lists

February 6, 2023

I have a dataframe (df):

| A   | B     | C                       |
| --- | ----- | ----------------------- |
| CA  | Jon   | [sales, engineering]    |
| NY  | Sarah | [engineering, IT]       |
| VA  | Vox   | [services, engineering] |

I am trying to group by each item in the C column list (sales, engineering, IT, etc.).

Tried:

df.groupby('C')

but got list not hashable, which is expected. I came across another post where it was recommended to convert the C column to tuple which is hashable, but I need to groupby each item and not the combination.

My goal is to get the count of each row in the df for each item in the C column list. So:

sales: 1
engineering: 3
IT: 1
services: 1

While there is probably a simpler way to obtain this than using groupby, I am still curious if groupby can be used in this case.

>Solution :

You can explode & value_counts :

out = df.explode("C").value_counts("C")

Output :

print(out)

C          
engineering    3
IT             1
sales          1
services       1
dtype: int64

pandas

byMR

Published February 06, 2023

Add a comment

Group array of objects by two properties

byMR

February 6, 2023

Questions

Redirect root path to child in react-router 6

byMR

February 6, 2023

Questions

Unable to access internal object after instantiation

byMR

February 6, 2023

Questions

CMake failed to link submodule, undefined reference

byMR

February 6, 2023

Questions

Kotlin – Passing Trailing Lambdas- Function with two Parameter as Parameter

byMR

February 6, 2023

Questions

eth0 interface getting listed 3 times

byMR

February 6, 2023

DataFrame groupby on each item within a column of lists

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

Group array of objects by two properties

Redirect root path to child in react-router 6

Unable to access internal object after instantiation

CMake failed to link submodule, undefined reference

Kotlin – Passing Trailing Lambdas- Function with two Parameter as Parameter

eth0 interface getting listed 3 times

Keep Up to Date with the Most Important News

DataFrame groupby on each item within a column of lists

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

Group array of objects by two properties

Redirect root path to child in react-router 6

Unable to access internal object after instantiation

CMake failed to link submodule, undefined reference

Kotlin – Passing Trailing Lambdas- Function with two Parameter as Parameter

eth0 interface getting listed 3 times

Discover more from Dev solutions