pandas groupby with strings – return single string

June 3, 2022

I have a dataframe that looks like the following:

    ID  Type    Size
0   123 Red     5
1   456 Blue    7
2   789 Yellow  12
3   789 Yellow  4

I want to group by ID and take the mean of Size for duplicated IDs. For Type, I want to return the single shared string rather than a concatenation or a list of the repeated values. I tried to do this with agg:

import statistics

import pandas as pd

df = pd.DataFrame({'ID' : [123, 456, 789, 789], 'Type' : ['Red', 'Blue', 'Yellow', 'Yellow'], 'Size' : [5, 7, 12, 4]})

def identity(x):
    # returns the whole group unchanged, not a single value
    return x

# mean for every column, except the special columns, which use identity
special_columns = ['Type']
aggfuncs = {col: statistics.mean for col in df.columns}
aggfuncs.update({col:identity for col in special_columns})
df.groupby(['ID'], as_index=False).agg(aggfuncs)

However, this still turns into an array of the repeated string:

    ID  Type              Size
0   123 Red                 5
1   456 Blue                7
2   789 [Yellow, Yellow]    8

The end result I wanted was:

    ID  Type              Size
0   123 Red                 5
1   456 Blue                7
2   789 Yellow              8

How can this be achieved?

Solution:

If each ID maps to exactly one Type, grouping by both columns produces the same groups as grouping by ID alone, and Type stays a plain string column:

# use both ID and Type as grouper
res = df.groupby(["ID", "Type"], as_index=False)["Size"].mean()
res
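
If the one-Type-per-ID assumption is in doubt, it can be checked first. The sketch below is one possible alternative that keeps the dictionary-style agg from the question: it uses the built-in 'first' aggregation for Type instead of an identity function. The names types_per_id, one_type_per_id and res_first are illustrative, not from the original post.

```python
import pandas as pd

df = pd.DataFrame({
    "ID": [123, 456, 789, 789],
    "Type": ["Red", "Blue", "Yellow", "Yellow"],
    "Size": [5, 7, 12, 4],
})

# Check the assumption: every ID should map to exactly one distinct Type.
types_per_id = df.groupby("ID")["Type"].nunique()
one_type_per_id = bool((types_per_id == 1).all())

# Group by ID only; keep the first Type seen in each group and average Size.
res_first = df.groupby("ID", as_index=False).agg({"Type": "first", "Size": "mean"})
print(res_first)
```

On the sample data this should give one row per ID, with a single Yellow and a mean Size of 8 for ID 789. Note that 'first' silently picks one value if an ID ever has several Types, so the check above is worth keeping when that matters.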

by MR

Published June 03, 2022
