Can Pandas GroupBy split into just 2 bins?

April 2, 2024

Imagine I have this table:

Col-1 | Col-2
A     |   2
A     |   3
B     |   1
B     |   4
C     |   7

A groupby on Col-1 with a sum aggregation on Col-2 gives 5 for A, 5 for B, and 7 for C.
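For reference, the baseline described above can be reproduced with a short sketch. The DataFrame is built by hand from the table, and the variable names `df` and `per_value` are illustrative, not part of the original question.

```python
import pandas as pd

# Sample data copied from the table in the question.
df = pd.DataFrame(
    {"Col-1": ["A", "A", "B", "B", "C"], "Col-2": [2, 3, 1, 4, 7]}
)

# Plain groupby: one group per distinct value in Col-1.
per_value = df.groupby("Col-1")["Col-2"].sum()
print(per_value)
```

This yields one row per distinct label (A, B, C), which is the behavior the question wants to collapse into two bins.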

What I want to know is whether there is a built-in feature that aggregates on one target value in a column and groups all other entries into a second bin. For example, if I group by Col-1 targeting A and collect all other entries under a label named Other, I would end up with A as 5 and Other as 12.

Does that make sense? I know I could do some filtering sorcery and merging datasets back together, but figured there had to be a cleaner, more Pythonic way I am missing.

I have tried going through the documentation, but nothing jumped out at me.

Solution:

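One solution is to build a pd.Categorical from the first column with two categories: A for the string A, and Other for every other string. Then group by this categorical (the code assumes the columns are named Col1 and Col2, and that pandas is imported as pd):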

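# Values other than "A" become NaN; add an "Other" category and fill the NaNs with it
tmp = (
    pd.Categorical(df["Col1"], categories=["A"]).add_categories("Other").fillna("Other")
)

# observed=False keeps every category in the result, even one with no rows
out = df.groupby(tmp, observed=False)["Col2"].sum()
print(out)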

Prints:

A         5
Other    12
Name: Col2, dtype: int64

Another solution is to group by a boolean mask:

# The mask is True where Col1 is "A"; group on it, then relabel the True/False index
out = (
    df.groupby(df["Col1"].eq("A"))["Col2"]
    .sum()
    .rename(index={True: "A", False: "Other"})
)
print(out)

Prints:

Col1
Other    12
A         5
Name: Col2, dtype: int64
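A third variant, not part of the original answer, relabels the grouping key with Series.where before grouping. This is a minimal sketch: the helper name `sum_with_other` and its parameters are hypothetical, and the column names Col1 and Col2 follow the answer's code rather than the table in the question.

```python
import pandas as pd

# Sample data from the question, using the column names from the answer's code.
df = pd.DataFrame({"Col1": ["A", "A", "B", "B", "C"], "Col2": [2, 3, 1, 4, 7]})


def sum_with_other(frame, targets, other_label="Other"):
    """Sum Col2 per target value of Col1, pooling all other values together.

    Hypothetical helper for illustration only.
    """
    # Keep the label where it is one of the targets, otherwise replace it.
    key = frame["Col1"].where(frame["Col1"].isin(targets), other_label)
    return frame.groupby(key)["Col2"].sum()


out = sum_with_other(df, ["A"])
print(out)
```

Because the key is built with isin, the same helper should also cover more than one target value, for example `sum_with_other(df, ["A", "B"])`, which keeps A and B as separate groups and pools the rest.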

dataframe

by MR
