Home Group pandas DataFrame on column and sum it while retaining the number of sumed observations

Questions

Group pandas DataFrame on column and sum it while retaining the number of sumed observations

January 15, 2023

I have a pandas dataframe that looks like this:

import pandas as pd
df = pd.DataFrame({'id':[1, 1, 2, 2], 'comp': [-0.10,0.20,-0.10, 0.4], 'word': ['boy','girl','man', 'woman']})

I would like to group the dataframe on id, and calculate the sum of corresponding comp as well as get a new column called n_obs that tracks how many rows(ids) were summed up.

I tried using df.groupby('id').sum() but this is not quite producing the results that I want.

I’d like an output on the below form:

id   comp   n_obs
1    0.1    2
2    0.3    2

Any suggestions on how I can do this?

>Solution :

You can use .groupby() with .agg():

df.groupby("id").agg(comp=("comp", "sum"), n_obs=("id", "count"))

This outputs:

    comp  n_obs
id
1    0.1      2
2    0.3      2

group-by

byMR

Published January 15, 2023

Add a comment

I need write a program that should return the amount of notes and coins for the customer's change

byMR

January 15, 2023

Questions

how to transpose a table by a column with group by pandas

byMR

January 15, 2023

Questions

Referencing a non-PK field in Oracle

byMR

January 15, 2023

Questions

Which regular expression do I have to implement to extract text between two lines containing a string and an arbitrary number of digits?

byMR

January 15, 2023

Questions

Thunk Middleware: how/why does action creator have access to dispatch when it's not passed in?

byMR

January 15, 2023

Questions

Change R data frame within a function? Why alteration of df causing assign() to fail?

byMR

January 15, 2023

Group pandas DataFrame on column and sum it while retaining the number of sumed observations

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

I need write a program that should return the amount of notes and coins for the customer's change

how to transpose a table by a column with group by pandas

Referencing a non-PK field in Oracle

Which regular expression do I have to implement to extract text between two lines containing a string and an arbitrary number of digits?

Thunk Middleware: how/why does action creator have access to dispatch when it's not passed in?

Change R data frame within a function? Why alteration of df causing assign() to fail?

Keep Up to Date with the Most Important News

Group pandas DataFrame on column and sum it while retaining the number of sumed observations

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

I need write a program that should return the amount of notes and coins for the customer's change

how to transpose a table by a column with group by pandas

Referencing a non-PK field in Oracle

Which regular expression do I have to implement to extract text between two lines containing a string and an arbitrary number of digits?

Thunk Middleware: how/why does action creator have access to dispatch when it's not passed in?

Change R data frame within a function? Why alteration of df causing assign() to fail?

Discover more from Dev solutions