Home Select rows of dataframe whose column values amount to a given sum

Questions

Select rows of dataframe whose column values amount to a given sum

January 17, 2023

I need to find out how many of the first N rows of a dataframe make up (just over) 50% of the sum of values for that column.

Here’s an example:

import pandas as pd
import numpy as np
df = pd.DataFrame(np.random.rand(10, 1), columns=list("A"))

0  0.681991
1  0.304026
2  0.552589
3  0.716845
4  0.559483
5  0.761653
6  0.551218
7  0.267064
8  0.290547
9  0.182846

therefore

sum_of_A = df["A"].sum()

4.868260213425804

and with this example I need to find, starting from row 0, how many rows I need to get a sum of at least 2.43413 (approximating 50% of sum_of_A).

Of course I could iterate through the rows and sum and break when I get over 50%, but is there a more concise/Pythonic/efficient way of doing this?

>Solution :

I would use .cumsum(), which we can use to get all the rows where the cumulative sum is at least half of the total sum:

df[df["A"].cumsum() < df["A"].sum() / 2]

dataframe

byMR

Published January 17, 2023

Add a comment

In Terraform, how to output values from a list?

byMR

January 17, 2023

Questions

In Terraform, how to output values from a list?

byMR

January 17, 2023

Questions

How do I make Next.js 13 server-side components in the app directory that depend on useEffect for props?

byMR

January 17, 2023

Questions

KeyError(key) in get_loc_level after using .transform() or apply()

byMR

January 17, 2023

Questions

Google Colab, loop through google drive folder, read each CSV file in folder into a datframe, then append dataframe

byMR

January 17, 2023

Questions

Call variable from string value with React Redux

byMR

January 17, 2023

Select rows of dataframe whose column values amount to a given sum

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

In Terraform, how to output values from a list?

In Terraform, how to output values from a list?

How do I make Next.js 13 server-side components in the app directory that depend on useEffect for props?

KeyError(key) in get_loc_level after using .transform() or apply()

Google Colab, loop through google drive folder, read each CSV file in folder into a datframe, then append dataframe

Call variable from string value with React Redux

Keep Up to Date with the Most Important News

Select rows of dataframe whose column values amount to a given sum

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

In Terraform, how to output values from a list?

In Terraform, how to output values from a list?

How do I make Next.js 13 server-side components in the app directory that depend on useEffect for props?

KeyError(key) in get_loc_level after using .transform() or apply()

Google Colab, loop through google drive folder, read each CSV file in folder into a datframe, then append dataframe

Call variable from string value with React Redux

Discover more from Dev solutions