Follow

Follow

Contact

Home How to filter rows and columns based on the maximum value in a Python DataFrame

Questions

How to filter rows and columns based on the maximum value in a Python DataFrame

byMR

May 1, 2022

Shown below are few details on a DataFrame.

Below is the syntax that is been used and do not get the expected output.

df = df.sort_values(by=['country','Year','Value'], ascending=[True,True,False])
df = df.drop_duplicates('country')

how could I get the expected output shown below

>Solution :

Try sorting by "Value" and keeping the last row for each country

>>> df.sort_values("Value").drop_duplicates("country",keep="last")
    Year country  Value
2   2003     USA   7000
6   2002   India   9000
10  2001   Japan  10000

Alternatively, you could use groupby:

>>> df[df["Value"].eq(df.groupby("country")["Value"].transform('max'))]
    Year country  Value
2   2003     USA   7000
6   2002   India   9000
10  2001   Japan  10000

dataframe

byMR

Published May 01, 2022

Add a comment

Leave a ReplyCancel reply

Read more

Questions

Python Dataframe process two columns of lists and find minimum

byMR

May 1, 2022

Questions

how to revert (last clicked list item) back to its original color when any other item is clicked – react hooks

byMR

May 1, 2022

Questions

Euclidian distance between two rows

byMR

May 1, 2022

Questions

How to remove padding on the last element of a list in flutter?

byMR

May 1, 2022

Questions

Swapping mutlidimensional arrays every 5 seconds in Javascript

byMR

May 1, 2022

Questions

fetch in parallel async/await with Promises.all

byMR

May 1, 2022