Remove related row from pandas dataframe

April 7, 2022

I have the following dataframe:

id	relatedId	coordinate
123	125	55
125	123	45
128	130	60
132	135	50
130	128	40
135	132	50

This dataframe has 6 rows, but I would like to drop the related rows so that only 3 remain. The coordinates of two related rows always sum to 100, and I would like to keep the row with the lower value (the one below 50). If both are 50, either one will do. The resulting dataframe would thus be:

id	relatedId	coordinate
125	123	45
132	135	50
130	128	40

Hopefully someone has a good solution for this problem.
Thanks

Solution:

You can sort the values and get the first value per group using a frozenset of the 2 ids as grouper:

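# Sort so the lowest coordinate comes first, then keep the first row of each pair.
# frozenset({id, relatedId}) is identical for both rows of a pair, whatever the order.
(df
 .sort_values(by='coordinate')
 .groupby(df[['id', 'relatedId']].agg(frozenset, axis=1), as_index=False)
 .first()
)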

output:

    id  relatedId  coordinate
0  130        128          40
1  125        123          45
2  132        135          50
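
The grouper works because a frozenset ignores order and is hashable: the pair (123, 125) and the pair (125, 123) produce equal keys. The following is a minimal sketch of that idea, assuming the sample data from the question. The key is built here with a plain list comprehension rather than agg, purely for illustration, and the variable name key is not part of the original answer:

```python
import pandas as pd

# Sample data copied from the question
df = pd.DataFrame({
    "id":         [123, 125, 128, 132, 130, 135],
    "relatedId":  [125, 123, 130, 135, 128, 132],
    "coordinate": [55, 45, 60, 50, 40, 50],
})

# One order-independent key per row (explicit equivalent of the frozenset grouper)
key = pd.Series(
    [frozenset(pair) for pair in zip(df["id"], df["relatedId"])],
    index=df.index,
)

# Rows 0 and 1 reference each other, so they share a key
print(key[0] == key[1])

# The six rows collapse into three distinct pairs
print(len(set(key)))
```

Any row whose (id, relatedId) pair appears reversed elsewhere therefore lands in the same group.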

Alternatively, to keep the original row order and the original index, use idxmin per group:

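# One order-independent key per pair of related rows
group = df[['id', 'relatedId']].agg(frozenset, axis=1)
# Index label of the lowest coordinate within each pair
idx = df['coordinate'].groupby(group).idxmin()
# Select those rows, in their original order
df.loc[sorted(idx)]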

output:

    id  relatedId  coordinate
1  125        123          45
3  132        135          50
4  130        128          40
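
For a self-contained version that can be run as-is, here is a sketch of the same idxmin idea with one substitution: instead of a frozenset key, it groups on the (smaller id, larger id) pair computed with NumPy. That swap is my assumption, not part of the original answer, and the names lo, hi and result are illustrative. It relies on the ids being numeric:

```python
import numpy as np
import pandas as pd

# Sample data copied from the question
df = pd.DataFrame({
    "id":         [123, 125, 128, 132, 130, 135],
    "relatedId":  [125, 123, 130, 135, 128, 132],
    "coordinate": [55, 45, 60, 50, 40, 50],
})

# Order-independent key: (smaller id, larger id) is the same for both rows of a pair
lo = np.minimum(df["id"].to_numpy(), df["relatedId"].to_numpy())
hi = np.maximum(df["id"].to_numpy(), df["relatedId"].to_numpy())

# Index label of the lowest coordinate within each pair
idx = df["coordinate"].groupby([lo, hi]).idxmin()

# Keep those rows, preserving the original order and index labels
result = df.loc[sorted(idx)]
print(result)
```

When both rows of a pair have a coordinate of 50, only one of them is kept, which matches the requirement in the question.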

dataframe

byMR

Published April 07, 2022
