Selecting first row from each subgroup (pandas)

April 1, 2022

How can I select, for each combination of the date and p columns, the row where distance is lowest?

df
    v       p       distance    date
0   14.6    sst     22454.1     2021-12-30
1   14.9    sst     24454.1     2021-12-30
2   14.8    sst     33687.4     2021-12-30
3   1.67    wvht    23141.8     2021-12-30
4   1.9     wvht    24454.1     2021-12-30
5   1.8     wvht    24454.1     2021-12-30
6   1.7     wvht    23141.4     2021-12-31
7   2.1     wvht    24454.1     2021-12-31

Ideally, the returned dataframe should contain:

df
    v       p       distance    date
0   14.6    sst     22454.1     2021-12-30
3   1.67    wvht    23141.8     2021-12-30
6   1.7     wvht    23141.4     2021-12-31

>Solution :

One way is to use groupby + idxmin to get the index label of the row with the smallest distance in each (date, p) group, then pass those labels to loc to get the desired output:

out = df.loc[df.groupby(['date', 'p'])['distance'].idxmin()]

Output:

       v     p  distance        date
0  14.60   sst   22454.1  2021-12-30
3   1.67  wvht   23141.8  2021-12-30
6   1.70  wvht   23141.4  2021-12-31
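
For anyone who wants to reproduce this, here is a minimal self-contained sketch. The DataFrame is rebuilt by hand from the table in the question, with date kept as plain strings (an assumption; the original dtype is not shown). The `alt` variant using sort_values + drop_duplicates is not part of the answer above; it is included only as one possible alternative that should give the same rows for this data.

```python
import pandas as pd

# Sample data copied from the question; date is kept as strings (assumption).
df = pd.DataFrame({
    "v": [14.6, 14.9, 14.8, 1.67, 1.9, 1.8, 1.7, 2.1],
    "p": ["sst", "sst", "sst", "wvht", "wvht", "wvht", "wvht", "wvht"],
    "distance": [22454.1, 24454.1, 33687.4, 23141.8,
                 24454.1, 24454.1, 23141.4, 24454.1],
    "date": ["2021-12-30"] * 6 + ["2021-12-31"] * 2,
})

# For each (date, p) group, idxmin gives the index label of the smallest distance.
idx = df.groupby(["date", "p"])["distance"].idxmin()

# Look those labels up in the original frame.
out = df.loc[idx]

# Alternative sketch: sort by distance, keep the first row seen per (date, p),
# then restore the original row order.
alt = (
    df.sort_values("distance", kind="stable")
      .drop_duplicates(subset=["date", "p"], keep="first")
      .sort_index()
)

print(out)
```

If two rows in a group tie for the smallest distance, idxmin returns the label of the first one, so exactly one row per group comes back. Rows whose distance is NaN may need separate handling depending on the pandas version.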

pandas-groupby

byMR

Published April 01, 2022
