Get first and last occurrence of duplicate value

November 24, 2021

I have a dataframe like this:

index    col1     col2    col3    col4
  0      11/20    11/26   abc     35
  1      11/21    11/24   xxx     30
  2      11/22    11/27   abc     20

Here col3 contains a duplicated value (abc appears in rows 0 and 2).
I want to sum col4 across the rows that share the same col3 value.

So in this case I do:

df = df.groupby(['col3'])[['col4']].sum()

But with this approach I lose col1 and col2, and I get:

index    col3    col4
  0      abc     55
  1      xxx     30

I would like to keep col1 from the first occurrence of the duplicated value (11/20) and col2 from its last occurrence (11/27), so that the final result looks like this:

index    col1     col2    col3    col4
  0      11/20    11/27   abc     55
  1      11/21    11/24   xxx     30

>Solution :

One way is to use pandas.DataFrame.groupby followed by agg, with one aggregation per column:

new_df = df.groupby("col3", as_index=False).agg({"col1": "first",
                                                 "col2": "last",
                                                 "col4": "sum"})
print(new_df)

Output:

  col3   col1   col2  col4
0  abc  11/20  11/27    55
1  xxx  11/21  11/24    30
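
For anyone who wants to run this end to end, here is a minimal self-contained sketch. It assumes the dataframe from the question is built by hand as below, with the dates stored as plain strings. It uses named aggregation, a variant of the dict form above, and then reorders the columns to match the layout asked for in the question; sort=False is an optional choice that keeps the groups in order of first appearance.

```python
import pandas as pd

# Sample data rebuilt from the question (dates kept as plain strings).
df = pd.DataFrame({
    "col1": ["11/20", "11/21", "11/22"],
    "col2": ["11/26", "11/24", "11/27"],
    "col3": ["abc", "xxx", "abc"],
    "col4": [35, 30, 20],
})

new_df = (
    # sort=False keeps groups in order of first appearance instead of sorting by key.
    df.groupby("col3", as_index=False, sort=False)
      .agg(
          col1=("col1", "first"),  # col1 from the first row of each group
          col2=("col2", "last"),   # col2 from the last row of each group
          col4=("col4", "sum"),    # total of col4 within each group
      )
      # Put the columns back in the order used in the question.
      [["col1", "col2", "col3", "col4"]]
)
print(new_df)
```

Note that "first" and "last" follow the row order of the dataframe, not the date order, and they return the first or last non-null entry in each group. If the rows may not be sorted by date, sorting them beforehand (or parsing the dates and aggregating with "min" and "max") is probably the safer choice.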

pandas

by MR
