Perform merge for specific duplicate rows in pandas DataFrame

November 4, 2022

Consider the following two DataFrames in Python:

df:

code_1	other
19001	white
19009	blue
19008	red

df_1:

code_1	code_2
19001	00001
19001	00002
19009	00003
19008	00004

I want to merge df with df_1:

    df_merge = pd.merge(df, df_1, how="left", on=['code_1'])

df_merge:

code_1	other	code_2
19001	white	00001
19001	white	00002
19009	blue	00003
19008	red	00004

I want the merge to ignore the duplicate code_1 values in df_1 and use only the first matching row. I could call drop_duplicates on [other, code_1] afterwards, but I would like to know whether merge has a parameter that does this directly.

Expected result:

code_1	other	code_2
19001	white	00001
19009	blue	00003
19008	red	00004

Solution:

As far as I know, pandas.merge() has no parameter that does this directly. You can get the same result by dropping the duplicates from df_1 before merging, assuming the duplicates occur only in df_1:

    df_merge = df.merge(df_1.drop_duplicates('code_1'), how="left", on=['code_1'])
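
For reference, here is a minimal, self-contained sketch of that approach using the sample data from the question. It makes a few assumptions that are not stated in the original: the codes are stored as strings (so the leading zeros survive), 19008 maps to 00004 as in the expected result, and the name df_1_unique is only illustrative. The validate="many_to_one" argument is an optional extra check, not part of the answer itself; it makes merge raise an error if code_1 is still duplicated on the right-hand side.

```python
import pandas as pd

# Sample data from the question; codes are kept as strings to preserve leading zeros.
df = pd.DataFrame({
    "code_1": ["19001", "19009", "19008"],
    "other": ["white", "blue", "red"],
})
df_1 = pd.DataFrame({
    "code_1": ["19001", "19001", "19009", "19008"],
    "code_2": ["00001", "00002", "00003", "00004"],
})

# Keep only the first df_1 row for each code_1 (keep="first" is the default).
df_1_unique = df_1.drop_duplicates(subset="code_1", keep="first")

# Optional safety check (an addition, not part of the original answer):
# validate="many_to_one" raises pandas.errors.MergeError if code_1 is
# still duplicated in the right-hand DataFrame.
df_merge = df.merge(df_1_unique, how="left", on="code_1", validate="many_to_one")

print(df_merge)
```

Because this is a left merge, the rows keep the order of df, and each code_1 appears once with the code_2 value from its first occurrence in df_1.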

by MR
