Home How to compare 2 dataframes and then output an ID to inform if a row has changed?

Questions

How to compare 2 dataframes and then output an ID to inform if a row has changed?

byMR

May 10, 2022

I have 2 dataframes:

df1 = pd.DataFrame({"id1": ["A", "B", "C", "D"], "id2": ["1", "2", "2", "1"]})

df2 = pd.DataFrame({"id1": ["A", "B", "F", "C", "D", "E"], "id2": ["1", "1", "2", "1", "1", "2"]})

I want to find the rows which are different, here is the code I tried:

vals = set(df2.iloc[:, :2].apply(tuple, 1)) \ - set(df1.apply(tuple, 1))

The result is the following which gives me the rows which are different between the 2 dataframes:

{('B', '1'), ('E', '2'), ('C', '1'), ('F', '2')}

I would like to get an additional output which tells me which rows of df2 have been modified or not (Y for yes and N for No) :

	id1	Modified_ID
0	A	N
1	B	Y
2	F	N
3	C	Y
4	D	N
5	E	N

>Solution :

You can merge and build a boolean mask that returns True if a value didn’t change (or was expanded) and map Y/N values according to it:

df2 = df2.merge(df1, on='id1', how='left', suffixes=('_',''))
df2['Modified_ID'] = (df2['id2_'].eq(df2['id2']) | df2['id2'].isna()).map({True:'N', False:'Y'})
df2 = df2.drop(columns=['id2_','id2'])

Output:

  id1 Modified_ID
0   A           N
1   B           Y
2   F           N
3   C           Y
4   D           N
5   E           N

dataframe

byMR

Published May 10, 2022

Add a comment

When should I mark the static method of a specific class as private other than public?

byMR

May 10, 2022

Questions

Referring alias from select sql query

byMR

May 10, 2022

Questions

program to check arithmetic progression from .txt file and print true/false

byMR

May 10, 2022

Questions

How to find column b when column a equals a certain value in data frames python

byMR

May 10, 2022

Questions

One of many recursive calls of a function found the correct result, but it can't "tell" the others. Is there a better fix than this ugly workaround?

byMR

May 10, 2022

Questions

Calculate % Change Avoiding NA's, By Group

byMR

May 10, 2022

How to compare 2 dataframes and then output an ID to inform if a row has changed?

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

When should I mark the static method of a specific class as private other than public?

Referring alias from select sql query

program to check arithmetic progression from .txt file and print true/false

How to find column b when column a equals a certain value in data frames python

One of many recursive calls of a function found the correct result, but it can't "tell" the others. Is there a better fix than this ugly workaround?

Calculate % Change Avoiding NA's, By Group

Keep Up to Date with the Most Important News

How to compare 2 dataframes and then output an ID to inform if a row has changed?

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

When should I mark the static method of a specific class as private other than public?

Referring alias from select sql query

program to check arithmetic progression from .txt file and print true/false

How to find column b when column a equals a certain value in data frames python

One of many recursive calls of a function found the correct result, but it can't "tell" the others. Is there a better fix than this ugly workaround?

Calculate % Change Avoiding NA's, By Group

Discover more from Dev solutions