Python Pandas – drop duplicates from dataframe and merge the columns value

April 7, 2023

I am trying to remove duplicates from my Dataframe and save their data into the columns where they are NA/Empty.

Example:
I’ve the following DATAFRAME and I would like to remove all the duplicates in column A but merge the values from the rest of the tables

A	B	C	D	E
1	X
2	X
2		X
2			X
3		X
3			X
2				X

The expected output:

A	B	C	D	E
1	X
2	X	X	X	X
3		X	X

How can I perform the above dynamically?

Thanks in advance for the answers

>Solution :

You can use groupby_first because it compute the first non-null entry of each column.:

>>> df.groupby('A', as_index=False).first()
   A     B     C     D     E
0  1     X  None  None  None
1  2     X     X     X     X
2  3  None     X     X  None

duplicates

byMR

Published April 07, 2023

Add a comment

How can I align a text inside a div that has 2 text span

byMR

April 7, 2023

Questions

Casting at compile time

byMR

April 7, 2023

Questions

VUE3 data is not reactive

byMR

April 7, 2023

Questions

How do I make images in flexbox align?

byMR

April 7, 2023

Questions

Why I got KeyError in loop "for" with regular expressions?

byMR

April 7, 2023

Questions

Can I return 400 error instead of 422 error

byMR

April 7, 2023

Python Pandas – drop duplicates from dataframe and merge the columns value