Follow

Follow

Contact

Home remove duplicated rows and convert it to list or tuple

Questions

remove duplicated rows and convert it to list or tuple

byMR

February 8, 2022

I have dataframe as follows (Name is index):

Name	Age	year
Tom	20	2020
Tom	20	2021
Nick	19	2019
Jack	18	2018

my goal is to remove duplicate and convert the column year to tuple or list, like below

Name	Age	year
Tom	20	(2020, 2019)
Nick	19	2019
Jack	18	2018

how can I do that efficiently since my df has more than 800,000 rows

>Solution :

Use np.unique on groupby. Assuming Name is already the index:

>>> df.groupby(level=0).agg(np.unique)
      Age          year
Name                   
Jack   18          2018
Nick   19          2019
Tom    20  [2020, 2021]

pandas

byMR

Published February 08, 2022

Add a comment

Leave a ReplyCancel reply

Read more

Questions

excel sumif multiple columns data

byMR

February 8, 2022

Questions

Create menu using Map but to remove the { } and ", "

byMR

February 8, 2022

Questions

Printing Armstrong Numbers from a Range in a For Loop as an Array Python

byMR

February 8, 2022

Questions

VBA Excel SelectionChange Reading Target Range into array and returning column count

byMR

February 8, 2022

Questions

How to print symbols vertically in python?

byMR

February 8, 2022

Questions

How to download all files in a directory in ASP.NET using C#

byMR

February 8, 2022