How to delete duplicated elements in a CSV column

I need help deleting duplicated entries in the language column, where the same language appears more than once, using Python.

Here is my csv:

import pandas as pd

f = pd.DataFrame({'Movie': ['name1', 'name2', 'name3', 'name4'],
                  'Year': ['1905', '1905', '1906', '1907'],
                  'Id': ['tt0283985', 'tt0283986', 'tt0284043', 'tt3402904'],
                  'language': ['Mandarin,Mandarin', 'Mandarin,Cantonese,Mandarin',
                               'Mandarin,Cantonese', 'Cantonese,Cantonese']})

Where f now looks like:


   Movie  Year         Id   language
0  name1  1905  tt0283985  Mandarin,Mandarin
1  name2  1905  tt0283986  Mandarin,Cantonese,Mandarin
2  name3  1906  tt0284043  Mandarin,Cantonese
3  name4  1907  tt3402904  Cantonese,Cantonese

And the result should be like this:

   Movie  Year         Id             language
0  name1  1905  tt0283985            Mandarin
1  name2  1905  tt0283986            Mandarin,Cantonese
2  name3  1906  tt0284043            Mandarin,Cantonese
3  name4  1907  tt3402904            Cantonese

I am having trouble writing a function to remove the duplicated values in the language column.
Thanks in advance!

Solution:

Try this. Note that a plain `set` would drop duplicates but does not guarantee order, so `dict.fromkeys` is used instead: it removes repeats while keeping each language's first occurrence, which matches the expected output.

f['language'].str.split(',').map(lambda x: ','.join(dict.fromkeys(x)))

Output:

0              Mandarin
1    Mandarin,Cantonese
2    Mandarin,Cantonese
3             Cantonese
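To get the DataFrame shown in the question (rather than just the transformed Series), the result can be assigned back to the column. A minimal end-to-end sketch, assuming pandas is available:

```python
import pandas as pd

f = pd.DataFrame({'Movie': ['name1', 'name2', 'name3', 'name4'],
                  'Year': ['1905', '1905', '1906', '1907'],
                  'Id': ['tt0283985', 'tt0283986', 'tt0284043', 'tt3402904'],
                  'language': ['Mandarin,Mandarin', 'Mandarin,Cantonese,Mandarin',
                               'Mandarin,Cantonese', 'Cantonese,Cantonese']})

# Split each cell on commas, drop repeats while preserving first-seen order
# (dict keys keep insertion order in Python 3.7+), and write the result back.
f['language'] = f['language'].str.split(',').map(lambda x: ','.join(dict.fromkeys(x)))

print(f)
```

When reading from an actual CSV file, the same line works after `f = pd.read_csv(...)`, since `str.split` operates on the string column directly.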