Home Finding common words in a column based on values from another column

Questions

Finding common words in a column based on values from another column

November 20, 2021

In a dataframe with a column named source, made of two different word lists

 source  words  letter_count
1 list1  apple       5
2 list1  pear        4
3 list1  banana      6
4 list2  ford        4
5 list2  chevy       5
6 list2  apple       5
7 list2  banana      6

I’m trying to return a new dataframe that shows the duplicate words in list1 and list2

   words   letter_count
1  apple        5
2  banana       6

I’m using python and pandas

>Solution :

I think you’re looking for pandas.Series.duplicated(). It returns a mask (a series containing True/False values corresponding to values that match a condition) where values that occur more than once in the series are True, and those that occur only are False. Then, you can index the dataframe with that mask:

new_df = df[df['words'].duplicated()].drop('source', axis=1)

Output:

>>> new_df
    words  letter_count
6  banana             6
7   apple             5

dataframe

byMR

Published November 20, 2021

Add a comment

How to print out actual object value instead of memory address in gdb?

byMR

November 20, 2021

Questions

Creating an object from two arrays

byMR

November 20, 2021

Questions

Problem with Java Array finding prime numbers

byMR

November 20, 2021

Questions

Python regexp to get substring contains '/\'

byMR

November 20, 2021

Questions

freeing container does it free the child both allocated on heap. I do not think that

byMR

November 20, 2021

Questions

JAVA – How to get text after a particular character?

byMR

November 20, 2021

Finding common words in a column based on values from another column

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

How to print out actual object value instead of memory address in gdb?

Creating an object from two arrays

Problem with Java Array finding prime numbers

Python regexp to get substring contains '/\'

freeing container does it free the child both allocated on heap. I do not think that

JAVA – How to get text after a particular character?

Keep Up to Date with the Most Important News

Finding common words in a column based on values from another column

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

How to print out actual object value instead of memory address in gdb?

Creating an object from two arrays

Problem with Java Array finding prime numbers

Python regexp to get substring contains '/\'

freeing container does it free the child both allocated on heap. I do not think that

JAVA – How to get text after a particular character?

Discover more from Dev solutions