Home python dataframe count word occurrences

Questions

python dataframe count word occurrences

May 30, 2022

I have searched a lot here and I couldnt find the answer for it.
I have a dataframe with column "Descriptions" which contain a long string,
I’m trying to count the number of occurence for a specific word "restaurant",

df['has_restaurants'] = 0
for index,text in enumerate(df['Description']):
    text = text.split()
    df['has_restaurants'][index] = (sum(map(lambda count : 1 if 'restaurant' in count else 0, text)))

Did the above and it works but it doesn’t look like a good way to do it and it generates this "error" as well:

See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
  df['has_restaurants'][index] = (sum(map(lambda count : 1 if 'restaurant' in count else 0, text)))

>Solution :

You might simplify that by using .str.count method, consider following simple example

import pandas as pd
df = pd.DataFrame({"description":["ABC DEF GHI","ABC ABC ABC","XYZ XYZ XYZ"]})
df['ABC_count'] = df.description.str.count("ABC")
print(df)

output

   description  ABC_count
0  ABC DEF GHI          1
1  ABC ABC ABC          3
2  XYZ XYZ XYZ          0

count

byMR

Published May 30, 2022

Add a comment

Color different countries with R based on ggplot and map_data

byMR

May 30, 2022

Questions

How to split more than one comma separated column as a separate row in SQL using CROSS APPLY

byMR

May 30, 2022

Questions

Transforming List of dicts to pandas dataframe with dynamic header columns

byMR

May 30, 2022

Questions

How to implement .each() function into piece of code?

byMR

May 30, 2022

Questions

Create ID variable per chain of values

byMR

May 30, 2022

Questions

exclude a folder in git fetch upstream

byMR

May 30, 2022

python dataframe count word occurrences

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

Color different countries with R based on ggplot and map_data

How to split more than one comma separated column as a separate row in SQL using CROSS APPLY

Transforming List of dicts to pandas dataframe with dynamic header columns

How to implement .each() function into piece of code?

Create ID variable per chain of values

exclude a folder in git fetch upstream

Keep Up to Date with the Most Important News

python dataframe count word occurrences

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

Color different countries with R based on ggplot and map_data

How to split more than one comma separated column as a separate row in SQL using CROSS APPLY

Transforming List of dicts to pandas dataframe with dynamic header columns

How to implement .each() function into piece of code?

Create ID variable per chain of values

exclude a folder in git fetch upstream

Discover more from Dev solutions