Home Map dataframe function without lambda

Questions

Map dataframe function without lambda

January 5, 2023

I have the following function:

def summarize(text, percentage=.6):
    import numpy as np
    sentences = nltk.sent_tokenize(text)
    sentences = sentences[:int(percentage*len(sentences))]
    summary = ''.join([str(sentence) for sentence in sentences])
    return summary

And I want to map it to dataframe rows. It works pretty well when I use the following code :

df['summary'] = df['text'].map(summarize)

However, when I want to change the percentage variable in this call, it does df['summary'] = df['text'].map(summarize(percentage=.8)), it shows an error indicating it requires another argument, which is text. Of course, it can be resolved using a lambda function as follows:

df['summary'] = df['text'].map(lambda x: summarize(x, percentage=.8))

But I do not want use the lambda in the call. Is there any method to do it otherwise? For example using kwargs inside the function to refer to the text column in the dataframe? Thank you

>Solution :

Possible solution is use Series.apply instead map, then is possible add parameters without lambda like named arguments:

df['summary'] = df['text'].map(summarize, percentage=.8)

TypeError: map() got an unexpected keyword argument ‘percentage’

df['summary'] = df['text'].apply(summarize, percentage=.8)

pandas

byMR

Published January 05, 2023

Add a comment

Delimit a column in R based on 2 characters

byMR

January 5, 2023

Questions

Python sum over map over list

byMR

January 5, 2023

Questions

Code for checking if file/record already exists

byMR

January 5, 2023

Questions

How to merge and squash at the same time

byMR

January 5, 2023

Questions

Global Variable becomes local

byMR

January 5, 2023

Questions

Pivot a column so repeated values/records are placed in 1 cell

byMR

January 5, 2023

Map dataframe function without lambda

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

Delimit a column in R based on 2 characters

Python sum over map over list

Code for checking if file/record already exists

How to merge and squash at the same time

Global Variable becomes local

Pivot a column so repeated values/records are placed in 1 cell

Keep Up to Date with the Most Important News

Map dataframe function without lambda

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

Delimit a column in R based on 2 characters

Python sum over map over list

Code for checking if file/record already exists

How to merge and squash at the same time

Global Variable becomes local

Pivot a column so repeated values/records are placed in 1 cell

Discover more from Dev solutions