Efficiently add a value to a new column in a large DataFrame

April 13, 2023

I have two dataframes, adv_text with about 9,000 rows and events with over 900,000 rows. events is essentially an expanded version of adv_text with about 100 rows per row in adv_text. I want to add three columns from adv_text to events.

The following code is a partial test that adds just one of the columns to the first 30,000 rows.

events_x = events.head(30000).copy()

def add_date(game_id):
    # Scans all of adv_text for every row of events_x, which is why this is slow
    date = adv_text[adv_text['id_odsp'] == game_id]['date']
    return date.iloc[0]

events_x['date'] = events_x['id_odsp'].apply(add_date)

This test code takes almost 25 seconds for 30,000 rows. At this speed, adding all three columns over the full dataframe will take nearly 40 minutes. Is this typical? Is there a faster way to accomplish this task?

Solution:

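If I understand correctly, one way is to use merge. A left join keeps every row of events_x, and assigning the merged frame back avoids relying on the merged result's new index lining up with the index of events_x:

events_x = events_x.merge(adv_text[['id_odsp', 'date']], on='id_odsp', how='left')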

More information: Pandas Merging 101
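
For all three columns at once, the same idea can be sketched on toy data. This is only an illustration under assumptions: the question names just 'date', so 'league' and 'season' are hypothetical column names, and the small frames below stand in for the real adv_text and events. It also assumes each id_odsp appears once in adv_text, which validate='many_to_one' checks.

```python
import pandas as pd

# Toy stand-ins for the real frames. Only 'id_odsp' and 'date' come from the
# question; 'league', 'season' and 'event_type' are hypothetical.
adv_text = pd.DataFrame({
    "id_odsp": ["g1", "g2", "g3"],
    "date": ["2011-08-05", "2011-08-06", "2011-08-07"],
    "league": ["D1", "F1", "E0"],
    "season": [2012, 2012, 2012],
})
events = pd.DataFrame({
    "id_odsp": ["g1", "g1", "g2", "g3", "g3", "g3"],
    "event_type": [1, 3, 1, 2, 1, 3],
})

cols = ["date", "league", "season"]

# One left join brings over all three columns in a single pass.
# validate raises MergeError if id_odsp is duplicated in adv_text,
# which would otherwise silently multiply rows in the result.
merged = events.merge(
    adv_text[["id_odsp"] + cols],
    on="id_odsp",
    how="left",
    validate="many_to_one",
)

# Alternative for a single column: look each key up in a Series indexed by id_odsp.
date_by_game = adv_text.set_index("id_odsp")["date"]
dates_via_map = events["id_odsp"].map(date_by_game)
```

Both versions do one keyed lookup over the whole column rather than filtering adv_text once per row, which is where the apply version spends its time.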

by MR