Home Vectorized alternative for itertuples using file.write()

Questions

Vectorized alternative for itertuples using file.write()

April 21, 2023

Suppose we have a pandas dataframe:

import pandas as pd

data = pd.DataFrame({'columnNM': ['Jerry', 'Bob', 'Phil', 'Bill', 'Mickey', 'Pigpen', 'Robert'], 
                     'columnNM2': ['John', 'Tom', 'Donna', 'Keith', 'Brent', 'Vince', 'Bruce']})

Also suppose we have an open file we are writing to, something opened using:

file = open('myPathExample', 'w')

I want to perform comparison operations, control flow on the data and write back to that file. A simple example would be:

for row in data.itertuples():
    file.write('%s was friends with %s \n' %(row.columnNM, row.columnNM2))

Now, I am a beginner level in python and I have read all over that looping or iterating over rows in a pandas dataframe is not ideal, especially for large datasets. I don’t have the knowledge to understand the full details of why.

Is a good vectorized alternative to itertuples for this example even possible? If so, what is it?

>Solution :

The vectorial alternative would be to build a single string and write once to the file:

file.write('\n'.join(data['columnNM']+' was friends with '+data['columnNM2']))

Or, if you want to keep the loop:

for line in (data['columnNM']+' was friends with '+data['columnNM2']+' \n'):
    file.write(line)

vectorization

byMR

Published April 21, 2023

Add a comment

Much needed KQL query assistance

byMR

April 21, 2023

Questions

is there a way to do one subscription but two callbacks (one debouced?)

byMR

April 21, 2023

Questions

how in pandas mark if a set of column is unique or not?

byMR

April 22, 2023

Questions

finding the position of an element in a nested list

byMR

April 22, 2023

Questions

Macro not correct compiling?

byMR

April 22, 2023

Questions

How to get the (n) largest values from a pandas data frame? And label them as '1' else '0'

byMR

April 22, 2023

Vectorized alternative for itertuples using file.write()

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

Much needed KQL query assistance

is there a way to do one subscription but two callbacks (one debouced?)

how in pandas mark if a set of column is unique or not?

finding the position of an element in a nested list

Macro not correct compiling?

How to get the (n) largest values from a pandas data frame? And label them as '1' else '0'

Keep Up to Date with the Most Important News

Vectorized alternative for itertuples using file.write()

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

Much needed KQL query assistance

is there a way to do one subscription but two callbacks (one debouced?)

how in pandas mark if a set of column is unique or not?

finding the position of an element in a nested list

Macro not correct compiling?

How to get the (n) largest values from a pandas data frame? And label them as '1' else '0'

Discover more from Dev solutions