Home Add new rows to data frame, where one column stays the same while other column changes values

Questions

Add new rows to data frame, where one column stays the same while other column changes values

July 13, 2022

sorry, if my title sounds a bit confusing. What I’m basically trying to do is adding new rows in a data frame, where I duplicate the value of each unique value of one column, while another column’s new values are changing.

This is what my data frame looks like:

id	year
01	2022
02	2022
03	2022
…	…
99	2022

And I want it to look like this:

id	year
01	2022
01	2023
01	2024
02	2022
02	2023
02	2024
03	2022
…	…
99	2024

I.e. I want for every id to add the years 2023 and 2024 in the year column. I tried doing this with an apply function, but it always didn’t work out, could you guys help me out in solving this?

>Solution :

You can simply make a list comprehension and concat all dataframe years wirh increments of your desire. For example:

pd.concat([df.assign(year=df.year+increment) for increment in range(0,3)]).sort_values(by='id').reset_index(drop=True)

This will increment your dataframe to three years as follows. You can play around with range for the desired number of extensions:

id	year
1	2022
1	2023
1	2024
2	2022
2	2023
2	2024
3	2022
3	2023
3	2024

dataframe

byMR

Published July 13, 2022

Add a comment

Is it better to replace new ActionListener() with lambda? And why?

byMR

July 13, 2022

Questions

script taking long time to run due to multiple api calls

byMR

July 13, 2022

Questions

the second elif condition doesnt get executed here?

byMR

July 13, 2022

Questions

Is there any cache advantage to using ADD <url> vs RUN wget/curl <url> in a Dockerfile

byMR

July 13, 2022

Questions

count the frequency with little adjustment in the order within row

byMR

July 13, 2022

Questions

Strange behavior from lag(), skipping over certain rows

byMR

July 13, 2022

Add new rows to data frame, where one column stays the same while other column changes values

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

Is it better to replace new ActionListener() with lambda? And why?

script taking long time to run due to multiple api calls

the second elif condition doesnt get executed here?

Is there any cache advantage to using ADD <url> vs RUN wget/curl <url> in a Dockerfile

count the frequency with little adjustment in the order within row

Strange behavior from lag(), skipping over certain rows

Keep Up to Date with the Most Important News

Add new rows to data frame, where one column stays the same while other column changes values

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

Is it better to replace new ActionListener() with lambda? And why?

script taking long time to run due to multiple api calls

the second elif condition doesnt get executed here?

Is there any cache advantage to using ADD <url> vs RUN wget/curl <url> in a Dockerfile

count the frequency with little adjustment in the order within row

Strange behavior from lag(), skipping over certain rows

Discover more from Dev solutions