How to minimize parameter in row pandas dataframe

January 21, 2022

I have a DataFrame with bus stop arrival forecasts:

path_id | forecast | forecast_made_at | bus_id
 int    | datetime |  datetime        | int

We make predictions every 5 minutes, so the database can contain several entries for the same bus. For example:

At 11:50 we predict bus #11544 will arrive at 11:59
At 11:50 we predict bus #95447 will arrive at 11:55
--......--
At 11:55 we predict bus #11544 will arrive at 12:02

I want to keep only the newest prediction for each bus, i.e. the row with the largest forecast_made_at:

import pandas as pd

rows = []  # DataFrame.append was removed in pandas 2.0, so collect rows in a list
for _, row in t_data.iterrows():
  prediction = row.to_dict()
  # All forecasts for the same bus_id
  forecasts = t_data[t_data["bus_id"] == prediction["bus_id"]]
  # Flag the row if it is the most recent forecast for this bus
  prediction["best"] = prediction["forecast_made_at"] == forecasts["forecast_made_at"].max()
  rows.append(prediction)

res = pd.DataFrame(rows)
res = res[res["best"]]

This code works with dictionaries row by row instead of pandas objects, so it is very slow. How can I do this using pandas tools?

>Solution :

What you need is a combination of grouping by bus_id, sorting by forecast_made_at, and selecting the most recent row for each bus.

One option: sort by forecast_made_at, then drop duplicates by bus_id, keeping only the most recent record:

t_data.sort_values('forecast_made_at').drop_duplicates(subset=['bus_id'], keep='last')
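To see what this returns, here is a minimal sketch on data modelled on the example in the question. The path_id values and the calendar date are made up for illustration; only the bus numbers and times come from the question.

```python
import pandas as pd

# Hypothetical sample data: path_id values and the date are assumptions.
t_data = pd.DataFrame({
    "path_id": [1, 2, 1],
    "forecast": pd.to_datetime(
        ["2022-01-21 11:59", "2022-01-21 11:55", "2022-01-21 12:02"]
    ),
    "forecast_made_at": pd.to_datetime(
        ["2022-01-21 11:50", "2022-01-21 11:50", "2022-01-21 11:55"]
    ),
    "bus_id": [11544, 95447, 11544],
})

# Sort oldest to newest, then keep the last (newest) row seen for each bus.
latest = (
    t_data.sort_values("forecast_made_at")
    .drop_duplicates(subset=["bus_id"], keep="last")
    .reset_index(drop=True)
)
print(latest)
```

The result has one row per bus_id, and for bus #11544 it is the forecast made at 11:55. If two forecasts for the same bus share the same forecast_made_at, only one of them is kept.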

Another option: group by bus_id and select the last record:

t_data.sort_values('forecast_made_at').groupby('bus_id').last().reset_index()
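One caveat with the groupby version: last() returns the last non-null value of each column within a group, so if some columns contain missing values the result can combine values from different rows. A possible alternative, sketched below, picks whole rows with idxmax, and a second variant reproduces the "best" flag from the question with transform. Both assume the DataFrame has a unique index and that forecast_made_at has no missing values; the sample data (path_id values and the date) is made up.

```python
import pandas as pd

# Hypothetical sample data: path_id values and the date are assumptions.
t_data = pd.DataFrame({
    "path_id": [1, 2, 1],
    "forecast": pd.to_datetime(
        ["2022-01-21 11:59", "2022-01-21 11:55", "2022-01-21 12:02"]
    ),
    "forecast_made_at": pd.to_datetime(
        ["2022-01-21 11:50", "2022-01-21 11:50", "2022-01-21 11:55"]
    ),
    "bus_id": [11544, 95447, 11544],
})

# Index label of the most recent forecast for each bus, then select those rows whole.
newest_idx = t_data.groupby("bus_id")["forecast_made_at"].idxmax()
latest = t_data.loc[newest_idx].reset_index(drop=True)

# Variant mirroring the "best" flag: compare each row with its group's maximum.
# Unlike idxmax, this keeps every row that ties for the newest timestamp.
group_max = t_data.groupby("bus_id")["forecast_made_at"].transform("max")
best_rows = t_data[t_data["forecast_made_at"] == group_max]
```

Neither variant needs a sort, and both return original rows unchanged.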

data-science

by MR

Published January 21, 2022
