Home How to down sample a dataframe in Python based on condition

Questions

How to down sample a dataframe in Python based on condition

November 15, 2021

I am new here so don’t know how to use this site.

I have a timeseries data of 37404 ICU Patients. Each patient have multiple rows. I want to down sample my dataframe and select only 2932 patients (all rows of the respective patient ID). Can anyone help me? My data looks like this:

HR	SBP	DBP	P_ID
92	120	80	0
98	115	85	0
93	125	75	1
95	130	90	1
102	120	80	1
109	115	75	2
94	135	100	2
97	100	70	3
85	120	80	4
88	115	75	4
93	125	85	4
78	130	90	5
115	140	110	5
102	120	80	5
98	140	110	5

I know I should use some condition on P_ID column, but I am confused.

Thanks for the help.

>Solution :

Use numpy.random.choice for random P_ID and filter in Series.isin with boolean indexing:

df2 = df[df['P_ID'].isin(np.random.choice(df['P_ID'].unique(), size=2932, replace=False))]

Alternative:

df2 = df[df['P_ID'].isin(df['P_ID'].drop_duplicates().sample(n=2932))]

EDIT: For random positions use:

df1 = df['P_ID'].drop_duplicates().sample(n=2932).to_frame('P_ID')

df2 = df.merge(df1, how='right')

pandas-resample

byMR

Published November 15, 2021

Add a comment

using express.json() after raw-body

byMR

November 15, 2021

Questions

Box and Unbox in ExpressionParameter or how to generically wrap every method

byMR

November 15, 2021

Questions

Angular: SVG attribute by event

byMR

November 15, 2021

Questions

Replace a string with array values in javascript

byMR

November 15, 2021

Questions

Dynamic memory allocation | Unable to write on location

byMR

November 15, 2021

Questions

Fastest "trivial" way of shuffling a vector

byMR

November 15, 2021

How to down sample a dataframe in Python based on condition

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

using express.json() after raw-body

Box and Unbox in ExpressionParameter or how to generically wrap every method

Angular: SVG attribute by event

Replace a string with array values in javascript

Dynamic memory allocation | Unable to write on location

Fastest "trivial" way of shuffling a vector

Keep Up to Date with the Most Important News

How to down sample a dataframe in Python based on condition

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

using express.json() after raw-body

Box and Unbox in ExpressionParameter or how to generically wrap every method

Angular: SVG attribute by event

Replace a string with array values in javascript

Dynamic memory allocation | Unable to write on location

Fastest "trivial" way of shuffling a vector

Discover more from Dev solutions