Minimum date of each continuous block of data of another column

December 23, 2021

I have a dataframe with time-series data as follows:

      Date      Value
0  2021-12-01     A
1  2021-12-02     A
2  2021-12-03     A
3  2021-12-04     B
4  2021-12-05     B
5  2021-12-06     A
6  2021-12-07     A
7  2021-12-08     C

I’m trying to reduce this to only the first date of each continuous block of identical values in the Value column. The result would look like:

      Date      Value
0  2021-12-01     A
1  2021-12-04     B
2  2021-12-06     A
3  2021-12-08     C

I’ve tried a bunch of different ways of masking, dropping duplicates based on the mask, etc. but cannot do it. Any help is appreciated!

Solution:

You can use ne (not equals) + shift to create a mask that is True at the first row of each consecutive group, and then cumsum to create a unique ID for each group that’s shared by all of its rows.

Then call drop_duplicates on those IDs, and use the index of the remaining rows to index the dataframe:

subset = df.loc[df['Value'].ne(df['Value'].shift(1)).cumsum().drop_duplicates().index]

Output:

>>> subset
         Date Value
0  2021-12-01     A
3  2021-12-04     B
5  2021-12-06     A
7  2021-12-08     C
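
To see what each step of the one-liner does, here is a minimal sketch that rebuilds the sample data from the question and splits the chain into named intermediates. The names starts, block_id, subset_mask and result are made up for illustration and are not part of the original answer. It assumes the rows are already in chronological order, as in the question, so the first row of each block carries its minimum date.

```python
import pandas as pd

# Sample data copied from the question.
df = pd.DataFrame({
    "Date": pd.to_datetime([
        "2021-12-01", "2021-12-02", "2021-12-03", "2021-12-04",
        "2021-12-05", "2021-12-06", "2021-12-07", "2021-12-08",
    ]),
    "Value": ["A", "A", "A", "B", "B", "A", "A", "C"],
})

# True wherever a row's Value differs from the previous row's Value.
# The first row is compared against a missing value, so it is True.
starts = df["Value"].ne(df["Value"].shift())

# Running count of block starts: every row in a block shares one ID.
block_id = starts.cumsum()

# Same selection as the one-liner: keep the first row of each block ID.
subset = df.loc[block_id.drop_duplicates().index]

# The boolean mask on its own selects the same rows.
subset_mask = df[starts]

# Renumber the index from 0 to match the layout asked for in the question.
result = subset.reset_index(drop=True)
print(result)
```

The output above keeps the original row labels (0, 3, 5, 7), while the question's expected result is numbered from 0, which is what reset_index(drop=True) provides. Since starts already marks the first row of each block, indexing with it directly is a shorter route to the same rows.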

by MR

Published December 23, 2021
