Home Filter dataframe for value that exists in all dates

Questions

Filter dataframe for value that exists in all dates

October 24, 2023

Say I have a dataframe like this:

df = pd.DataFrame({
    'PortDt': ['2022-01-31', '2022-02-28', '2022-02-28', '2022-03-31', '2022-03-31'],
    'loannum': ['111', '111', '222', '111', '333']
})

I want to filter the dataset so that I am left with only records who appear in every distinct value for PortDt.

For this example, the result would be:

PortDt     |   loannum
-----------+-------------
2022-01-31 |  111
2022-02-28 |  111
2022-03-31 |  111

>Solution :

Using groupby.transform with ‘nunique’ and comparing to the overall number of unique values:

out = df[df.groupby('loannum')['PortDt']
           .transform('nunique').eq(df['PortDt'].nunique())]

Or same logic with better efficiency:

s = df.groupby('loannum')['PortDt'].nunique().eq(df['PortDt'].nunique())

df[df['loannum'].isin(s[s].index)]

Or with crosstab+all instead of groupby:

s = pd.crosstab(df['PortDt'], df['loannum']).all()
out = df[df['loannum'].isin(s[s].index)]

Output:

       PortDt loannum
0  2022-01-31     111
1  2022-02-28     111
3  2022-03-31     111

dataframe

byMR

Published October 24, 2023

Add a comment

Composable not recomposing on Android API 26 (Oreo) after changing the value of a MutableState

byMR

October 24, 2023

Questions

Influence of relative vs. absolute URLs on loading time

byMR

October 24, 2023

Questions

R: Yearmon producing NA values

byMR

October 24, 2023

Questions

gnuplot: customize labels on bar plots

byMR

October 24, 2023

Questions

How can I limit the amount of rows returned in my report?

byMR

October 24, 2023

Questions

C++ threads – Are threaded-function's function calls part of the thread?

byMR

October 24, 2023

Filter dataframe for value that exists in all dates

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

Composable not recomposing on Android API 26 (Oreo) after changing the value of a MutableState

Influence of relative vs. absolute URLs on loading time

R: Yearmon producing NA values

gnuplot: customize labels on bar plots

How can I limit the amount of rows returned in my report?

C++ threads – Are threaded-function's function calls part of the thread?

Keep Up to Date with the Most Important News

Filter dataframe for value that exists in all dates

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

Composable not recomposing on Android API 26 (Oreo) after changing the value of a MutableState

Influence of relative vs. absolute URLs on loading time

R: Yearmon producing NA values

gnuplot: customize labels on bar plots

How can I limit the amount of rows returned in my report?

C++ threads – Are threaded-function's function calls part of the thread?

Discover more from Dev solutions