Home filter dataframe of frozensets if they have a certain elemnet

Questions

filter dataframe of frozensets if they have a certain elemnet

January 5, 2022

I would like to filter a datframe that has association rules results. I want antecedents that contain an element like H or L in my case. The antecedents are frozenset types. I tried Hrules but it is not working.

Hrules=fdem_rules['H'  in fdem_rules['antecedents']]
Hrules=fdem_rules[frozenset({'H'})  in fdem_rules['antecedents']]

did not work

In the following example, I need only rows 46 and 89 as they have H.

df = pd.DataFrame({'antecedents': [frozenset({'N', 'M', '60'}), frozenset({'H', 'AorE'}), frozenset({'0-35', 'H', 'AorE', '60'}), frozenset({'AorE', 'M', '60', '0'}), frozenset({'0-35', 'F'})]})

             antecedents
75            (N, M, 60)
46             (H, AorE)
89   (0-35, H, AorE, 60)
103     (AorE, M, 60, 0)
38             (0-35, F)

>Solution :

set/frozenset methods

You can use apply with set/frozenset’s method. Here to check is at least H or L is present, one can use the negation of {'H', 'L'}.isdisjoint:

match = {'H', 'L'}
df['H or L'] = ~df['antecedents'].apply(match.isdisjoint)

A much faster variant of the above is to use a list comprehension:

match = {'H', 'L'}
df['H or L'] = [not match.isdisjoint(x) for x in df['antecedents']]

explode+isin+aggregate

Another option is to explode the frozenset, use isin, and aggregate the result with groupby+any:

match = {'H', 'L'}
df['H or L'] = df['antecedents'].explode().isin(match).groupby(level=0).any()

output:

>>> df[['antecedents', 'H or L']]
             antecedents  H or L
75            (N, M, 60)   False
46             (H, AorE)    True
89   (0-35, H, AorE, 60)    True
103     (AorE, M, 60, 0)   False
38             (0-35, F)   False

slicing matching rows

match = {'H', 'L'}
idx = [not match.isdisjoint(x) for x in df['antecedents']]
df[idx]

output:

            antecedents consequents other_cols
46            (H, AorE)         (N)        ...
89  (0-35, H, AorE, 60)         (0)        ...

frozenset

byMR

Published January 05, 2022

Add a comment

GROUP BY with flagged value

byMR

January 5, 2022

Questions

can't convert String to MMM/dd/yyy format in java

byMR

January 5, 2022

Questions

How to create relationships between 3 models in django?

byMR

January 5, 2022

Questions

Getting the distance matrix back from already clustered data

byMR

January 5, 2022

Questions

Python how to count group size if there is remainder

byMR

January 5, 2022

Questions

stream_socket_client and server not working

byMR

January 5, 2022

filter dataframe of frozensets if they have a certain elemnet

MEDevel.com: Open-source for Healthcare and Education

>Solution :

set/frozenset methods

explode+isin+aggregate

slicing matching rows

Like this:

Leave a ReplyCancel reply

Read more

GROUP BY with flagged value

can't convert String to MMM/dd/yyy format in java

How to create relationships between 3 models in django?

Getting the distance matrix back from already clustered data

Python how to count group size if there is remainder

stream_socket_client and server not working

Keep Up to Date with the Most Important News

filter dataframe of frozensets if they have a certain elemnet

MEDevel.com: Open-source for Healthcare and Education

>Solution :

set/frozenset methods

explode+isin+aggregate

slicing matching rows

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

GROUP BY with flagged value

can't convert String to MMM/dd/yyy format in java

How to create relationships between 3 models in django?

Getting the distance matrix back from already clustered data

Python how to count group size if there is remainder

stream_socket_client and server not working

Discover more from Dev solutions