
Pandas filter rows based on certain number of certain columns being NaN

May 5, 2022

I have a data set like this:

seq S01-T01 S01-T02 S01-T03 S02-T01 S02-T02 S02-T03 S03-T01 S03-T02 S03-T03
A   NaN       4       5       NaN     4       7       NaN       6       8
B   7         2       9       2       1       9       2         1       1 
C   NaN       4       4       2       4       NaN     2         6       8
D   5         NaN     NaN     2       5       9       NaN       1       1

I want to remove the rows where at least three of the columns marked ‘T01’ are NaN.

So the output would be:

seq S01-T01 S01-T02 S01-T03 S02-T01 S02-T02 S02-T03 S03-T01 S03-T02 S03-T03
B   7         2       9       2       1       9       2         1       1 
C   NaN       4       4       2       4       NaN     2         6       8
D   5         NaN     NaN     2       5       9       NaN       1       1

Row A is removed because it has NaN in S01-T01, S02-T01, and S03-T01. Row D also has three NaNs, but it is kept because I only want to remove rows that have >=3 NaNs in the columns whose names contain T01.

I know this should be simple to do. I wrote:

import sys
import pandas as pd

df = pd.read_csv('data.csv',sep=',')
print(df.columns.str.contains['T01'])

The idea was to first find all of the columns with T01 in their names, and then count the NaNs in them.

I got the error:

    print(df.columns.str.contains['T01'])
TypeError: 'method' object is not subscriptable

Then I thought about iterating through the rows and counting instead e.g.:

for index,row in df.iterrows():
        if 'T01' in row:
                print(row)

This runs without error but prints nothing to screen. Could someone demonstrate a better way to do this?

>Solution :

If you select only the ‘T01’ columns, you can count the nulls in each row and keep only the rows where that count is less than 3.

df.loc[df[[x for x in df if 'T01' in x]].isnull().sum(1).lt(3)]
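For anyone who wants to try this out, here is a minimal sketch that spells the same idea out step by step. It assumes the data from the question is built inline rather than read from `data.csv`, and the names `t01_mask`, `nan_counts` and `result` are just illustrative. It also uses `df.columns.str.contains('T01')` with parentheses; the `TypeError` in the question comes from using square brackets on a method.

```python
import numpy as np
import pandas as pd

# Sample data copied from the question (built inline instead of read from data.csv)
df = pd.DataFrame(
    {
        "seq": ["A", "B", "C", "D"],
        "S01-T01": [np.nan, 7, np.nan, 5],
        "S01-T02": [4, 2, 4, np.nan],
        "S01-T03": [5, 9, 4, np.nan],
        "S02-T01": [np.nan, 2, 2, 2],
        "S02-T02": [4, 1, 4, 5],
        "S02-T03": [7, 9, np.nan, 9],
        "S03-T01": [np.nan, 2, 2, np.nan],
        "S03-T02": [6, 1, 6, 1],
        "S03-T03": [8, 1, 8, 1],
    }
)

# Boolean mask over the column labels: True where the name contains 'T01'.
# contains() is a method, so it is called with parentheses, not brackets.
t01_mask = df.columns.str.contains("T01")

# Count the NaNs in each row, looking at the T01 columns only
nan_counts = df.loc[:, t01_mask].isna().sum(axis=1)

# Keep the rows that have fewer than three NaNs in those columns
result = df[nan_counts < 3]
print(result)
```

With the sample data, row A has three NaNs in the T01 columns and is dropped, while rows B, C and D are kept. Row D stays because only one of its NaNs falls in a T01 column.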

pandas

byMR

Published May 05, 2022
