How to rank only duplicated rows and without Nan?

June 14, 2023

I have a table with data (the DataFrame built in the example code below).

How can I rank only the duplicated values, leaving NaN unranked as well?
My current output is below; unfortunately, unique values are ranked too:

   Col1   Rn
0   1.0  1.0
1   1.0  2.0
2   1.0  3.0
3   2.0  1.0
4   3.0  1.0
5   4.0  1.0
6   NaN  NaN

The output I need is:

   Col1   Rn
0   1.0  1.0
1   1.0  2.0
2   1.0  3.0
3   2.0  NaN
4   3.0  NaN
5   4.0  NaN
6   NaN  NaN

Example of the code:

import numpy as np
import pandas as pd

df = pd.DataFrame([[1],
                   [1],
                   [1],
                   [2],
                   [3],
                   [4],
                   [np.nan]], columns=['Col1'])
print(df)


# Adding row_number for each pair:
df['Rn'] = df[df['Col1'].notnull()].groupby('Col1')['Col1'].rank(method="first", ascending=True)
print(df)

# I managed to select only the rows needed for a mask, but how can I apply it along with groupby?
m = df.dropna().loc[df['Col1'].duplicated(keep=False)]
print(m)

Thank you!

Solution:

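Build a boolean mask with duplicated(keep=False), which marks every row whose value occurs more than once, and rank only the rows it selects. Rows outside the mask stay NaN when the result is assigned back to df: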

m = df['Col1'].duplicated(keep=False)
df['Rn'] = df[m].groupby('Col1')['Col1'].rank(method="first", ascending=True)
print(df)

Prints:

   Col1   Rn
0   1.0  1.0
1   1.0  2.0
2   1.0  3.0
3   2.0  NaN
4   3.0  NaN
5   4.0  NaN
6   NaN  NaN
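
If the column could contain more than one NaN, duplicated(keep=False) would flag those rows as duplicates of each other. A minimal sketch of a variant that excludes NaN from the mask explicitly, rather than relying on groupby dropping missing keys by default, might look like this. The second NaN row is added here only for illustration and is not part of the original data:

```python
import numpy as np
import pandas as pd

# Same data as the question, plus a second NaN (an assumption for illustration).
df = pd.DataFrame({"Col1": [1, 1, 1, 2, 3, 4, np.nan, np.nan]})

# True only for non-missing values that occur more than once.
m = df["Col1"].notna() & df["Col1"].duplicated(keep=False)

# Rank within each group of equal values; rows outside the mask
# are absent from the result and become NaN on assignment.
df["Rn"] = df[m].groupby("Col1")["Col1"].rank(method="first", ascending=True)
print(df)
```

With this data, rows 0 to 2 get ranks 1, 2 and 3, and every other row, including both NaN rows, is left as NaN in Rn.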

by MR

Published June 14, 2023
