Home Pandas or R – Merge Rows By Same Value in Column Over NaN values – Look at Example

Questions

Pandas or R – Merge Rows By Same Value in Column Over NaN values – Look at Example

December 21, 2022

I have a very specific dataset it looks something like this:

record_id	event_id	instrument	repeat_inst
PI0005	v03_abc_1	`NaN`	1
PI0005	v03_abc_1	i_sensor	`NaN`
PI0005	v03_abc_1	`NaN`	`NaN`
PI0005	v02_abc_33	i_sensor	`NaN`
PI0005	v02_abc_33	`NaN`	`NaN`
PI0006	v02_abc_1	i_sensor	1
PI0006	v02_abc_1	`NaN`	`NaN`

How do I make it look like this:

record_id	event_id	instrument	repeat_inst
PI0005	v03_abc_1	i_sensor	1
PI0005	v02_abc_33	i_sensor	`NaN`
PI0006	v02_abc_2	i_sensor	1

Where rows with the same record_id and event_id get merged together, where NaN values are replaced with the other value, and if both values are NaN, then NaN can be kept (like in the forth and fifth row in the original dataframe).

Assume that only one of the related cells have a value and all others have NaN.

This should apply to all columns of the data, there are thousands of columns and rows.

I tried using group by, but don’t know how to continue.

>Solution :

With R

library(dplyr)
df1 %>%
   group_by(record_id, event_id) %>%
   summarise(across(everything(),  ~.x[!is.na(.x)][1]))

pandas

byMR

Published December 21, 2022

Add a comment

Get one row per unique column value combination (`DISTINCT ON` operation without using it)

byMR

December 21, 2022

Questions

Problem with algorithm calculating the determinant of N x N square matrix

byMR

December 21, 2022

Questions

making instance of class in c#

byMR

December 21, 2022

Questions

How to handle json object that was stringified in Python

byMR

December 21, 2022

Questions

How to handle json object that was stringified in Python

byMR

December 21, 2022

Questions

ModuleNotFoundError: No module named 'skfeature'

byMR

December 21, 2022

Pandas or R – Merge Rows By Same Value in Column Over NaN values – Look at Example

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

Get one row per unique column value combination (`DISTINCT ON` operation without using it)

Problem with algorithm calculating the determinant of N x N square matrix

making instance of class in c#

How to handle json object that was stringified in Python

How to handle json object that was stringified in Python

ModuleNotFoundError: No module named 'skfeature'

Keep Up to Date with the Most Important News

Pandas or R – Merge Rows By Same Value in Column Over NaN values – Look at Example

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

Get one row per unique column value combination (`DISTINCT ON` operation without using it)

Problem with algorithm calculating the determinant of N x N square matrix

making instance of class in c#

How to handle json object that was stringified in Python

How to handle json object that was stringified in Python

ModuleNotFoundError: No module named 'skfeature'

Discover more from Dev solutions