Follow

Follow

Contact

Home Only leave duplicated rows in a dataframe, with R

Questions

Only leave duplicated rows in a dataframe, with R

byMR

January 27, 2023

I have a dataframe that looks like this:

col1	col2	col3
tn1	a	b
tn1	a	c
tn2	d	b
tn3	a	b

And I want to leave only those rows that are duplicated for col1 & col2, keeping BOTH rows:

col1	col2	col3
tn1	a	b
tn1	a	c

I’ve been trying to do this by using unique() or distinct() or anti_join() but can’t figure it out.

>Solution :

Base R:

df[df$col1 %in% df$col1[duplicated(df$col1)],]

  col1 col2 col3
1  tn1    a    b
2  tn1    a    c

duplicates

byMR

Published January 27, 2023

Add a comment

Leave a ReplyCancel reply

Read more

Questions

'tuple' object has no attribute 'split_contents'

byMR

January 27, 2023

Questions

Oracle: Retrieving specific group of records based by date

byMR

January 27, 2023

Questions

Building a Tinder-like swipe feature on SwiftUI

byMR

January 27, 2023

Questions

Get the first item of the list returned by function

byMR

January 27, 2023

Questions

Python – New to Python and I am trying to take a section of a list that is a string and make it a integer

byMR

January 27, 2023

Questions

How to entry repeated value in a row

byMR

January 27, 2023