Replace value based on match between two dataframes

January 10, 2022

Let’s say I have two starting data frames:

df1 <- data.frame(code1 = c("a", "b","z"), code2 = c("2", "3", "4"))
df2 <- data.frame(code1 = c("c", "o", "p"), code2 = c("2", "4", "5"), 
                  column3 = "a", column4 = "b", column5 = "c")

I want to match the two data frames on the column ‘code2’ and, wherever there is a match, replace the value of code1 in df1 with the value of code1 in df2, so that the final data frame looks like this:

df3 <- data.frame(code1 = c("c", "b", "o"), code2 = c("2", "3", "4"))

>Solution :

Here’s a solution with dplyr. It "looks up" code1 in df2 wherever code2 matches: match() returns NA for any code2 in df1 that has no counterpart in df2, and coalesce() then falls back to the original code1 from df1.

Solution

library(dplyr)


# ...
# Code to generate 'df1' and 'df2'.
# ...


df1 %>% mutate(code1 = coalesce(
  # Look up the 'code1' according to 'code2'...
  df2$code1[match(code2, df2$code2)],
  # ...and otherwise default to the original 'code1'.
  code1
))
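To make the lookup easier to follow, here is a minimal base R sketch of the same idea, broken into its intermediate steps. It is not the answer's own code: the names idx, looked_up and result are introduced purely for illustration, and stringsAsFactors = FALSE is added on the assumption that the columns should be plain character vectors (the default since R 4.0).

```r
# Same example data as in the question; stringsAsFactors = FALSE is an
# added assumption so the columns are character on older R versions too.
df1 <- data.frame(code1 = c("a", "b", "z"), code2 = c("2", "3", "4"),
                  stringsAsFactors = FALSE)
df2 <- data.frame(code1 = c("c", "o", "p"), code2 = c("2", "4", "5"),
                  column3 = "a", column4 = "b", column5 = "c",
                  stringsAsFactors = FALSE)

# Position in df2 of each df1$code2 value; NA where there is no match.
idx <- match(df1$code2, df2$code2)   # 1 NA 2

# Indexing with NA yields NA, so unmatched rows stay missing here.
looked_up <- df2$code1[idx]          # "c" NA "o"

# Fall back to the original code1 wherever the lookup found nothing
# (illustrative base R stand-in for dplyr::coalesce()).
result <- df1
result$code1 <- ifelse(is.na(looked_up), df1$code1, looked_up)

print(result)
```

Note that match() returns the position of the first match only, so if df2$code2 contained duplicates, the first matching row would supply the replacement.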

Result

Given df1 and df2 as in your example:

df1 <- data.frame(
  code1 = c("a", "b", "z"),
  code2 = c("2", "3", "4")
)

df2 <- data.frame(
  code1 = c("c", "o", "p"),
  code2 = c("2", "4", "5"),
  column3 = "a",
  column4 = "b",
  column5 = "c"
)

this solution should yield the desired result:

  code1 code2
1     c     2
2     b     3
3     o     4

Note

One advantage of using match() rather than a dplyr::*_join(): the result keeps only the columns of df1, so no additional steps are needed to drop the extra columns (column3, column4, column5) that a join would bring in from df2.
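For comparison, a join-based version might look roughly like the sketch below. It is only an illustration of the extra steps the note refers to, not part of the original answer: the helper column name code1_new and the variable joined are hypothetical, and stringsAsFactors = FALSE is again an added assumption.

```r
library(dplyr)

# Same example data as in the question (character columns assumed).
df1 <- data.frame(code1 = c("a", "b", "z"), code2 = c("2", "3", "4"),
                  stringsAsFactors = FALSE)
df2 <- data.frame(code1 = c("c", "o", "p"), code2 = c("2", "4", "5"),
                  column3 = "a", column4 = "b", column5 = "c",
                  stringsAsFactors = FALSE)

joined <- df1 %>%
  # Extra step 1: trim df2 to the key and the lookup value before joining,
  # renaming code1 (hypothetical name 'code1_new') to avoid a clash.
  left_join(select(df2, code2, code1_new = code1), by = "code2") %>%
  # Prefer the looked-up value, falling back to the original code1.
  mutate(code1 = coalesce(code1_new, code1)) %>%
  # Extra step 2: drop the helper column again.
  select(-code1_new)

print(joined)
```

The result is the same as with match(), but the join needs a select() before and after to keep df2's other columns out of the output.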

byMR

Published January 10, 2022
