Home R – substr over multiple columns in Dataframe

Questions

R – substr over multiple columns in Dataframe

August 28, 2022

Lets say I have a dataframe that looks like this

Column1,  Column2,  Column3
 a_2019    b_2020    c_2021
 d_2019    e_2020    f_2021
 a_2019    b_2020    c_2021
 d_2019    e_2020    f_2021

And I would like to take out "_2019", "_2020", and "_2021". I could use

df$Column1 <- substr(df$Column1, 1, nchar(df$Column1)-5)

For every column, but I have multiple dataframes with quite a few columns. substr need a text or a vector for it to work, so using df[,3:10] doesn´t work, lapply either.

Any suggestion on how to achieve this in an elegant way? Thank you

>Solution :

We can try using lapply along with sub for a base R option:

df[cols] <- lapply(df[cols], function(x) sub("_(?:2019|2020|2021)$", "", x))

Here cols should be a vector containing the column names on which you seek to make the replacement.

More generally, to target underscore followed by any number, we can use:

df[cols] <- lapply(df[cols], function(x) sub("_\\d+$", "", x))  # or _\\d{4} for a year

lapply

byMR

Published August 28, 2022

Add a comment

numpy array divide multiple columns by one column

byMR

August 28, 2022

Questions

Console blocking execution because getElementbyId is null (but I want it to be)

byMR

August 28, 2022

Questions

MergSort won't sort the array

byMR

August 28, 2022

Questions

Null Check Operator used on null value – flutter

byMR

August 28, 2022

Questions

How to move an imaage smoothly in pygame

byMR

August 28, 2022

Questions

How to find() Data with object keys with Mongodb

byMR

August 28, 2022

R – substr over multiple columns in Dataframe

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

numpy array divide multiple columns by one column

Console blocking execution because getElementbyId is null (but I want it to be)

MergSort won't sort the array

Null Check Operator used on null value – flutter

How to move an imaage smoothly in pygame

How to find() Data with object keys with Mongodb

Keep Up to Date with the Most Important News

R – substr over multiple columns in Dataframe

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

numpy array divide multiple columns by one column

Console blocking execution because getElementbyId is null (but I want it to be)

MergSort won't sort the array

Null Check Operator used on null value – flutter

How to move an imaage smoothly in pygame

How to find() Data with object keys with Mongodb

Discover more from Dev solutions