Home R – Convert various csv numeric columns to date

Questions

R – Convert various csv numeric columns to date

March 18, 2024

I have a csv datasheet with 7 columns filled with numeric values.
3 of these columns represent the date of the measurements: "YYYY", "MM", "DD", followed by 4 columns of relevant corresponding data: "qobs", "ckhs", "qceq", "qcol".

How do I convert the three first columns filled with numeric values into a date-datatype, while maintaining the dependency of the dates to the corresponding date?

#   YYYY, MM, DD, qobs, ckhs, qceq, qcol
# 1 1981, 1, 1, 7.136, 0, 0, 0
# 2 1981, 1, 2, 6.76, 0, 0, 0
# 3 1981, 1, 3, 10.886, 0, 0, 0
# ...

I looked online and only found solutions using the as.Date function that correspond to a single character string. I’m fairly new to programming and have only used R for a couple of days, so an elementary explanation would be greatly appreciated.

>Solution :

A tiydverse solution:

library(vroom)
library(dplyr)
library(lubridate) # a truly wonderful package for this kind of thing

df <- vroom("path-to-your-file.csv"
            col_types = "iiidddd")

df <-
  mutate(
    df, 

    date = make_date(YYYY, MM, DD)

   .keep = "unused", # drop the columns used for computation
   .before = qobs
   )

Explanation

vroom::vroom() is a really useful (and really fast!) function for reading plaintext data into R. It guesses the delimiter from the data and is generally pretty easy to implement.

dplyr::mutate() is a staple of tidyverse data manipulation. It computes new columns within dataframes, or modifies existing columns by overwriting them with new values. Here, we are computing a new column called date using lubridate::make_date(), which does what it says on the tin.

We also specify some of mutate()‘s named arguments:

.keep = "unused" lets us automatically drop all of the columns we used to calculate our new variable, because we no longer need the YYYY, MM or DD columns
.before = qobs just makes our new date column appear in front of qobs, on the left-hand-side of our dataframe.

Edit: I was previously implementing the convoluted:

paste(YYYY, MM, DD, sep = ",") |>
lubridate::ymd()

Thanks to Adriano for showing me that make_date() exists!

type-conversion

byMR

Published March 18, 2024

Add a comment

Is it possible to use MassTransit transactional outbox with multiple db contexts?

byMR

March 18, 2024

Questions

How to view histograms juxtaposed using matplotlib

byMR

March 18, 2024

Questions

How to create a Google Cloud Job/Service/Run based on a Docker image

byMR

March 18, 2024

Questions

Simulate an inner join between two JSON responses of an API call

byMR

March 18, 2024

Questions

Keep all matched rows when reshaping from long to wide

byMR

March 18, 2024

R – Convert various csv numeric columns to date

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Explanation

Like this:

Leave a ReplyCancel reply

Read more

Is it possible to use MassTransit transactional outbox with multiple db contexts?

How to view histograms juxtaposed using matplotlib

How to create a Google Cloud Job/Service/Run based on a Docker image

Simulate an inner join between two JSON responses of an API call

Keep all matched rows when reshaping from long to wide

Keep Up to Date with the Most Important News

R – Convert various csv numeric columns to date

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Explanation

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

Is it possible to use MassTransit transactional outbox with multiple db contexts?

Merge list of dfs AND extract index as a new column

How to view histograms juxtaposed using matplotlib

How to create a Google Cloud Job/Service/Run based on a Docker image

Simulate an inner join between two JSON responses of an API call

Keep all matched rows when reshaping from long to wide

Discover more from Dev solutions