Home Replace duplicate values across columns of CSV file

Questions

Replace duplicate values across columns of CSV file

November 13, 2024

I have a headerless CSV file that is sorted on the first column. When the 2nd and 3rd columns are identical, I want to "collapse" them into one – i.e. replace the last column with a comma, which would combine with the first comma to make ,,, indicating the third column is empty. In other words, this:

0000001,11111,66666
0000002,12121,22222
0000003,33333,33333
0000004,74747,44444
0000005,12345,12345

…becomes this:

0000001,11111,66666
0000002,12121,22222
0000003,33333,,
0000004,74747,44444
0000005,12345,,

I’ve tried various permutations of grep and cut but can’t get anything to work – the closest I’ve come is cut -c 8-19 file.csv, which just isolates the 2nd and 3rd columns. I have a feeling needing to do this across columns and needing to replace the value instead of just delete the whole line makes this complicated enough to require awk or sed, and I don’t know enough about either to know how to approach this.

>Solution :

Using sed

$ sed -E 's/([^,]*,([^,]*),)\2/\1,/' input_file
0000001,11111,66666
0000002,12121,22222
0000003,33333,,
0000004,74747,44444
0000005,12345,,

byMR

Published November 13, 2024

Add a comment

How can I fix the Python REPL in VS Code with Python 3.13?

byMR

November 14, 2024

Questions

getYouTubeThumbnail function not returning the correct thumbnail URL?

byMR

November 14, 2024

Questions

Is there are any way to add 2 models as auth-user-model?

byMR

November 14, 2024

Questions

react query is refreshing hook form after successful submission using useMutation

byMR

November 14, 2024

Questions

Delete all files in a folder (but not the folder itself) except for specific type files

byMR

November 14, 2024

Questions

Laravel Livewire: data is not updated

byMR

November 14, 2024

Replace duplicate values across columns of CSV file

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

How can I fix the Python REPL in VS Code with Python 3.13?

getYouTubeThumbnail function not returning the correct thumbnail URL?

Is there are any way to add 2 models as auth-user-model?

react query is refreshing hook form after successful submission using useMutation

Delete all files in a folder (but not the folder itself) except for specific type files

Laravel Livewire: data is not updated

Keep Up to Date with the Most Important News

Replace duplicate values across columns of CSV file

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

How can I fix the Python REPL in VS Code with Python 3.13?

getYouTubeThumbnail function not returning the correct thumbnail URL?

Is there are any way to add 2 models as auth-user-model?

react query is refreshing hook form after successful submission using useMutation

Delete all files in a folder (but not the folder itself) except for specific type files

Laravel Livewire: data is not updated

Discover more from Dev solutions