Remove all occurrences of duplicate lines in bash or Python and keep only the unique lines

I have already tried the solution suggested here, but it gives me an empty file, even though the file does contain unique, non-duplicated lines.

I have a large text file (2GB) containing very long strings in each line.

AB02819380213.   : (( 00 99   -   MO:ASKDJIO*U* HIUGHUHAHUHHA AUCCGTCTTCTTTTTTA FFFFFFFFFFFFFFFFFFFFF:FFFFFFFFFFFFFFFFF
a01219f8b
NJSAJDH*)8888-   + 99 100.    -   NKJJABHASDGASGYUOISADIJIJA  TCTCTCTTTCTACACTAATCACAATACTACA FFFFFFFFFFF
a023129ab
NJSAJDH*)8888-   + 99 100.    -   NKJJABHASDGASGYUOISADIJIJA  TCTCTCTTTCTACACTAATCACAATACTACA FFFFFFFFFFF
000axa2381a
AB02819380213.   : (( 00 99   -   MO:ASKDJIO*U* HIUGHUHAHUHHA AUCCGTCTTCTTTTTTA FFFFFFFFFFFFFFFFFFFFF:FFFFFFFFFFFFFFFFF

The expected output here would be

a01219f8b
a023129ab
000axa2381a

How can I do this in bash or Python?

Solution:

If you are not worried about the ordering of the output:

$ awk '{a[$0]++}END{for (i in a) if (a[i] == 1) print i}' file
000axa2381a
a01219f8b
a023129ab

The array a holds the occurrence count of each line; at the end, only the lines whose count is exactly 1 are printed.
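If you want the same result in Python, or need to preserve the original line order, here is a minimal two-pass sketch (assuming the per-line counts fit in memory; the input file name "file" is just a placeholder):

from collections import Counter

# First pass: count how many times each line occurs.
with open("file") as f:
    counts = Counter(line.rstrip("\n") for line in f)

# Second pass: print only the lines that occur exactly once,
# in their original order.
with open("file") as f:
    for line in f:
        line = line.rstrip("\n")
        if counts[line] == 1:
            print(line)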
