Using Python, I would like to remove duplicate rows based on the first column but keep all the values in the second column.

I have the dataframe below:

              CVE ID             Product Versions
0      CVE-2022-46689                   Mac OS 12
1      CVE-2022-42856                      Safari
2      CVE-2022-46689             Windows 10 21h1
3      CVE-2022-41121             Windows 10 21h2
4      CVE-2022-42856                      Safari
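
For reference, a minimal sketch to reproduce this frame (column names and values taken from the table above; the pandas import is assumed):

import pandas as pd

# sample data copied from the question
df = pd.DataFrame({
    'CVE ID': ['CVE-2022-46689', 'CVE-2022-42856', 'CVE-2022-46689',
               'CVE-2022-41121', 'CVE-2022-42856'],
    'Product Versions': ['Mac OS 12', 'Safari', 'Windows 10 21h1',
                         'Windows 10 21h2', 'Safari'],
})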

I would like to remove duplicates based on the CVE ID column, but I also want to keep the values from the second column, Product Versions, for each row (dropping a value if it is already present for that CVE ID).

Something like this below:

              CVE ID             Product Versions
0      CVE-2022-46689            Mac OS 12, Windows 10 21h1
1      CVE-2022-42856            Safari
2      CVE-2022-41121            Windows 10 21h2

How should I do it?

Any help is appreciated.

Solution:

Here is one way to do it:

# drop exact duplicate (CVE ID, Product Versions) pairs,
# then group by CVE ID and join the remaining product versions
out = (df.drop_duplicates(subset=['CVE ID', 'Product Versions'])
         .groupby('CVE ID', as_index=False)['Product Versions']
         .agg(', '.join))

out
            CVE ID  Product Versions
0   CVE-2022-41121  Windows 10 21h2
1   CVE-2022-42856  Safari
2   CVE-2022-46689  Mac OS 12, Windows 10 21h1
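
As a variant (not from the original answer, just a sketch of an equivalent pattern), the dedup step can be folded into the aggregation itself by joining only the unique values per group; Series.unique preserves order of first appearance:

# one-step variant: dedup inside the aggregation
out = (df.groupby('CVE ID', as_index=False)['Product Versions']
         .agg(lambda s: ', '.join(s.unique())))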