Home Python: pandas groupby two columns, without merging them

Questions

Python: pandas groupby two columns, without merging them

October 8, 2024

My dataframe looks like this:

| col1 | col2 | col3 |
| ---- | ---- | ---- |
|  1   | abc  | txt1 |
|  1   | abc  | txt2 |
|  2   | abc  | txt3 |
|  1   | xyz  | txt4 |
|  2   | xyz  | txt5 |

I want to merge the text in col3 between rows only if the rows have the same value in col1 AND the rows have same value in col2.

Expected result:

| col1 | col2 | col3       |
| ---- | ---- | ---------- |
|  1   | abc  | txt1, txt2 |
|  2   | abc  | txt3       |
|  1   | xyz  | txt4       |
|  2   | xyz  | txt5       |

I have used this:

df = df.groupby([df[col1], df[col2]]).aggregate({'col3': ', '.join})

Which joins the col3 correctly, but it also merges col1 and col2 into one column (list). How can I achieve the expected result while keeping 3 separate columns (col1, col2, col3)?

>Solution :

A possible solution, which:

Performs a group-by operation using two columns, col1 and col2, as the grouping keys.
It then aggregates the values in col3 for each group by applying a lambda function that concatenates the values into a single string, with each value separated by a comma.

(df.groupby(['col1', 'col2'], as_index=False)
 .agg({'col3': lambda x: ', '. join(x)}))

Output:

   col1 col2        col3
0     1  abc  txt1, txt2
1     1  xyz        txt4
2     2  abc        txt3
3     2  xyz        txt5

pandas

byMR

Published October 08, 2024

Add a comment

Interpolating Values from Lists in R Using approx() in R

byMR

October 8, 2024

Questions

Find space in C

byMR

October 8, 2024

Questions

Else clause of ifelse() not executing

byMR

October 8, 2024

Questions

How to write this Excel in-cell formula "properly" and more compact?

byMR

October 9, 2024

Questions

Google sheets open multiple hyperlinks when filtered! openallLinks()

byMR

October 9, 2024

Questions

Error while vectorizing a function containing while loop

byMR

October 9, 2024

Python: pandas groupby two columns, without merging them

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

Interpolating Values from Lists in R Using approx() in R

Find space in C

Else clause of ifelse() not executing

How to write this Excel in-cell formula "properly" and more compact?

Google sheets open multiple hyperlinks when filtered! openallLinks()

Error while vectorizing a function containing while loop

Keep Up to Date with the Most Important News

Python: pandas groupby two columns, without merging them

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

Interpolating Values from Lists in R Using approx() in R

Find space in C

Else clause of ifelse() not executing

How to write this Excel in-cell formula "properly" and more compact?

Google sheets open multiple hyperlinks when filtered! openallLinks()

Error while vectorizing a function containing while loop

Discover more from Dev solutions