Pivot and Concatenate columns in pyspark dataframe

August 23, 2022

I have the dataframe below, and I need to produce a single row with all the marks fields concatenated with a delimiter such as a pipe.
For example: PACKAGING MARKS 3|PACKAGING MARKS 2|PACKAG…..

The number of marks records can vary for each mid.

mid	marksId	id	index	marks
2	3	3	2	PACKAGING MARKS 3
2	3	3	1	PACKAGING MARKS 2
2	3	3	0	PACKAGING MARKS 1
2	4	4	2	PACKAGING MARKS 23
2	4	4	1	PACKAGING MARKS 22
2	4	4	0	PACKAGING MARKS 21

Thanks

>Solution :

Assuming you want one delimited string for each "mid", you can collect all "marks" values with collect_list() and join them into a single string with concat_ws():

import pyspark.sql.functions as F

df.groupby('mid').agg(F.concat_ws('|', F.collect_list('marks')).alias('marks_str')).show(truncate=False)


by MR

Published August 23, 2022
