
Convert tuples into grouped rows in dataframe without changing the order

June 20, 2022

I have a list of tuples and I need to convert it to a DataFrame.

res1_ =  [
  ('z1', '1'),
  ('z1', '2'),
  ('x1', '1'),
  ('x2', '1'),
  ('x1', '3'),
  ('z1', '1')]

My expected DataFrame should look like this:

docid secid
z1    [1,2]
x1    [1]
x2    [1]
x1    [3]
z1    [1]

Note that the order is unchanged, and when the same docid repeats on consecutive rows, the secids are merged into a single list.
Although x1 occurs twice, secids 1 and 3 are not merged into one list, because docid x2 sits between the two x1 rows.

I tried:

import pandas as pd

df = pd.DataFrame(res1_, columns=['docid', 'secid'])
df.groupby('docid')['secid'].apply(list)

But no luck: I lose the original order, and the two separate x1 rows are grouped together as well.
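
To make the problem concrete, here is a minimal sketch of what that attempt returns on the sample data. The variable name `attempt` is only illustrative.

```python
import pandas as pd

res1_ = [('z1', '1'), ('z1', '2'), ('x1', '1'),
         ('x2', '1'), ('x1', '3'), ('z1', '1')]

df = pd.DataFrame(res1_, columns=['docid', 'secid'])

# groupby on the column collects every row with the same docid,
# wherever it appears, and sorts the group keys by default.
attempt = df.groupby('docid')['secid'].apply(list)

print(attempt.index.tolist())  # group keys, in sorted order
print(attempt.tolist())        # one merged list per distinct docid
```

Each distinct docid ends up in exactly one row, so the separate runs of x1 and z1 are merged.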

Any pointers appreciated.

Thank you.

>Solution :

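You can use the DataFrame constructor, label each run of consecutive equal docid values (compare every row with the previous one using shift, then take the cumulative sum), and aggregate with GroupBy.agg: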

import pandas as pd

df = pd.DataFrame(res1_, columns=['docid', 'secid'])
# the label increases by 1 each time docid differs from the previous row
group = df['docid'].ne(df['docid'].shift()).cumsum()
df = df.groupby(group.values).agg({'docid': 'first', 'secid': list})

output:

  docid   secid
1    z1  [1, 2]
2    x1     [1]
3    x2     [1]
4    x1     [3]
5    z1     [1]
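
The intermediate steps can be easier to follow when spelled out. The following is a self-contained sketch of the same shift/cumsum idea, using the question's column name secid. The names `starts_new_run`, `run_id`, and `out` are illustrative, and the final `reset_index(drop=True)` is an optional addition for a 0-based index.

```python
import pandas as pd

res1_ = [('z1', '1'), ('z1', '2'), ('x1', '1'),
         ('x2', '1'), ('x1', '3'), ('z1', '1')]

df = pd.DataFrame(res1_, columns=['docid', 'secid'])

# True wherever docid differs from the previous row. The first row is
# always True because shift() leaves a missing value there.
starts_new_run = df['docid'].ne(df['docid'].shift())

# The cumulative sum turns those flags into one label per consecutive run.
run_id = starts_new_run.cumsum()

# Group by the run label instead of docid, so repeated docids that are
# not adjacent stay in separate rows and the original order is kept.
out = (
    df.groupby(run_id.values)
      .agg({'docid': 'first', 'secid': list})
      .reset_index(drop=True)
)

print(run_id.tolist())
print(out)
```

Because the run labels increase in row order, the default key sorting in groupby does not reorder the result.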


by MR
