Home How to convert the column with lists into one hot encoded columns?

Questions

How to convert the column with lists into one hot encoded columns?

December 13, 2024

Assume, there is one DataFrame such as following

import pandas as pd 
import numpy as np 

df = pd.DataFrame({'id':range(1,4), 
                   'items':[['A', 'B'], ['A', 'B', 'C'], ['A', 'C']]})
df
        id  items
        1   [A, B]
        2   [A, B, C]
        3   [A, C]

Is there an efficient way to convert above DataFrame into the following (one-hot encoded columns)? Many Thanks in advance!

   id   items       A   B   C
    1   [A, B]      1   1   0
    2   [A, B, C]   1   1   1
    3   [A, C]      1   0   1

>Solution :

Another possible solution, whose steps are:

First, the explode function is used to transform each item of a list-like to a row, replicating the index values.
Then, the pivot_table function is applied to reshape the data based on the unique values in the items column, aggregating the count of each id for every item. The fill_value=0 ensures that any missing combinations are filled with zeros.
The rename_axis method is used to remove the axis name for the columns.
Finally, reset_index is called to reset the index of the dataframe, turning the index into a column.
The original dataframe df is then merged with this transformed dataframe using the merge function.

df.merge(
    df.explode('items')
    .pivot_table(index='id', columns='items', values='id', aggfunc=len, 
                 fill_value=0)
    .rename_axis(None, axis=1).reset_index())

Output:

   id      items  A  B  C
0   1     [A, B]  1  1  0
1   2  [A, B, C]  1  1  1
2   3     [A, C]  1  0  1

numpy

byMR

Published December 13, 2024

Add a comment

How to Regex InnnerHTML value to input text from copied InnerHTML Value to input text?

byMR

December 13, 2024

Questions

Creating a Manifest v3 URL redirect from directly a URL

byMR

December 13, 2024

Questions

Create a object class "matrix" "array" with dataframe

byMR

December 13, 2024

Questions

In Dart how can I define a Arithmetic Operator as a Variable?

byMR

December 13, 2024

Questions

How to compare multiple columns and update a new column with boolean logic in kdb/q?

byMR

December 14, 2024

Questions

Is there a way to stop a function from being called when assigning to a dictionary?

byMR

December 14, 2024

How to convert the column with lists into one hot encoded columns?

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

How to Regex InnnerHTML value to input text from copied InnerHTML Value to input text?

Creating a Manifest v3 URL redirect from directly a URL

Create a object class "matrix" "array" with dataframe

In Dart how can I define a Arithmetic Operator as a Variable?

How to compare multiple columns and update a new column with boolean logic in kdb/q?

Is there a way to stop a function from being called when assigning to a dictionary?

Keep Up to Date with the Most Important News

How to convert the column with lists into one hot encoded columns?

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

How to Regex InnnerHTML value to input text from copied InnerHTML Value to input text?

Creating a Manifest v3 URL redirect from directly a URL

Create a object class "matrix" "array" with dataframe

In Dart how can I define a Arithmetic Operator as a Variable?

How to compare multiple columns and update a new column with boolean logic in kdb/q?

Is there a way to stop a function from being called when assigning to a dictionary?

Discover more from Dev solutions