Home How to apply onehot encoder over vectorized dataframe columns?

Questions

How to apply onehot encoder over vectorized dataframe columns?

November 14, 2022

Suppose that we have this data frame:

ID	CATEGORIES
0	[‘A’]
1	[‘A’, ‘C’]
2	[‘B’, ‘C’]

And I want to apply one hot encoder to categories column. The result I want is

ID	A	B	C
0	1	0	0
1	1	0	1
2	0	1	1

I know it can be easily codded. I just want to know if this function is already implemented in some package. Code it in python will probably result in a quite slow function.

(i needed to put the tables in code fields because stackoverflow was not allowing me to post it as tables)

>Solution :

You can use str.join combined with str.get_dummies:

out = df[['ID']].join(df['CATEGORIES'].str.join('|').str.get_dummies())

Output:

   ID  A  B  C
0   0  1  0  0
1   1  1  0  1
2   2  0  1  1

used input:

df = pd.DataFrame({'ID': [0, 1, 2],
                   'CATEGORIES': [['A'], ['A', 'C'], ['B', 'C']]})

There are many other alternatives, using pivot, crosstab, etc.

One example:

df2 = df.explode('CATEGORIES')

out = pd.crosstab(df2['ID'], df2['CATEGORIES']).reset_index()

one-hot-encoding

byMR

Published November 14, 2022

Add a comment

Why does else function run even the elif statement is true?

byMR

November 14, 2022

Questions

Postgres escape double quotes

byMR

November 14, 2022

Questions

Filter null values from a Map in JavaScript

byMR

November 14, 2022

Questions

How to import data from a url to pandas dataframe?

byMR

November 14, 2022

Questions

Hotchocolate logging errors with a scoped service

byMR

November 14, 2022

Questions

Is there any way match two different csv files with similar columns in python?

byMR

November 14, 2022

How to apply onehot encoder over vectorized dataframe columns?

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

Why does else function run even the elif statement is true?

Postgres escape double quotes

Filter null values from a Map in JavaScript

How to import data from a url to pandas dataframe?

Hotchocolate logging errors with a scoped service

Is there any way match two different csv files with similar columns in python?

Keep Up to Date with the Most Important News

How to apply onehot encoder over vectorized dataframe columns?

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

Why does else function run even the elif statement is true?

Postgres escape double quotes

Filter null values from a Map in JavaScript

How to import data from a url to pandas dataframe?

Hotchocolate logging errors with a scoped service

Is there any way match two different csv files with similar columns in python?

Discover more from Dev solutions