Home Make dummy variable for categorical data, based on ID column with duplicate values in python

Questions

Make dummy variable for categorical data, based on ID column with duplicate values in python

March 23, 2022

I have the following pandas dataframe:

    ID    value
0   1     A
1   1     B
2   1     C
3   2     B
4   10    C
5   4     C
6   4     A

I want to make dummy variables for the values in the column ‘value’, for each value in the column ‘ID’. So I want it this:

    ID    A    B    C
0   1     1    1    1
1   2     0    1    0
2   10    0    0    1
3   4     1    0    1

How can I do this in python?

>Solution :

Use crosstab with limit counts to 1 by DataFrame.clip:

df1  = (pd.crosstab(df['ID'], df['value'])
          .clip(upper=1)
          .reset_index()
          .rename_axis(None, axis=1))
print (df1)
   ID  A  B  C
0   1  1  1  1
1   2  0  1  0
2   4  1  0  1
3  10  0  0  1

pandas

byMR

Published March 23, 2022

Add a comment

illegal_argument_exception error elasticsearch

byMR

March 23, 2022

Questions

JQuery copy and paste in real time a input value into another

byMR

March 23, 2022

Questions

Linux signal: process exit if there's no handler

byMR

March 23, 2022

Questions

Searching specific character in file in powershell not working

byMR

March 23, 2022

Questions

How to count Y/N answers in Python?

byMR

March 23, 2022

Questions

How to convert CSV to nested JSON in Python

byMR

March 23, 2022

Make dummy variable for categorical data, based on ID column with duplicate values in python

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

illegal_argument_exception error elasticsearch

JQuery copy and paste in real time a input value into another

Linux signal: process exit if there's no handler

Searching specific character in file in powershell not working

How to count Y/N answers in Python?

How to convert CSV to nested JSON in Python

Keep Up to Date with the Most Important News

Make dummy variable for categorical data, based on ID column with duplicate values in python

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

illegal_argument_exception error elasticsearch

JQuery copy and paste in real time a input value into another

Linux signal: process exit if there's no handler

Searching specific character in file in powershell not working

How to count Y/N answers in Python?

How to convert CSV to nested JSON in Python

Discover more from Dev solutions