Home Dataframe: create new columns and assign its values from existing column's values

Questions

Dataframe: create new columns and assign its values from existing column's values

April 18, 2024

I have a Dataset downloaded from Kaggle for my project, I would like to seek help in creating new columns and assigning its values based on an existing column.

My actual Dataset is complicated, I will give a similar but simpler dataset for easy discussion.

Input:

Month  | Fruit  | Weight
------- -------- --------
1-2020 | Orange | 0.2
1-2020 | Kiwi   | 0.9
2-2020 | Orange | 2.1
2-2020 | Kiwi   | 1.4
...... | .....  | ...

To be able to create a required line chart, I need to change this Dataset structure, making "Orange", "Kiwi" new columns with Weight values, so that "Month' is not repeated.

Desired output:

    Month  | Orange| Kiwi
    ------- -------- --------
    1-2020 | 0.2   | 0.9
    2-2020 | 2.1   | 1.4

Any help would be greatly appreciated

>Solution :

The basic thing you’re looking for is a pivot:

import pandas as pd

data = {
    'Month': ['1-2020', '1-2020', '2-2020', '2-2020'],
    'Fruit': ['Orange', 'Kiwi', 'Orange', 'Kiwi'],
    'Weight': [0.2, 0.9, 2.1, 1.4]
}
df = pd.DataFrame(data)

pivot_df = df.pivot_table(index='Month', columns='Fruit', values='Weight')

print(pivot_df)

Output:

Fruit   Kiwi  Orange
Month               
1-2020   0.9     0.2
2-2020   1.4     2.1

If you want Month as a column instead of the index:

pivot_df.reset_index(inplace=True)
pivot_df.columns.name = None

print(pivot_df)

Output:

    Month  Kiwi  Orange
0  1-2020   0.9     0.2
1  2-2020   1.4     2.1

If there are duplicate combinations of Month and Fruit in the original data, and you want to sum them (for example):

data = {
    'Month': ['1-2020', '1-2020', '1-2020', '2-2020', '2-2020'],
    'Fruit': ['Orange', 'Kiwi', 'Kiwi', 'Orange', 'Kiwi'],
    'Weight': [0.2, 0.9, 1.1, 2.1, 1.4]
}
df = pd.DataFrame(data)

pivot_df = df.pivot_table(index='Month', columns='Fruit', values='Weight', aggfunc='sum')
print(pivot_df)

Output:

Fruit   Kiwi  Orange
Month               
1-2020   2.0     0.2
2-2020   1.4     2.1

dataframe

byMR

Published April 18, 2024

Add a comment

Is it bad practice to return on every iteration of a loop

byMR

April 18, 2024

Questions

downcasting a double/float to a uint8_t

byMR

April 18, 2024

Questions

How to concatenate words with a separator based on values in other column?

byMR

April 18, 2024

Questions

how validate allow characters & special character but not allow number with character or number in kotlin

byMR

April 18, 2024

Questions

removing delimiters for nulls in concataned string oracle

byMR

April 18, 2024

Questions

Python: How to calculate a local variable in function_1 and send the result to function_2 without running function_1 every time function_2 is called?

byMR

April 18, 2024

Dataframe: create new columns and assign its values from existing column's values

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

Is it bad practice to return on every iteration of a loop

downcasting a double/float to a uint8_t

How to concatenate words with a separator based on values in other column?

how validate allow characters & special character but not allow number with character or number in kotlin

removing delimiters for nulls in concataned string oracle

Python: How to calculate a local variable in function_1 and send the result to function_2 without running function_1 every time function_2 is called?

Keep Up to Date with the Most Important News

Dataframe: create new columns and assign its values from existing column's values

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

Is it bad practice to return on every iteration of a loop

downcasting a double/float to a uint8_t

How to concatenate words with a separator based on values in other column?

how validate allow characters & special character but not allow number with character or number in kotlin

removing delimiters for nulls in concataned string oracle

Python: How to calculate a local variable in function_1 and send the result to function_2 without running function_1 every time function_2 is called?

Discover more from Dev solutions