Pandas split list upon DataFrame creation

byMR

July 5, 2022

I have a JSON file coming in, which I am doing some operations/trimming on.

The result looks like this:

print("User:", user)
> User: {'id': 1, 'label': 'female', 'position': {'lat': 47.72485566, 'lon': 10.32219439}, 'confidence': 0.8}

When applying df = pd.DataFrame(user, index=[0]) I get the following DataFrame:

     id   label    position  confidence
0    1    female   NaN       0.8

When applying df = pd.DataFrame(user) I get:

      id   label    position     confidence
lat   1    female   47.72485566  0.8
lon   1    female   10.32219439  0.8

I understand why that happens, but neither result is what I want.

I’d like the following:

     id   label    lat          lon           confidence
0    1    female   47.72485566  10.32219439   0.8

However, I am not sure of the best way to split the position field into separate columns.

>Solution:

You can use pandas.json_normalize to flatten the nested dict, then rename the columns:

>>> import pandas as pd
>>> df = pd.json_normalize({'id': 1, 'label': 'female', 'position': {'lat': 47.72485566, 'lon': 10.32219439}, 'confidence': 0.8})
>>> df = df.rename(columns={'position.lat': 'latitude', 'position.lon': 'longitude'})

OUTPUT

   id   label  confidence   latitude  longitude
0   1  female         0.8  47.724856  10.322194
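
If you want the short column names and the column order shown in the question, a minimal sketch along these lines should work. It assumes `user` is the dict printed in the question; the explicit column list in the last step is an illustrative choice, not something pandas requires.

```python
import pandas as pd

# The dict from the question (assumed to be the already-trimmed JSON).
user = {
    "id": 1,
    "label": "female",
    "position": {"lat": 47.72485566, "lon": 10.32219439},
    "confidence": 0.8,
}

# Flatten the nested dict; nested keys become "position.lat" and "position.lon".
df = pd.json_normalize(user)

# Rename the flattened columns to the short names used in the question.
df = df.rename(columns={"position.lat": "lat", "position.lon": "lon"})

# Reorder the columns to match the layout asked for in the question.
df = df[["id", "label", "lat", "lon", "confidence"]]

print(df)
```

Selecting the columns by name is just one way to fix the order; if the order does not matter, you can drop that step.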
