PySpark table to pandas DataFrame

I have an object of type <class 'pyspark.sql.dataframe.DataFrame'> and I want to convert it to a pandas DataFrame. But the dataset is too big and I only need some columns, so I selected the ones I want with the following:

df = spark.table("sandbox.zitrhr023")
columns= ['X', 'Y', 'Z', 'etc']

and then:

df_new = df.select(*columns).show()

but it returns a NoneType object. When I try the following:


df_new = df_new.toPandas()

It gives the following error:

AttributeError: 'NoneType' object has no attribute 'toPandas'

Do I need to put df_new into a Spark DataFrame before converting it with toPandas()? How do I do that?

Solution:

You are trying to convert it to a pandas DataFrame after calling show(), which prints the DataFrame and returns None. Try the following instead:

df_new = df.select(*columns).toPandas()
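
The pitfall generalizes beyond Spark: any method called for its printing side effect returns None, so nothing can be chained after it. A minimal pure-Python sketch (no Spark required, using a hypothetical FakeDataFrame class for illustration) shows why the original code failed:

```python
class FakeDataFrame:
    """Stand-in for a Spark DataFrame (hypothetical, for illustration only)."""

    def __init__(self, rows):
        self.rows = rows

    def show(self):
        # Prints the rows and implicitly returns None,
        # just like pyspark.sql.DataFrame.show().
        for row in self.rows:
            print(row)

    def select(self, *columns):
        # Returns a new FakeDataFrame, so calls can be chained.
        return FakeDataFrame([{c: r[c] for c in columns} for r in self.rows])


df = FakeDataFrame([{"X": 1, "Y": 2}, {"X": 3, "Y": 4}])

result = df.select("X").show()   # show() prints, then returns None
print(result is None)            # True: there is nothing to call .toPandas() on

kept = df.select("X")            # drop .show() to keep the DataFrame object
print(kept.rows)                 # [{'X': 1}, {'X': 3}]
```

The same rule applies in PySpark: keep the result of select() if you intend to convert it, and call show() separately (and only) when you want to inspect it.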