
Getting an unusual error when creating a DataFrame with a string-type schema

by MR

November 30, 2022

I am creating a simple DataFrame:

df=spark.createDataFrame(data=[('11s1 ab')],schema=['str'])

I get this error:

TypeError: Can not infer schema for type: <class 'str'>

However, if I change the statement to:

df=spark.createDataFrame(data=[('11s1 ab',)],schema=['str'])

the DataFrame is created successfully.

I want to understand why the trailing comma inside the data tuple matters to spark.createDataFrame.

Solution:

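In the documentation of createDataFrame, the data parameter is typed as:

data: Union[pyspark.rdd.RDD[Any], Iterable[Any], ForwardRef('PandasDataFrameLike')]

Each element of that iterable is treated as one row. In Python, parentheses alone do not create a tuple; the trailing comma does. (1,) is a tuple and [1] is a list, but (1) is just the integer 1. In the same way, ('11s1 ab') is the plain string '11s1 ab', so [('11s1 ab')] is a list of bare strings rather than a list of rows. Because schema=['str'] only supplies column names, Spark has to infer the column types from each row, and it cannot infer a row schema from a bare str. With ('11s1 ab',) each element is a one-field tuple, which Spark can read as a row with a single string column.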
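The difference can be checked in plain Python, without Spark. The following is a minimal sketch; the variable names (not_a_tuple, one_tuple, bad_data, good_data) are made up for illustration and do not come from the question.

```python
# Parentheses alone only group an expression; the trailing comma makes the tuple.
not_a_tuple = ('11s1 ab')    # just the string '11s1 ab'
one_tuple = ('11s1 ab',)     # a tuple with one element

print(type(not_a_tuple).__name__)  # str
print(type(one_tuple).__name__)    # tuple

# The two data arguments from the question therefore differ in element type.
bad_data = [('11s1 ab')]     # identical to ['11s1 ab']: a list of bare strings
good_data = [('11s1 ab',)]   # a list containing one single-field row

print(bad_data == ['11s1 ab'])            # True
print(isinstance(good_data[0], tuple))    # True
```

Passing good_data together with schema=['str'] corresponds to the working call in the question, while bad_data corresponds to the call that raises the TypeError.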

apache-spark-sql
