How to use two columns as a single condition to get results in PySpark

I have:

+-----------+------+
|ColA       |ColB  |
+-----------+------+
|       A   |     B|
|       A   |     D|
|       C   |     U|
|       B   |     B|
|       A   |     B|
+-----------+------+

and I want to get:

+-----------+------+
|ColA       |ColB  |
+-----------+------+
|       A   |     D|
|       C   |     U|
|       B   |     B|
+-----------+------+

I want to "remove" all rows with the combination of "colA == A and colB == B".
When I tried this SQL Statement

SELECT * FROM table where (colA != 'A' and colB != 'B')

it worked fine.

But when I try to translate it to Spark (or even to pandas), I get an error:

Py4JError: An error occurred while calling o109.and. Trace:…

#spark
sparkDF.where((sparkDF['colA'] != 'A' & sparkDF['colB'] != 'B')).show() 

#pandas
pandasDF[(pandasDF["colA"]!="A" & pandasDF["colB"]!="B")]

What am I doing wrong here?

Solution:

You need to add parentheses around each comparison, because & and | bind more tightly than != in Python, and use | (bitwise OR) instead of &: dropping rows where colA == 'A' AND colB == 'B' is the same as keeping rows where colA != 'A' OR colB != 'B' (De Morgan's law):

pandasDF[(pandasDF["colA"]!="A") | (pandasDF["colB"]!="B")]

sparkDF.where((sparkDF['colA'] != 'A') | (sparkDF['colB'] != 'B')).show() 
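
An equivalent way to write the same filter, which stays closer to the wording of the question ("remove rows where colA == 'A' and colB == 'B'"), is to negate the combined condition with ~. The following is a minimal, self-contained sketch that assumes a local SparkSession and rebuilds the sample data from the question:

import pandas as pd
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# Rebuild the sample data from the question
pandasDF = pd.DataFrame({"colA": ["A", "A", "C", "B", "A"],
                         "colB": ["B", "D", "U", "B", "B"]})
sparkDF = spark.createDataFrame(pandasDF)

# Keep every row except the (colA == 'A', colB == 'B') combination
print(pandasDF[~((pandasDF["colA"] == "A") & (pandasDF["colB"] == "B"))])
sparkDF.where(~((sparkDF["colA"] == "A") & (sparkDF["colB"] == "B"))).show()

If you prefer the SQL form, the same condition can also be run inside Spark by registering a temporary view (again a sketch, using the sparkDF built above):

# Register a temporary view and express the filter as NOT (... AND ...)
sparkDF.createOrReplaceTempView("t")
spark.sql("SELECT * FROM t WHERE NOT (colA = 'A' AND colB = 'B')").show()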