Use PySpark to run a custom-defined Python function
I wrote a custom function that finds the words that occur most often in a .txt file. I need to run it through PySpark as an RDD.
I wrote a function called top_five whose only parameter is file_name.
...
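For context, here is a rough sketch of how a function like this is often expressed with RDD operations. The body of top_five is not shown above, so everything here is an assumption: the tokenization rule, the helper names tokenize, top_five_local, top_five_rdd, and main, and the choice to count case-insensitively are all hypothetical. Only textFile, flatMap, map, reduceByKey, and takeOrdered are standard RDD calls.

```python
import re
from collections import Counter
from operator import add


def tokenize(line):
    """Split a line into lowercase words (assumed rule: letters and apostrophes)."""
    return re.findall(r"[a-z']+", line.lower())


def top_five_local(lines):
    """Plain-Python reference: the five most common words in an iterable of lines."""
    counts = Counter(word for line in lines for word in tokenize(line))
    return counts.most_common(5)


def top_five_rdd(sc, file_name):
    """The same count as RDD transformations; sc is an existing SparkContext."""
    return (
        sc.textFile(file_name)               # one RDD element per line of the file
        .flatMap(tokenize)                   # one RDD element per word
        .map(lambda word: (word, 1))         # pair each word with a count of 1
        .reduceByKey(add)                    # sum the counts for each word
        .takeOrdered(5, key=lambda pair: -pair[1])  # five largest counts
    )


def main(file_name):
    """Hypothetical driver: start a local SparkContext and print the result."""
    from pyspark import SparkContext  # imported here so the helpers work without Spark

    sc = SparkContext("local[*]", "top_five")
    try:
        print(top_five_rdd(sc, file_name))
    finally:
        sc.stop()
```

The point of the sketch is that the file is read with sc.textFile rather than with open() inside the function, so the counting itself runs as RDD transformations. Calling main with the path to the .txt file would run it locally, assuming PySpark is installed.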