Skip to main content

All Questions

1 vote
2 answers
443 views

Use pyspark to run a custom defined function from python

I wrote a custom function that will find the most word occurrences from a .txt file. I need to run it through PySpark as an RDD I wrote a function called top_five whos only parameter is file_name ...
user11193431's user avatar