Skip to main content
WestCoastProjects's user avatar
WestCoastProjects's user avatar
WestCoastProjects's user avatar
WestCoastProjects
  • Member for 14 years
  • Last seen this week
  • Mountain View, CA
109 votes
Accepted

Is it possible to get the current spark context settings in PySpark?

70 votes
Accepted

Number of partitions in RDD and performance in Spark

67 votes

How are stages split into tasks in Spark?

58 votes
Accepted

How to set Master address for Spark examples from command line

52 votes
Accepted

How to restore default layout for intellij Run/Debug window/tabs?

47 votes

Pythonic way to combine for-loop and if-statement

46 votes
Accepted

Show partitions on a pyspark RDD

41 votes
Accepted

Recursively fetch file contents from subdirectories using sc.textFile

41 votes

How to match a string, but case-insensitively?

30 votes
Accepted

How to install sbt on ubuntu/debian with apt-get

25 votes

Neo4j sharding aspect

23 votes

How to run a single test in scalatest from maven

22 votes
Accepted

How to create a Spark Dataset from an RDD

22 votes

How to check if collection contains any element from other collection in Scala?

21 votes
Accepted

Test accuracy is greater than train accuracy what to do?

21 votes
Accepted

Why is "abstract override" required not "override" alone in subtrait?

20 votes
Accepted

Addition of two RDD[mllib.linalg.Vector]'s

20 votes
Accepted

Json serialization error using matplotlib mpld3 with LinkedBrush

20 votes
Accepted

How to run a Scala script within IntelliJ IDEA?

20 votes
Accepted

Hive command to execute NOT IN clause

19 votes

Hadoop DistCp using wildcards?

19 votes
Accepted

How to convert a Scala Array to ArrayBuffer?

19 votes
Accepted

How to sort an RDD in Scala Spark?

18 votes

Why does Spark fail with java.lang.OutOfMemoryError: GC overhead limit exceeded?

18 votes

What do columns ‘rawPrediction’ and ‘probability’ of DataFrame mean in Spark MLlib?

17 votes

Is there a 'foreach' function in Python 3?

15 votes
Accepted

UnsatisfiedLinkError: no snappyjava in java.library.path when running Spark MLLib Unit test within Intellij

15 votes
Accepted

Spark : check your cluster UI to ensure that workers are registered

14 votes

Multiple table join in hive

13 votes

Spark : how to run spark file from spark shell

1
2 3 4 5
27