Skip to main content

Questions tagged [mapreduce]

MapReduce is an algorithm for processing huge datasets on certain kinds of distributable problems using a large number of nodes

4 votes
3 answers
327 views

Problem Statement: Given two sparse vectors, compute their dot product. Implement class SparseVector: SparseVector(nums) Initializes the object with the vector nums dotProduct(vec) Compute the dot ...
Pankaj Kumar's user avatar
4 votes
0 answers
478 views

For context, the whole of the project code can be found here. This question was created specifically for the progress.py file. The goal behind it is to allow ...
gabe appleton's user avatar
4 votes
2 answers
78 views

I have this array of objects: ...
Katherine Maurus's user avatar
0 votes
2 answers
5k views

I have the object with values. I trying to filter based on values. ...
RSKMR's user avatar
  • 101
2 votes
1 answer
230 views

I have a transformer helper function. It reduces over the array and transform key/value pairs. At the end of the loop there is the key 'EXAMPLE1' exists and I should insert two objects after the first ...
user215631's user avatar
2 votes
1 answer
94 views

Problem Given the following data: ...
Dominic Campos's user avatar
1 vote
2 answers
213 views

The task is for training go-lang. The idea is to extract unique words sorted and grouped by length. Might be useful in learning new words. The program uses command line argument assuming it's a file ...
triclosan's user avatar
  • 244
2 votes
1 answer
95 views

I have a bidimensional array like this: ...
robe007's user avatar
  • 123
2 votes
1 answer
104 views

When building objects using reduce, I often have crappy code like this: ...
Sophie McCarrell's user avatar
2 votes
0 answers
345 views

Jenkins DSL doesn't support collect and inject from what I can tell (I get missing method exceptions when I try), so I ...
RubberDuck's user avatar
  • 31.2k
3 votes
2 answers
835 views

This function takes a list of questions and list of answers provided by the user. The list of answers is always a list of booleans (for true and false) and the list of questions takes the following ...
Calin Leafshade's user avatar
6 votes
0 answers
100 views

I've written a Pyspark program that will completely solve a tiered board game (no loops, each game position is a member of only one tier) and writes each tier to a file. It also determines the ...
Michael's user avatar
  • 61
8 votes
3 answers
1k views

I have a few of files, and total size of them is more than 5 GB. Each line of the files is a IP address, looks like: 127.0.0.1 reset success ... 127.0.0.2 reset success how can i find Top10 ...
Cherlex's user avatar
  • 145
3 votes
1 answer
133 views

I am new to Spark and Scala and I have solved the following problem. I have a table in database with following structure: ...
Shams Tabraiz Alam's user avatar
5 votes
2 answers
670 views

I am implementing map and reduce - style functions for processing geospatial raster datasets. I would like the ...
jramm's user avatar
  • 133

15 30 50 per page