Questions tagged [opencl]
OpenCL is a C-based API for writing parallel programming intended to be executed on a compute device, typically (though not limited to) graphics cards and enterprise compute cards.
16 questions
1
vote
1
answer
169
views
Corner detect takes long time to run
I am using OpenCV’s findchessboardcornersSB function to find the corners in the image. I am using below code/ header file:
...
3
votes
0
answers
92
views
Accelerating this stacked matrix-matrix product in OpenCL
I am trying to compute a matrix-matrix product of N stacked complex double N x N matrices. For simplicity, I assume N = 512. I have written code in C++ parallelized with OMP and using OpenBLAS for the ...
1
vote
1
answer
195
views
pointer jumping on a GPU using Aparapi
Classic pointer jumping algorithm summarizing values from an array adapted to run on a GPU instead of a PRAM.
Read about openCL and Aparapi yesterday for the first time, so this is my first trial: ...
1
vote
1
answer
413
views
Implement 2D and 1D std::array in opencl kernel
I am asked to implement the following part of code into kernel code. Actually, I have tried but not sure about the std::array.
This is the original code for the ...
1
vote
1
answer
187
views
OpenCL approximation of Pi using vectorized implementation
Yesterday I started following the hands-on OpenCL course. I now got to the point where we are requested to reimplement an approximation algorithm for Pi in OpenCL (in steps, up to a vectorized ...
4
votes
1
answer
567
views
Computational verification of Collatz conjecture using OpenCL
This code review request follows my previous request Computational verification of Collatz conjecture.
Unlike the previous program (which was designed for the CPU), this code should run on modern GPUs....
4
votes
1
answer
191
views
Repeatedly incrementing array elements using OpenCL
I am learning how to program in opencl and I made a simple program that simply takes an array and adds 1 to every element. I want to run it many times (this is just so that i can benchmark how it does ...
3
votes
0
answers
109
views
Numpy versus Theano GPU parallelization
I am learning Theano to accelerate my image processing functions. As a start, I am trying to reimplement the function to turn images from colors to black and white (with the same number of channels): ...
13
votes
1
answer
1k
views
OpenCL implementations of IQZZ and IDCT for MJPEG
I am using this code for MJPEG decoding and I am trying to make two functions (IQZZ and IDCT) run faster on the GPU (NVIDIA Tesla k20c). I am using the OpenCL framework to accomplish this task.
I ...
4
votes
1
answer
161
views
Computing the Fourier transform of a distribution of electron charge (represented by atomic positions in space)
I am computing a nested loop operation using OpenCL (open computing language). My main question is, given the code outlined below, how might I optimize the speed and efficiency of using the GPU, for ...
1
vote
1
answer
682
views
Mandelbrot Set with OpenCL and SFML [closed]
I don't know if I'm doing this correctly, using OpenCL and SFML together but I know a little about both so I decided to make something with them. I've already tried implementing a pure C++ and SFML ...
4
votes
1
answer
1k
views
Speeding up OpenCL matrix-vector multiplication
So I'd like to get a performance boost above and beyond standard Julia matrix-vector multiply, using my Intel HD Graphics 4000 1536 MB GPU, but I can't do better than an order of magnitude worse ...
7
votes
2
answers
280
views
Generating permutation of indices from seed ID
In my OpenCL kernel, I have the following logic:
...
5
votes
1
answer
1k
views
Two-step OpenCL convolution for series of matrices
I've implemented a two-step convolution in OpenCL running on GPUs. The convolution is applied to a series of 1480x1552 matrices. All matrices are pre-loaded and are stored in the ...
9
votes
1
answer
740
views
Parallel accumulated sum in OpenCL
This is my first relevant code in OpenCL. Please, let me know if I'm doing something wrong in a logical level (i.e., I guess it can crash in situations?) or accounting for performance (bottlenecks, ...