Questions tagged [opencl]

OpenCL is a C-based API for writing parallel programs intended to be executed on a compute device, typically (though not exclusively) a graphics card or an enterprise compute card.

1 vote
1 answer
169 views

I am using OpenCV’s findChessboardCornersSB function to find the corners in the image. I am using the code/header file below: ...
Dushyant Patil
3 votes
0 answers
92 views

I am trying to compute a matrix-matrix product of N stacked complex double N x N matrices. For simplicity, I assume N = 512. I have written code in C++ parallelized with OMP and using OpenBLAS for the ...
sqrt6
  • 31
1 vote
1 answer
195 views

A classic pointer-jumping algorithm that sums the values of an array, adapted to run on a GPU instead of a PRAM. I read about OpenCL and Aparapi for the first time yesterday, so this is my first attempt: ...
morgwai
  • 389
1 vote
1 answer
413 views

I have been asked to port the following piece of code to kernel code. I have tried, but I am not sure about the std::array. This is the original code for the ...
user247399
1 vote
1 answer
187 views

Yesterday I started following the hands-on OpenCL course. I have now reached the point where we are asked to reimplement an approximation algorithm for Pi in OpenCL (in steps, up to a vectorized ...
Koekje
  • 1,062
4 votes
1 answer
567 views

This code review request follows my previous request Computational verification of Collatz conjecture. Unlike the previous program (which was designed for the CPU), this code should run on modern GPUs....
DaBler
  • 309
4 votes
1 answer
191 views

I am learning how to program in OpenCL, and I made a simple program that takes an array and adds 1 to every element. I want to run it many times (this is just so that I can benchmark how it does ...
3 votes
0 answers
109 views

I am learning Theano to accelerate my image processing functions. As a start, I am trying to reimplement the function that converts images from color to black and white (with the same number of channels): ...
hyamanieu
  • 131
13 votes
1 answer
1k views

I am using this code for MJPEG decoding and I am trying to make two functions (IQZZ and IDCT) run faster on the GPU (NVIDIA Tesla K20c). I am using the OpenCL framework to accomplish this task. I ...
a_sid
  • 435
4 votes
1 answer
161 views

I am computing a nested loop operation using OpenCL (open computing language). My main question is, given the code outlined below, how might I optimize the speed and efficiency of using the GPU, for ...
dermen
  • 353
1 vote
1 answer
682 views

I don't know if I'm doing this correctly, using OpenCL and SFML together, but I know a little about both, so I decided to make something with them. I've already tried implementing a pure C++ and SFML ...
olaralex
  • 165
4 votes
1 answer
1k views

So I'd like to get a performance boost above and beyond standard Julia matrix-vector multiply, using my Intel HD Graphics 4000 1536 MB GPU, but I can't do better than an order of magnitude worse ...
Set
  • 372
7 votes
2 answers
280 views

In my OpenCL kernel, I have the following logic: ...
Xirema
  • 247
5 votes
1 answer
1k views

I've implemented a two-step convolution in OpenCL running on GPUs. The convolution is applied to a series of 1480x1552 matrices. All matrices are pre-loaded and are stored in the ...
Daniel Becker
9 votes
1 answer
740 views

This is my first relevant code in OpenCL. Please let me know if I'm doing something wrong at a logical level (i.e., I guess it can crash in some situations?) or in terms of performance (bottlenecks, ...
MaiaVictor
  • 1,101
