A high-performance, zero-overhead, extensible Python compiler with built-in NumPy support
-
Updated
Apr 22, 2025 - Python
A high-performance, zero-overhead, extensible Python compiler with built-in NumPy support
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
Computations and statistics on manifolds with geometric structures.
Implementation of a Transformer, but completely in Triton
Fast deterministic all-Python Lennard-Jones particle simulator that utilizes Numba for GPU-accelerated computation.
Implementation of the Apriori and Eclat algorithms, two of the best-known basic algorithms for mining frequent item sets in a set of transactions, implementation in Python.
Boilerplate for GPU-Accelerated TensorFlow and PyTorch code on M1 Macbook
🌟 Vertex Centric approach for building GNN/TGNNs
pyCUDA implementation of forward propagation for Convolutional Neural Networks
Fundamentals of heterogeneous parallel programming with CUDA C/C++ at the beginner level.
bilibili视频【CUDA 12.x 并行编程入门(Python版)】配套代码
vgg16 inference implementation using tensorflow, numpy and pycuda
A package to run commands when GPU resources are available
A helper package to easily time Numba CUDA GPU events ⌛
Simplify GPU Setup: Drivers, CUDA, Frameworks, and more!
A Bifrost plug-in for the Tensor-Core Correlator.
GPU programming using CUDA & Python
Project for the Parallel and Concurrent Programming course 2023/2024
An attempt...
CUDA accelerated raytracer using PyCUDA in Python
Add a description, image, and links to the gpu-programming topic page so that developers can more easily learn about it.
To associate your repository with the gpu-programming topic, visit your repo's landing page and select "manage topics."