| KernelForge.jl — High-performance portable GPU primitives for arbitrary types and operators | 11 | 999 | March 22, 2026 |
| [ANN] JACC.jl v1.0 now available for 100% portable CPU/GPU code | 4 | 425 | March 17, 2026 |
| [ANN] cuTile.jl: Tile-based GPU programming for CUDA GPUs | 4 | 338 | March 4, 2026 |
| [ANN] AcceleratedKernels.jl - Cross-architecture parallel algorithms for Julia's GPU backends | 17 | 1818 | March 3, 2026 |
| Multi-GPU inference in Flux.jl | 2 | 127 | January 24, 2026 |
| Argmax mapreduce on GPU | 5 | 221 | January 12, 2026 |
| Cartesian Indices Sequence on the GPU | 0 | 68 | January 12, 2026 |
| Using Interpolations.jl on CuVector | 6 | 1917 | January 11, 2026 |
| Feedback wanted: GPU-accelerated 2D elastic wave simulation (staggered-grid FD) in Julia | 10 | 350 | January 10, 2026 |
| GPU support for Turing modeling with system of ODEs | 8 | 1073 | January 5, 2026 |
| Large ODE Solver for Metal.jl | 11 | 303 | December 29, 2025 |
| Failed to precompile CUDA | 14 | 151 | December 16, 2025 |
| Block/Tile-Based GPU Programming (not Scratch) | 3 | 642 | December 8, 2025 |
| [San Francisco, CA] Performance Engineer - GPU Atmospheric Modeling | 2 | 403 | November 14, 2025 |
| [ANN] Raycore.jl: High-Performance Ray Tracing for CPU and GPU | 11 | 1025 | November 13, 2025 |
| Array addition of oneAPI.jl slower | 10 | 270 | November 10, 2025 |
| Batched Matrix Multiply | 12 | 4013 | October 30, 2025 |
| Improving performance of CUDA GPU kernel: LU factorization | 17 | 636 | October 28, 2025 |
| Postdoc offer: graph algorithms on GPU with Julia | 0 | 406 | October 27, 2025 |
| Error in oneAPI.jl tests | 3 | 137 | October 18, 2025 |
| SciMLSensitivity fails on GPU? | 5 | 103 | September 22, 2025 |
| Custom (NumPy style) broadcasting rule that avoids iterating over elements (for GPU-acceleration) | 10 | 441 | August 24, 2025 |
| Using getrf_batched to find matrix inverses | 2 | 110 | August 7, 2025 |
| Bend: a new GPU-native language | 45 | 13113 | August 6, 2025 |
| Sparse matrix multiplication for Metal | 15 | 541 | July 31, 2025 |
| 😤 Multi-line expressions aren't fully computed | 22 | 567 | July 11, 2025 |
| PackageCompiler fails to create app for MadNLPGPU + ExaModels (CUDSS linear solver) | 11 | 445 | June 30, 2025 |
| How to Manage Memory with Sequential, GPU-Intensive (e.g., PyTorch) Python Calls via PythonCall.jl | 0 | 84 | June 17, 2025 |
| Julia (AcceleratedKernels) vs JAX time comparison | 21 | 1280 | June 11, 2025 |
| GPU/CPU Agnostic FFT code | 7 | 573 | June 10, 2025 |