PackageCompiler fails to create app for MadNLPGPU + ExaModels (CUDSS linear solver)
|
|
4
|
52
|
February 21, 2025
|
[ANN] Introducing AlternateVectors.jl - A Library for Peculiar One-Dimensional Array Patterns
|
|
0
|
169
|
February 8, 2025
|
Any updates on using AMDGPU in WSL?
|
|
8
|
120
|
February 6, 2025
|
FFTW scales pretty well (some @btime benchmarks)
|
|
1
|
1685
|
February 4, 2025
|
How to develop code in Vulkan using Julia?
|
|
1
|
131
|
February 1, 2025
|
Batched Matrix Multiply
|
|
11
|
3561
|
January 31, 2025
|
Does the new LLVM SPIR-V backend help Julia in any way?
|
|
2
|
252
|
January 28, 2025
|
Lux, optimization on gpu
|
|
8
|
246
|
January 13, 2025
|
Broadcasting performance
|
|
13
|
533
|
January 6, 2025
|
CUDA async is not working properly
|
|
4
|
140
|
December 31, 2024
|
Cumulative sum on GPUArray using KernelAbstractions
|
|
4
|
129
|
December 24, 2024
|
Can I move an array asynchronously from main program to CUDA?
|
|
7
|
178
|
December 15, 2024
|
Symmetric view of sparse matrix CUDA.jl
|
|
0
|
34
|
December 13, 2024
|
Is sharedmemory really accelerates GPU kernel?
|
|
1
|
81
|
December 2, 2024
|
How to improve the performance of CUDA kernel function which loop on a large struct array
|
|
4
|
144
|
November 28, 2024
|
How do we compute the gradient and Laplacian of a neural network using GPU?
|
|
9
|
247
|
November 19, 2024
|
GPU Julia vs GPU Matlab
|
|
61
|
937
|
November 18, 2024
|
CUDA Error : ArgumentError: Objects are on devices with different types: CPUDevice and CUDADevice
|
|
4
|
38
|
October 23, 2024
|
Scalar indexing is disallowed - ODE solve using GPU
|
|
2
|
71
|
October 23, 2024
|
[ANN] AcceleratedKernels.jl - Cross-architecture parallel algorithms for Julia's GPU backends
|
|
16
|
1190
|
September 27, 2024
|
Why fft with MEASURE plan 10x slower than calling fft directly with CUDA.CUFFT?
|
|
7
|
158
|
September 22, 2024
|
JUHPC: HPC setup for Juliaup, Julia and some HPC key packages
|
|
0
|
419
|
September 18, 2024
|
Improving GPU performance for symbolic regression
|
|
14
|
963
|
September 12, 2024
|
Clever design for basis arrays
|
|
3
|
130
|
September 6, 2024
|
Testing GPU compatability in CI
|
|
2
|
71
|
September 4, 2024
|
[ANN] WaterLily.jl: A differentiable fluid simulator with fast heterogeneous execution
|
|
9
|
1855
|
August 29, 2024
|
Why Random.jl is fixed to version 0.0.0?
|
|
8
|
660
|
August 26, 2024
|
Synchronize streams in CUDA.jl
|
|
11
|
302
|
August 23, 2024
|
Putting obj files on the GPU with Metal.jl
|
|
0
|
45
|
August 20, 2024
|
Parallelize differential equation solve with interpolated forcing function
|
|
0
|
31
|
August 14, 2024
|