Help Debugging GPU Performance Issue

Yes, of course, but here the allocs estimate is 283 vs. 39763, where the latter case includes unsafe_free!. The memory estimate also jumps from 8.7 KiB to 2.9 MiB.

I assumed that kernel parameter buffers would be created and destroyed in both cases.