Can I make my code faster with parallelism, or just plain better coding?

mauro3 · February 15, 2019, 8:35am

I think you misread @aaowens’s code, this A[i] += abc means “copy abc to location i of vector A”. This works because A is mutable (even though its elements are not). Thus A[i][1] += B[i][1] would not work because A[i][1] is a field of a SVector and thus cannot be updated.

To fill an SVector, see Constructing SVector with a loop - #7 by tkf. But for your example just do SVector(-fdx, -fdy, f*dz).

Topic		Replies	Views
Slower with threads Performance question	26	1171	August 6, 2022
Speeding up force calculations and mutable structs Performance	20	1186	August 5, 2020
Learning to optimize Performance	12	1229	March 15, 2021
Performance improvements for simple molecular surface caclulation Performance	15	635	September 14, 2020
Nbabel nbody integrator speed up Performance question , review	64	3037	August 16, 2021

Can I make my code faster with parallelism, or just plain better coding?

Related topics