KrylovKit, syntax issue

Enlil50 · February 20, 2024, 10:36am

how can i pick the absolute value of the number inside the matrix? if I use this tolerance even negative number are suppressed, but if I write abs(A) it throws an error.

Additionally: is droptol! considering just the absolute value or is he destroying negative numbers too?

abraemer · February 20, 2024, 10:39am

Oh sorry I forgot the modulus You just need to broadcast it:

A .= ifelse.(abs.(A) .< tol, 0, A)

And droptol! takes the absolute of course

Enlil50 · February 20, 2024, 4:06pm

Another question, just to push ths to its limit: is this going to overwrite existing data with the data itself, when the condition is met, or is it going to do nothing? instead of doing 2 loops to retrieve the index and say in which index I’m putting a 0, is there some already predefined syntax, just like the one you used, to explicitely say that in the case when abs.(A) > tol, it has to do nothing?

abraemer · February 20, 2024, 9:16pm

Yes this will overwrite existing data with itself but this is likely faster than conditionally skipping the overwrite due to how modern CPUs work

Enlil50 · February 20, 2024, 9:17pm

interested: why so? (genuine and curious question)

abraemer · February 20, 2024, 9:42pm

This is a bit difficult to explain but I couldn’t find a good explanation quickly so I will try my best. See modern CPUs don’t really start executing an instruction, wait for it to finish and then start executing the next one. Usually they overlap multiple instructions. Note that I am talking about a single CPU core - multiple cores ofc work in parallel as well but even a single core will overlap instructions. This is because many instructions take multiple cycles to complete (e.g. a float division can take upto ~20 clock cycles) so the core can do something else in the meantime. So even if you read some assembly code, you only get a vague idea how the CPU will actually execute the code because it will likely reorder instructions, execute them out of order, optimize loads/stores etc. in order to maximize throughput.

So why is skipping the store to the array sometimes likely slower? The first part is that a conditional is SLOW. Thw CPU has to wait for the result pf the comparison and then decide wether to store or not to store. Actually it won’t wait, it will try to predict the branch and just go ahead and do something until the result of the comparison arives. If the branch was predicted right, then everything is fine. If it was predict wrong, then the CPU has to undo its work and take the correct branch, which is SLOW. If you use ifelse (actually in most cases you could also use the ternary operator ?: because the compiler is smart - however you can’t broadcast it so ifelse needs to be used in our case) the compiler can simply compute both results and use a conditional move instruction cmov to select the correct result without a branch! This is great and always fast
It actually has another huge benefit in that it can be vectorized with special SIMD instructions. That means instead of operating on single numbers your CPU can operator on a whole set of number at once (I think upto 8 Float64 - how many depends on your CPU). This only works because we do the exactly same thing for every number (conditional moving is fine - branching destroys SIMD).

Enlil50 · February 20, 2024, 10:26pm

2 hearts reaction not enough! What a crazy mess out there, beautiful!
Thanks a ton

lkdvos · February 21, 2024, 7:58am

Maybe as a final note here, if you are really using this to implement IDMRG, there are already quite a few packages out there that do so:

MPSKit.jl
ITensors.jl
or even, if you only need finite size systems:
FiniteMPS.jl

There might even be more that I don’t know of, or can’t think of right now. These all wrap KrylovKit in very nice ways, and many people have already spent quite some time optimizing these codes, and have many nice additional features.

Enlil50 · February 21, 2024, 8:25am

Thanks

Topic		Replies	Views
Krylov based lanczos etc for sparse matrices General Usage question , sparse	2	329	January 26, 2024
Krylovkit tolerance changes results drastically General Usage sparse , matrices , eigenvalues , eigenvectors , sparsearrays	0	168	February 20, 2024
Trouble with passing arguments to KrylovKit.eigsolve General Usage	1	609	February 10, 2021
Smallest magnitude sparse generalized eigenvalues General Usage	13	241	April 17, 2025
Extracting internally computed array from `KrylovKit.eigsolve` (without forking) General Usage	6	36	June 24, 2025

KrylovKit, syntax issue

Related topics