Using CLMUL instruction

tkoolen · May 5, 2019, 6:01pm

I don’t have much experience with this, so I was wondering: why is the last argument an Int32? Looking at PCLMULQDQ — Carry-Less Multiplication Quadword, I would have expected a 8 bytes, and that does seem to work as well:

const m128 = NTuple{2,VecElement{Int64}}

function carrylessmul(a::m128, b::m128)
    ccall("llvm.x86.pclmulqdq", llvmcall, m128, (m128, m128, UInt8), a, b, 0)
end

julia> @code_native carrylessmul(m128((1, 2)), m128((3, 4)))
	.section	__TEXT,__text,regular,pure_instructions
; ┌ @ REPL[8]:2 within `carrylessmul'
	vpclmulqdq	$0, %xmm1, %xmm0, %xmm0
	retl
	nopw	(%eax,%eax)
; └

Topic		Replies	Views
Issue using llvm General Usage	5	251	March 9, 2024
Functions for low-level arithmetic Internals & Design	5	1150	March 29, 2017
Julia equivalent of C compiler intrinsics? General Usage	23	3261	November 8, 2018
Calling AVX-512 intrinsics from Julia General Usage bit-twiddling	6	843	July 4, 2023
C routine uses AVX intrinsics General Usage interoperability , c	15	1582	September 26, 2022

Using CLMUL instruction

Related topics