[pre-ANN]: CuTensorOperations.jl

juthohaegeman · August 8, 2019, 9:42am

CuTensorOperations.jl provides support for tensor contractions and related operations with CuArray objects, i.e. it makes TensorOperations.jl compatible with CuArray objects.

For now, it merely dispatches the necessary operations to cuTENSOR, a new experimental library from NVIDIA (thanks to the NVIDIA developers involved), and does not provide a Julia/CUDAnative implementation. The basic functionality of cuTENSOR is wrapped in the custom/experimental branch ksh/tensor of CuArrays.jl (thanks to @maleadt and @kslimes).

Given the current state, this is a pre-announcement, and CuTensorOperations.jl is not registered yet. It is more likely to become part of TensorOperations.jl in time.

Check out the README for more info, known (current) limitations and installation instructions. Experiment, and report any issues!

ChrisRackauckas · August 8, 2019, 11:32am

Nicely done! This is a great thing to see added to the ecosystem!

Does it need to be a different macro? It would be better if it were the same macro and just worked via dispatch. That way it could work in generic code.

Also, do you plan on adding differentiation rule overloads for ChainRules.jl? It would be nice for these operations to be compatible with AD.

juthohaegeman · August 8, 2019, 12:37pm

The same macro works for CuArrays. The new macro @cutensor takes Arrays on the host and transfers them to the GPU for you. Let me know if the explanation is not clear on this point.

ChrisRackauckas · August 8, 2019, 12:38pm

Nope, I just missed that. Awesome! Hope to see some good chain rules for this.

juthohaegeman · August 8, 2019, 1:53pm

So do I. There was some work on AD with TensorOperations.jl by other people, e.g. @mcabbott. Not sure if that is still alive, fully functional, integrated somewhere, or compatible with ChainRules.jl?

mcabbott · August 8, 2019, 3:29pm

Nice to see, haven’t tried it yet but I will!

Yes, I had a very naive approach up & running but have not revisited it. Probably easy to package up if there is interest. Was fiddling with a smarter way, but it doesn’t work yet.

I believe @under-Peter is working harder on a new package which includes AD, although parallel to TensorOperations.jl, not built on top.

under-Peter · August 8, 2019, 3:57pm

Partially built on top, since I dispatch to TensorOperations when possible, but for the GPU we currently have a custom kernel.

As far as AD is concerned, my project works by naively switching arguments as described here. I’m not sure if there are much better ways, although it’s possible that the optimal contraction order for the backward pass is not usually the reverse of the forward pass, and one could probably cache intermediate results smartly. If anyone has resources for smarter AD for these operations, I’d be interested.
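
To make "switching arguments" concrete, here is a minimal sketch for the simplest contraction, a real matrix product. It is only an illustration and not the code of any package mentioned in this thread: the names `contract` and `pullback` are hypothetical, only the `LinearAlgebra` standard library is used, and complex arrays would additionally need conjugation. The gradient with respect to one input is the output cotangent contracted with the other input.

```julia
using LinearAlgebra

# Forward pass: C[i,j] = sum_k A[i,k] * B[k,j] (a plain matrix product).
# `contract` is a hypothetical name used only for this illustration.
contract(A, B) = A * B

# Reverse pass by "switching arguments": each input's gradient is the
# output cotangent ΔC contracted with the *other* input.
#   ΔA[i,k] = sum_j ΔC[i,j] * B[k,j]
#   ΔB[k,j] = sum_i A[i,k] * ΔC[i,j]
# `pullback` is likewise a hypothetical helper (real arrays assumed).
pullback(ΔC, A, B) = (ΔC * transpose(B), transpose(A) * ΔC)

A, B = rand(3, 4), rand(4, 5)
ΔC = rand(3, 5)
ΔA, ΔB = pullback(ΔC, A, B)

# Check one entry of ΔA against a central finite difference of the
# scalar loss L(A, B) = sum(ΔC .* contract(A, B)).
loss(A, B) = dot(ΔC, contract(A, B))
ϵ = 1e-6
E = zeros(size(A)); E[2, 3] = 1.0
fd = (loss(A + ϵ * E, B) - loss(A - ϵ * E, B)) / (2ϵ)
@assert isapprox(fd, ΔA[2, 3]; rtol = 1e-6)
```

For a network of several tensors the same rule applies to each input in turn, which is where the question of contraction order in the backward pass comes in.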

mcabbott · August 10, 2019, 1:03pm

Sorry about getting the details wrong!

I have just made some quick packages for old and new gradient code. These are a bit rough, but perhaps they display some different kinds of naivety. OP pointed me to this paper (arXiv:1310.8023), which has MATLAB code for what sounds like a very similar problem, but I have not read it closely.
