Also see here, I’m working on a timm port for Lux.jl: [ANN] Jimm.jl: Lux ports of timm image backbones, with HuggingFace pretrained weights
That’s almost exactly what I’m working on (my working name was even Jimm). I’ll take a look and see if I can contribute. So far, I’ve implemented all variants of Timm’s VisionTransformer, ConvNeXt (both v1 and v2), and Eva (basically ViT with rotary positional embeddings used by SAM3). I also have implementations for Swin, PVT, and Twins, but I didn’t get around to adding pre-trained weights yet. It should be relatively straightforward to convert from Flux to Lux.