Reactant would be the way to go if you want good performance for these workloads. If you want starter code, we have some WIP versions scattered across PRs at the moment (see the rough sketch after these links):
- feat: nanoGPT implementation using Reactant by avik-pal · Pull Request #1062 · LuxDL/Lux.jl · GitHub
- feat: add a Llama2 model by avik-pal · Pull Request #88 · EnzymeAD/Reactant.jl · GitHub
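For a sense of what the workflow looks like, here is a minimal sketch of the usual Lux + Reactant pattern (move parameters to Reactant arrays, then compile the forward pass through XLA/StableHLO). This is just illustrative and not pulled from the PRs above; the tiny `Dense` model and the variable names are placeholders.

```julia
using Lux, Reactant, Random

# A toy model standing in for nanoGPT / Llama2 from the linked PRs
model = Dense(4 => 2)
ps, st = Lux.setup(Random.default_rng(), model)

# Move parameters, state, and inputs onto Reactant arrays
ps_ra = Reactant.to_rarray(ps)
st_ra = Reactant.to_rarray(st)
x_ra  = Reactant.to_rarray(rand(Float32, 4, 8))

# Trace + compile the forward pass once, then reuse the compiled function
forward = Reactant.@compile model(x_ra, ps_ra, st_ra)
y, _ = forward(x_ra, ps_ra, st_ra)
```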
Even the quantized ops needed for inference already exist on the StableHLO side; we just haven't hooked them up in Julia yet, but that is definitely doable.