I started training a language model from scratch in Julia. I did not use pre-built libraries for the core: I built my own BPE tokenizer and my own training loop, faced hallucinations, and rebuilt.
I tried Flux and Lux. Both have strengths, but also critical weaknesses (CUDA conflicts, design limitations). After a long struggle, I found a different path that worked.
PythonCall played a key role, bridging Julia to Python’s ecosystem when needed. But Julia itself was the heart of the project.
What I learned is that Julia is not just “another language”. It is a platform for real understanding. If the community focuses on its unique strengths (speed, metaprogramming, Python interop), I believe Julia can surpass many expectations.
I am not sharing technical details now. But I wanted to confirm: Julia is inspiring. With more work, it can become much, much more.
I am sharing a screenshot as a proof of concept. The full code is not open-source at this stage. I want to document it properly first. I may share it later. I hope you understand and respect that.
#machinelearning
When I first came across Julia a few weeks ago, it really was a dream come true! It has so many language features and design choices that I adore (I once tried designing an intuitive language myself, and it turns out Julia is literally what I was looking for the whole time). I can confirm Julia is inspiring, and in my opinion at least, Julia is THE language (well, Julia is unironically a cult lol).
Please consider sharing the code; I would be really interested in taking a look. This Discourse is one of the best ways to learn and share Julia tips and tricks.
I would be especially interested in looking at the BPE tokenizer built from scratch.
Thank you for your interest. I understand that the Discourse is for learning, and I appreciate that.
However, as I mentioned in the original post, the code is not open-source at this stage. I am still documenting it and refining it.
Regarding the BPE tokenizer: I built it from scratch after studying multiple implementations. The general approach is standard (Byte-Pair Encoding), but my specific adaptation for Arabic text and the model’s vocabulary is what makes it unique.
I am not sharing the code yet, but I am happy to discuss the algorithm or the challenges I faced. What specifically would you like to know about BPE in Julia?
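For anyone following along who wants to experiment with the general algorithm in Julia, here is a minimal BPE merge-learning sketch. This is not the author's code, and it omits the Arabic-specific adaptations mentioned above; all function names are illustrative. It implements the standard procedure: count adjacent symbol pairs across a word-frequency table, merge the most frequent pair, and repeat.

```julia
# Minimal BPE sketch (illustrative only, not the author's implementation).

# Count frequencies of adjacent symbol pairs across all tokenized words.
function pair_counts(words::Dict{Vector{String},Int})
    counts = Dict{Tuple{String,String},Int}()
    for (syms, freq) in words
        for i in 1:length(syms)-1
            pair = (syms[i], syms[i+1])
            counts[pair] = get(counts, pair, 0) + freq
        end
    end
    return counts
end

# Replace every occurrence of `pair` in a word with the merged symbol.
function merge_pair(syms::Vector{String}, pair::Tuple{String,String})
    out = String[]
    i = 1
    while i <= length(syms)
        if i < length(syms) && (syms[i], syms[i+1]) == pair
            push!(out, syms[i] * syms[i+1])  # fuse the pair into one symbol
            i += 2
        else
            push!(out, syms[i])
            i += 1
        end
    end
    return out
end

# Learn `n_merges` merge rules from a word => frequency table.
function learn_bpe(corpus::Dict{String,Int}, n_merges::Int)
    # Start from individual characters (a real tokenizer would use bytes
    # and handle Unicode normalization, which matters for Arabic).
    words = Dict([string(c) for c in w] => f for (w, f) in corpus)
    merges = Tuple{String,String}[]
    for _ in 1:n_merges
        counts = pair_counts(words)
        isempty(counts) && break
        best = argmax(counts)  # key with the highest count (Julia >= 1.7)
        push!(merges, best)
        words = Dict(merge_pair(s, best) => f for (s, f) in words)
    end
    return merges
end

merges = learn_bpe(Dict("low" => 5, "lower" => 2, "lowest" => 3), 3)
```

The learned `merges` list, applied in order, is what a BPE tokenizer replays at encoding time. A production version would operate on bytes rather than characters and cache pair counts incrementally instead of recomputing them each iteration.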