Hi everyone,
I’d like to ask for advice on how to improve the quality of Julia code generated by LLMs, and to spark a discussion about what the community could do to make LLMs more useful for Julia tasks.
What I’ve tried so far
I wanted to use a local LLM to create plots with Makie. To give it better context, I downloaded the entire Makie documentation and set up a RAG system using Anything LLM + Gemma 4 MoE. Unfortunately, that didn’t help much – the generated code still looked very “Python-like” and rarely worked out of the box.
The frustration
Many AI coding agents, when asked to visualize some abstraction, produce Python or JavaScript code that actually runs. It’s clear that these models have been trained (or fine-tuned) on enough examples that they can even self-correct when they hit a problem, resulting in a working solution right away. I’d love to see something similar happen for Julia – ideally a tool that can generate ready-to-use Pluto or Bonito notebooks, or at least produce working scripts for Makie visualizations.
Question for the community
What would it take to get there? Could we create a large, high-quality public dataset of Julia visualization code (and more generally, idiomatic Julia examples) and fine-tune a local LLM on it? Are there already efforts in this direction that I could contribute to? I’d really appreciate any suggestions, experiences, or pointers to existing projects.
I already read similar topics, but maybe it’s time to make some updates?
Thanks in advance!