Best Practices for Using Coding Agents with Julia?

Matt_jl · April 22, 2026, 2:09pm

Very easy, just like the title says:

What are the best practices for letting coding agents start and manage Julia sessions for testing and debugging, while keeping the workflow safe, reproducible, and easy to reason about?

I think many people could find this useful, so maybe we can share our experiences, tips, and tricks in the comments below.

Satvik · April 22, 2026, 3:03pm

Julia depends very heavily on the REPL, so you want your agent to have access to a REPL.

For me, that means spinning up a REPL in a separate tmux/zellij session. Then the agents can just use send-keys to access the REPL and get fast feedback.

sob · April 22, 2026, 4:53pm

I currently use the Claude Code plugin (in a non edit mode) in Codium. I’m at the experimental stage so for now it’s more of a “how can I optimized this code”, “can you find bugs in this file” type use.

I have also experimented with a purely local setup using OpenCode linked to Qwen 3.6 or GPT-OSS but have found the models inferior.

My use is rather simple, so I’m also interested in what others are doing.

hersle · April 22, 2026, 8:03pm

Has anyone been able to set up Opencode to use a running Julia REPL? Out of the box it’s restarting Julia all the time, making trivial mistakes with setting up environments etc.

apo383 · April 22, 2026, 9:06pm

Use an MCP server like kaimon.jl. You can have a persistent session with running state, which eliminates the startup cost of CLI julia.

I did a test with copilot CLI, to use Julia to create a DataFrame df with one column data with one row 1. I had to ask it to use kaimon otherwise it would spin up its own julia shell. But it connected to the MCP server, which you can monitor for kaimon TUI. I’m just getting started, but kaimon looks quite capable, and the agent knows quite a lot. You can use the ex MCP command, or just ask it to do stuff in natural language.

BTW copilot can also create a persistent julia session in a shell and refer back to it. Not sure what the limitations are compared to MCP.

Haven’t tried opencode but expect it would be similar.

Satvik · April 24, 2026, 1:00am

I use a skill (a markdown file that gives the agent instructions) that I had Claude write, which tells the agent how to find an existing tmux session, start one if it doesn’t exist, start/restart the REPL, and send code to the REPL.

tim.holy · April 26, 2026, 10:47am

I have started bundling package-maintenance tasks into skills. I would rate these as “maturing” rather than “mature,” as I am still regularly finding ways to improve the skills to ensure higher-quality results. I’ve also made some effort to improve context-efficiency by being specific about certain instructions (“use a subagent to …” or “don’t read the source, extract this from Base.Docs.meta(MyPackage) using the following script:”).

Repo: GitHub - timholy/claude_config: Configuration files for claude code · GitHub

freeman · May 9, 2026, 3:33pm

Would you share please?

I just verified that asking along the lines of “start a tmux session. write to it using tmux send-keys and see response using tmux capture-pane”, works. But it’s not a genuinely interactive session and furthermore it seems inefficient and brittle as capture-pane always captures the entire screen, and no more. And then the LLM would have to use its context window to diff between two capture-pane “states”. This looks harder than it should.

Satvik · May 14, 2026, 6:51pm

I just uploaded the skill & scripts here: GitHub - Satvik/julia-repl-skill · GitHub

But I would say the specific skill is less important than the process. What you really want to do is have Claude write the skill, try it out, and then suggest improvements when you see Claude struggling. For example, in the first version I saw it had a really hard time figuring out when the REPL was done processing, so I ended up having it write a separate python script wait-julia.

csvance · May 14, 2026, 7:55pm

When the cost of verification is an edit (hot patch w/ Revise) + single tool call to evaluate a function / expression in a REPL, these agents get crazy good at writing Julia. It’s the same principle in software engineering where a mistake caught early is cheap compared to one caught late. There is a huge difference between having an agent write some code and testing it when its done and having the agent verify things at each step. When I switched to this style of workflow, suddenly the Julia code I got back that was supposed to be non-allocating actually was, not to mention it was significantly more likely to one-shot the task.

sob · May 28, 2026, 12:57pm

I second that. I’ve been experimenting with Claude Code, and had it write small programs entirely on its own. The programs did the job and did it well. Burned through my 5 hour session quota rather fast though (Opus 4.7).

Topic		Replies	Views
Letting AI agents use the VSCode Julia REPL General Usage	7	516	April 21, 2026
[ANN] julia-mcp — persistent Julia sessions for AI assistants Package Announcements ai	41	2657	May 21, 2026
[ANN] MCPRepl.jl -- share your REPL with your AI Agent Package Announcements	32	1973	February 27, 2026
Looking for custom instructions / settings for coding agents General Usage agents , ai , llm	0	115	August 8, 2025
Multi agent orchestration for Julia code generation: Best tooling / practices? Tooling llm , openai	1	311	October 28, 2025

Best Practices for Using Coding Agents with Julia?

Related topics