Introducing Rust alongside C in Julia's source tree?

MilesCranmer · January 9, 2025, 1:56am

Just curious if anyone has ever raised the question of introducing Rust inside the Julia source tree, alongside C? I know that the Linux kernel has recently started this in their C source to improve memory safety: Rust for Linux - Wikipedia. Since Julia’s GC is now multithreaded, and compilation is moving this way as well, I feel like the memory safety guarantees of Rust compared to C would help prevent any issues that could arise. Having experienced several bugs from race conditions inside Julia itself from 1.11’s multithreaded GC alone (#56761, #56759, #56735), I feel like the increasing complexity of the Julia runtime would be well-supported by moving parts of it to Rust, where there is more memory safety. In addition, the Julia community seems pretty on board with Rust, since juliaup is written with it.

(I do realise that this decision would lie with only a handful people though. Just planting some seeds.)

Oscar_Smith · January 9, 2025, 1:58am

see Building Julia with MMTk using BinaryBuilder by udesou · Pull Request #56989 · JuliaLang/julia · GitHub

d-netto · January 9, 2025, 2:37am

Having experienced several bugs from race conditions inside Julia itself from 1.11’s multithreaded GC alone

The bulk of the multithreaded GC was introduced in 1.10 and the GC threading changes between 1.10 and 1.11 were minimal.

As mentioned here, here and here these issues don’t reproduce on 1.10, so they don’t seem thread safety issues with Julia GC itself.

Indeed, the mallocarrays structure refactored in gc: improve mallocarrays locality by vtjnash · Pull Request #56801 · JuliaLang/julia · GitHub was present in 1.9 when the GC was single threaded.

The other fix PR Utilize bitshifts correctly in signals-mach.c when storing/reading the previous GC state by gbaraldi · Pull Request #53868 · JuliaLang/julia · GitHub refactors a bunch of bit twiddling we got wrong when manipulating the GC bits.

Not sure how using Rust would help here.

MilesCranmer · January 9, 2025, 2:53am

I think this didn’t show up on 1.10 because Memory hadn’t been introduced at that point, even though the underlying data structure had a memory leak (since my code allocates tons of arrays). It sounds like we still don’t know what that bug was from though, only there was some leak from that structure(?). Rust would help greatly for such leaks, via ownership/lifetimes, no?

This sounds pretty interesting. Is there a devdocs guide somewhere? Or maybe it’s literally just landing?

Oscar_Smith · January 9, 2025, 3:04am

The first part (adding support for MMTK if you compile your own copy of it) landed yesterday, and the 2nd part (binarybuilder support so you can get it if you compile Julia regularly with the right build flag set) is the PR I linked which isn’t merged. Expect more devdocs on this sort of thing closer to release.

d-netto · January 9, 2025, 3:36am

It sounds like we still don’t know what that bug was from though, only there was some leak from that structure(?). Rust would help greatly for such leaks, via ownership/lifetimes, no?

I can think of two possible explanations for this bug.

One explanation (unlikely), is that we somehow got the list implementation wrong. This seems like a software engineering issue that could have been solved by implementing a high-quality container inside Julia and covering it with extensive unit testing, or using a well-tested implementation provided by some language’s standard library (e.g. C++'s STL).

Rust could have helped here, but it would have helped just because it’s a language with a rich and well-tested standard library, not because of its memory safety properties (after all, the linked list implementation provided by their standard library has a considerable amount of unsafe code that manipulates raw pointers). It doesn’t differ from C++ here.

The other explanation for this bug that I can think of is that the list has a very poor layout that’s fragmenting Libc’s allocator and making it request more and more pages. This is an issue of the underlying allocator and we could be vulnerable to that even if we used Rust.

d-netto · January 9, 2025, 3:50am

That being said, Rust seems like a fine language for implementing GCs. Steve Blackburn’s team even wrote a paper about it: https://www.steveblackburn.org/pubs/papers/rust-ismm-2016.pdf.

I just don’t see how it would be better than other languages at solving the particular issues outlined above.

MilesCranmer · January 9, 2025, 8:43am

This isn’t correct - you couldn’t write such code unless you were to write unsafe { ... }, which loses all memory safety guarantees provided by the compiler and is almost always avoided. In other words this would not be normal Rust code.

Safe Rust doesn’t allow you to write code exactly as you would in C - because such patterns aren’t memory safe. It requires a redesign to satisfy the safety requirements. Which helps prevent issues such as leaks and races.

giordano · January 9, 2025, 8:56am

Yes: Julia better rewritten in rust?

One problem no one raised about Rust is that in BinaryBuilder we don’t have Rust toolchains for i686-w64-mingw32, aarch64-freebsd and riscv64-linux, because they are either unsupported by rustup or have incompatible runtimes with what we use. Which means we can’t compile Julia dependencies for those platforms, which would significantly complicate Julia build system there.

Benny · January 9, 2025, 8:57am

That has hit a snag, but more for philosophical and political reasons than anything to do with Rust’s merits. The more disheartening thing is how it exposed that the wider community still viewed Rust in Linux as an optional experiment at best, a tumor at worst.
Rust in Linux lead retires rather than deal with more “nontechnical nonsense” - Ars Technica

Users definitely routinely call code with unsafe blocks even if they don’t write any personally. Pushing to a vector has one, a blogpost estimated 7.5k/35k of standard library functions are unsafe. Everything gets unsafe deep down, the advantage of Rust is that its idea of safety is a statically knowable language semantic.

MilesCranmer · January 9, 2025, 8:58am

That thread is different. It’s about rewriting Julia in Rust. I’m not suggesting that. Just introducing Rust into the source tree, like what Linux did.

(That thread also looks to not motivate their question by anything, whereas this thread is motivated by real concerns about memory safety)

MilesCranmer · January 9, 2025, 9:11am

This is introducing confusion between direct use of unsafe code (unsafe) and indirect use of unsafe in safe abstractions (safe). Of course everything is unsafe deep down, but there’s a difference here. Those internal methods within LinkedList are actually unsafe to call and shouldn’t be used.

Benny · January 9, 2025, 9:30am

Internals are discouraged already, the unsafety is the lesser factor there. You’re technically right, but I’m trying to say it doesn’t mean you shouldn’t use either at all, it’s that you have to handle either carefully. Rust marking unsafe code makes it easier to analyze and handle, Julia started to mark public API for similar purposes. We’re not going to get a safe-Rust linked list anytime soon, so if we need a linked list now, no reason not to handle it carefully. Whether we need a linked list is a different question, often answered “no.”

MilesCranmer · January 9, 2025, 11:48am

Thanks for the info. Could you expand on why these are incompatible? I see these toolchains in rustup, for example

> rustup target list | grep -e riscv64 -e i686-pc -e aarch64-unknown-none
aarch64-unknown-none
aarch64-unknown-none-softfloat
i686-pc-windows-gnu
i686-pc-windows-gnullvm
i686-pc-windows-msvc
riscv64gc-unknown-linux-gnu
riscv64gc-unknown-linux-musl
riscv64gc-unknown-none-elf
riscv64imac-unknown-none-elf

e.g., isn’t “i686-pc-windows-gnu” completely compatible with “i686-w64-mingw32”? And similarly there look to be multiple options for riscv64-linux.

giordano · January 9, 2025, 11:59am

No:

github.com/rust-lang/rust

Passing `-C panic=abort` still attempts to link in `libunwind` when targeting `i686-pc-windows-gnu` on `v1.44+`

opened 08:41PM - 01 Dec 20 UTC

staticfloat

A-runtime A-linkage O-windows A-cross P-high T-compiler regression-from-stable-to-stable C-bug O-x86_32

### Summary When cross-compiling from a Linux distribution that provides SLJL… mingw32, linker errors about `libunwind` symbols are a known issue (https://github.com/rust-lang/rust/issues/12859). The generally-accepted workaround is to disable exception handling (via `-C panic=abort`) which should disable the need to collect backtraces and eliminate the linker errors. This works on a 1.43.0 toolchain, but is broken on 1.44.0-1.48.0 ### Example of error Here's an example of the error, running within the [`BinaryBuilder.jl` cross-compilation environment](https://github.com/JuliaPackaging/BinaryBuilder.jl). ``` # cat hello_world.rs fn main() { println!("Hello, World!"); } # /opt/x86_64-linux-musl/bin/rustc --target=i686-pc-windows-gnu -C panic=abort -o /tmp/testsuite/i686-w64-mingw32/rust/hello_world/hello_world.exe hello_world.rs error: linking with `i686-w64-mingw32-gcc` failed: exit code: 1 | = note: "i686-w64-mingw32-gcc" "-fno-use-linker-plugin" "-Wl,--nxcompat" "-Wl,--large-address-aware" "/opt/x86_64-linux-musl/toolchains/1.48.0-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib/rsbegin.o" "-L" "/opt/x86_64-linux-musl/toolchains/1.48.0-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib" "/tmp/testsuite/i686-w64-mingw32/rust/hello_world/hello_world.hello_world.7rcbfp3g-cgu.0.rcgu.o" "/tmp/testsuite/i686-w64-mingw32/rust/hello_world/hello_world.hello_world.7rcbfp3g-cgu.1.rcgu.o" "/tmp/testsuite/i686-w64-mingw32/rust/hello_world/hello_world.hello_world.7rcbfp3g-cgu.2.rcgu.o" "/tmp/testsuite/i686-w64-mingw32/rust/hello_world/hello_world.hello_world.7rcbfp3g-cgu.3.rcgu.o" "/tmp/testsuite/i686-w64-mingw32/rust/hello_world/hello_world.hello_world.7rcbfp3g-cgu.4.rcgu.o" "/tmp/testsuite/i686-w64-mingw32/rust/hello_world/hello_world.hello_world.7rcbfp3g-cgu.5.rcgu.o" "/tmp/testsuite/i686-w64-mingw32/rust/hello_world/hello_world.hello_world.7rcbfp3g-cgu.6.rcgu.o" "/tmp/testsuite/i686-w64-mingw32/rust/hello_world/hello_world.hello_world.7rcbfp3g-cgu.7.rcgu.o" "-o" "/tmp/testsuite/i686-w64-mingw32/rust/hello_world/hello_world.exe" "/tmp/testsuite/i686-w64-mingw32/rust/hello_world/hello_world.1oeskw8cnf06rbmk.rcgu.o" "-Wl,--gc-sections" "-nodefaultlibs" "-L" "/opt/x86_64-linux-musl/toolchains/1.48.0-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib" "-Wl,--start-group" "-Wl,-Bstatic" "/opt/x86_64-linux-musl/toolchains/1.48.0-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib/libstd-fe449066d03836b9.rlib" "/opt/x86_64-linux-musl/toolchains/1.48.0-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib/libpanic_abort-905e0827b1faa99b.rlib" "/opt/x86_64-linux-musl/toolchains/1.48.0-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib/libobject-b5919c53897ea4e7.rlib" "/opt/x86_64-linux-musl/toolchains/1.48.0-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib/libaddr2line-09e1099705854178.rlib" "/opt/x86_64-linux-musl/toolchains/1.48.0-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib/libgimli-a13132083f96cf01.rlib" "/opt/x86_64-linux-musl/toolchains/1.48.0-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib/librustc_demangle-1a2a881500c3aa11.rlib" "/opt/x86_64-linux-musl/toolchains/1.48.0-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib/libhashbrown-12335b7735858229.rlib" "/opt/x86_64-linux-musl/toolchains/1.48.0-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib/librustc_std_workspace_alloc-03e236d940d65c13.rlib" "/opt/x86_64-linux-musl/toolchains/1.48.0-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib/libunwind-67ee36f8c83e0d23.rlib" "/opt/x86_64-linux-musl/toolchains/1.48.0-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib/libcfg_if-551dfddd5bf52674.rlib" "/opt/x86_64-linux-musl/toolchains/1.48.0-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib/liblibc-119051673c0a64ec.rlib" "/opt/x86_64-linux-musl/toolchains/1.48.0-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib/liballoc-e4d213396c740246.rlib" "/opt/x86_64-linux-musl/toolchains/1.48.0-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib/librustc_std_workspace_core-09a82c5ce50e9376.rlib" "/opt/x86_64-linux-musl/toolchains/1.48.0-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib/libcore-e59a5606ba4e2b3b.rlib" "-Wl,--end-group" "/opt/x86_64-linux-musl/toolchains/1.48.0-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib/libcompiler_builtins-c634db1b8ae16db1.rlib" "-Wl,-Bdynamic" "-ladvapi32" "-lws2_32" "-luserenv" "-lgcc_eh" "-l:libpthread.a" "-lmsvcrt" "-lmingwex" "-lmingw32" "-lgcc" "-lmsvcrt" "-luser32" "-lkernel32" "/opt/x86_64-linux-musl/toolchains/1.48.0-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib/rsend.o" = note: /opt/x86_64-linux-musl/toolchains/1.48.0-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib/libstd-fe449066d03836b9.rlib(std-fe449066d03836b9.std.3uongtsb-cgu.0.rcgu.o): In function `ZN64_$LT$std..backtrace..BytesOrWide$u20$as$u20$core..fmt..Debug$GT$3fmt17h42ce8df8b153bc33E': /rustc/7eac88abb2e57e752f3302f02be5f3ce3d7adfb4\/library\std\src/backtrace.rs:231: undefined reference to `_Unwind_Resume' /opt/x86_64-linux-musl/toolchains/1.48.0-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib/libstd-fe449066d03836b9.rlib(std-fe449066d03836b9.std.3uongtsb-cgu.0.rcgu.o): In function `ZN4core3ops8function6FnOnce9call_once17h47abe59cb2be30c1E': /rustc/7eac88abb2e57e752f3302f02be5f3ce3d7adfb4\library\core\src\ops/function.rs:227: undefined reference to `_Unwind_Resume' /opt/x86_64-linux-musl/toolchains/1.48.0-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib/libstd-fe449066d03836b9.rlib(std-fe449066d03836b9.std.3uongtsb-cgu.0.rcgu.o): In function `ZN4core3ops8function6FnOnce9call_once17h48798ced7406d4f4E': /rustc/7eac88abb2e57e752f3302f02be5f3ce3d7adfb4\/library\std\src\io/stdio.rs:563: undefined reference to `_Unwind_Resume' /opt/x86_64-linux-musl/toolchains/1.48.0-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib/libstd-fe449066d03836b9.rlib(std-fe449066d03836b9.std.3uongtsb-cgu.0.rcgu.o): In function `ZN4core3ops8function6FnOnce9call_once17h0a2adbee20aeb7b2E': /rustc/7eac88abb2e57e752f3302f02be5f3ce3d7adfb4\library\core\src\ops/function.rs:227: undefined reference to `_Unwind_Resume' /opt/x86_64-linux-musl/toolchains/1.48.0-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib/libstd-fe449066d03836b9.rlib(std-fe449066d03836b9.std.3uongtsb-cgu.0.rcgu.o): In function `ZN71_$LT$alloc..vec..IntoIter$LT$T$GT$$u20$as$u20$core..ops..drop..Drop$GT$4drop17h9354c90820e79e8dE': /rustc/7eac88abb2e57e752f3302f02be5f3ce3d7adfb4\library\alloc\src/vec.rs:3069: undefined reference to `_Unwind_Resume' /opt/x86_64-linux-musl/toolchains/1.48.0-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib/libstd-fe449066d03836b9.rlib(std-fe449066d03836b9.std.3uongtsb-cgu.0.rcgu.o):/rustc/7eac88abb2e57e752f3302f02be5f3ce3d7adfb4\library\core\src\ptr/mod.rs:175: more undefined references to `_Unwind_Resume' follow collect2: error: ld returned 1 exit status error: aborting due to previous error ``` ### Testing with multiple versions I use the following script to install new versions of `rustc`: ```bash #!/bin/bash # Usage: change_win32_toolchain.sh [version] TOOLCHAIN_VER=${1:-1.48.0} rustup toolchain add ${TOOLCHAIN_VER} rustup target add --toolchain ${TOOLCHAIN_VER} i686-pc-windows-gnu # Copy crt2.o in to rust, see https://github.com/rust-lang/rust/issues/32859#issuecomment-573423629 cp /opt/i686-w64-mingw32/i686-w64-mingw32/sys-root/lib/crt2.o /opt/x86_64-linux-musl/toolchains/${TOOLCHAIN_VER}-x86_64-unknown-linux-musl/lib/rustlib/i686-pc-windows-gnu/lib/crt2.o # Set this new toolchain as the default for all invocations of `rustc` export RUSTUP_TOOLCHAIN=${TOOLCHAIN_VER}-x86_64-unknown-linux-musl ``` ### Version it worked on This works on Rust 1.43.0. ### Version with regression This does not work on Rust 1.44.0-1.48.0

github.com

JuliaPackaging/Yggdrasil/blob/a9b28744937ada4bee7c111053646fc979854f47/0_RootFS/Rust/build_tarballs.jl#L58


      
              aarch64-unknown-linux-gnu
              aarch64-unknown-linux-musl
              arm-unknown-linux-gnueabihf
              arm-unknown-linux-musleabihf
              armv7-unknown-linux-gnueabihf
              armv7-unknown-linux-musleabihf
              i686-pc-windows-gnu
              i686-unknown-linux-gnu
              i686-unknown-linux-musl
              powerpc64le-unknown-linux-gnu
              # riscv64-unknown-linux-gnu   # toolchain is not available
              x86_64-apple-darwin
              x86_64-pc-windows-gnu
              x86_64-unknown-freebsd
              x86_64-unknown-linux-gnu
              x86_64-unknown-linux-musl
          )
          
          for rust_target in "${RUST_TARGETS[@]}"; do
              # Install our target-specific stuffs for the toolchain we're requesting
              ${CARGO_HOME}/bin/rustup target add --toolchain ${version} ${rust_target}

MilesCranmer · January 9, 2025, 1:00pm

Would this reply be possible?

Re Yggdrasil:

    # riscv64-unknown-linux-gnu   # toolchain is not available

Oh, well this is just because the name changed. It’s now riscv64gc-unknown-linux-gnu.

StefanKarpinski · January 9, 2025, 2:29pm

The Julia runtime is a bit unusual. It is not actually a terribly big or complicated program, and we generally try to move anything we can into Julia.

The code generation part of the runtime uses C++ because that’s really the only first-class API for LLVM. The way it uses C++ has been criticized by real C++ programmers as “C with method calls”, which is entirely accurate and intentional. I suspect that using Rust to interface with LLVM would be a major impediment and cause us to have to wait not only for new LLVM releases but for Rust interfaces to LLVM to catch up to those releases, and of course it’s an extra layer of potential bugs. So I don’t think replacing code generation stuff with Rust would be a win.

Then there’s the basic OS runtime stuff. This is written in C and could more plausibly be implemented in Rust. However, a very large amount of this would have to be unsafe. As I said, it’s a very unusual program: it inherently needs to do a lot of unsafe (in the Rust sense) low level memory manipulation and does very little dynamic memory allocation that isn’t subsequently managed by Julia’s own GC. There’s a very small amount of concurrent data structure work, but it’s pretty minimal and unlikely to grow or change too much. Rust could maybe help there, but it seems better to keep the runtime as simple and lowest common denominator as possible, which favors C.

MilesCranmer · January 9, 2025, 4:12pm

Thanks Stefan - I really appreciate you taking the time to think this through and share a thorough explanation. Your reasoning about simplicity makes perfect sense, especially given Julia’s philosophy of moving more into Julia itself. The MMTk work seems like an interesting exploration in this space and I’m excited to see that progress.

Craig_Hamel · January 10, 2025, 6:53pm

Over the holidays I did mess around with trying to implement parts of the julia runtime in rust via inkwell as an LLVM generator just as a fun exercise to learn a little rust and learn some of the inter-workings of IR that was a mystery to me prior. pest also made it really easy to make a pretty robust parser/lexer for “most” of of the language with out too much thought.

I got as far as implementing a mirror of Symbol, DataType, Val, etc. and some of boot.jl in Base before I started getting over my head and it turning into more than a “fun side project”.

But like @StefanKarpinski mentioned there is a lag in the rust LLVM wrappers and LLVM itself. I was using LLVM 14 I think to get everything to play nice.

The lack of file IO support in the MPI wrappers in the rust ecosystem could also be a detriment to some of the julia ecosystem.

Topic		Replies	Views
Rust vs Julia Offtopic rust	11	33554	May 19, 2022
RustCall.jl? Tooling	36	3120	May 25, 2024
Does knowing Rust make you a better Julia programmer Offtopic rust	17	868	November 17, 2024
Comparison of Rust to Julia for scientific computing? Offtopic question , rust	123	33873	July 17, 2023
Blog post: Rust vs Julia in scientific computing Offtopic rust	140	21053	July 25, 2023

Introducing Rust alongside C in Julia's source tree?

Related topics