[ANN] RemoteREPL.jl

c42f · June 26, 2021, 9:57am

Hi all, I recently released RemoteREPL.jl which makes it simple to connect your local Julia REPL to a remote Julia process. I’ve mentioned the package in a couple of places but thought it could do with an announcement thread.

RemoteREPL can attach a REPL to any cooperating Julia process which has called serve_repl(). The client REPL started with connect_repl() supports executing code in the Main module of the remote process, has tab completion, help mode with the ? prefix, and automatically reconnects if the connection is dropped.

For remote machines, RemoteREPL automatically tunnels traffic over SSH so should have decent network security. (For multi-user environments there’s still some security caveats.)

Demo:

Thanks to @Mason for ReplMaker.jl which makes setting up new repl modes quite easy.

jling · June 26, 2021, 10:29am

this is the coolest thing I’ve seen in this type of tooling for a while. Now I can run Julia session on cluster nodes (that cannot be ssh-ed into) and debug my script this way! Do you think it’s possible to support socket-file protocol in addition to ssh? (see ClusterManagers.jl for example)

c42f · June 26, 2021, 12:30pm

Glad you like it!

All communication currently happens over a normal TCP socket (SSH is only used for tunneling the connection over an insecure network). It should be fairly easy to set up a connection via a socket file rather than a TCP socket, it’s just a matter of doing the plumbing work. Without knowing more about your cluster I’m not quite sure how this would help you though. Which cluster manager are you using from ClusterManagers?

Slightly related - RemoteREPL also supports alternative tunnels (a) for Kubernetes via kubectl and (b) for Amazon EC2 session manager thanks to @Nishanth_Kottary.

jling · June 26, 2021, 12:42pm

sorry for not being clearer. The login node (I can ssh into) and the cluster note share some part of the storage (other wise cluster can’t read config files or data I prepared). Even thought I cannot directly ssh into a cluster node easily.

github.com

JuliaParallel/ClusterManagers.jl/blob/a609b40ceb03de3c106bf1f5bbf90dc6b95ae90e/src/qsub.jl#L66-L71


      
          filenames(i) = "$wd/julia-$(getpid()).o$id-$i","$wd/julia-$(getpid())-$i.o$id","$wd/julia-$(getpid()).o$id.$i"
          
          println("Job $id in queue.")
          for i=1:np
              # wait for each output stream file to get created
              fnames = filenames(i)

tbeason · June 26, 2021, 2:08pm

Bookmarking this to try it out later! I think I could benefit from this. Thanks

c42f · June 26, 2021, 11:00pm

Right, so there’s a shared filesystem mounted on the login and the compute nodes. This won’t help with socket communication though because socket files can’t be used to communicate between different machines. (If I understand correctly, they’re not like normal files at all, mainly they only exist as a path name to identify a kernel resource on a single machine.)

If the compute nodes are on the same network as the login node you could use the RemoteREPL protocol directly rather than an ssh tunnel. Unfortunately this is completely insecure in a shared cluster environment and I don’t recommend it. We could add a shared token (“password”) to the connection header so that it’s as secure as Julia’s existing Distributed protocol (which is to say, not very secure IIUC). Then it could be justified to use RemoteREPL anywhere that Distributed.jl itself is currently used.

jling · June 26, 2021, 11:04pm

oh, then is the code I quoted just using a regular file? I mean that could work I guess haha

c42f · June 26, 2021, 11:50pm

Yes it seems to be just a regular log file (as the comment indicates, it’s kind of hacky). But this isn’t for normal communication between the nodes; it’s only to get the host and port (julia/stdlib/Distributed/src/managers.jl at 05e41385de563a2e8aeea9000a7e381fd9bb3318 · JuliaLang/julia · GitHub) after which the rest of the communication happens via a TCP socket.

What you really need for your use case is to connect directly to the RemoteREPL port. It’s technically possible to expose this port, but in the current version there’s no security: it allows any user on the network to log into your node and execute arbitrary code.

Ideally we’d add security to the RemoteREPL protocol itself. At least with a basic token, or better with libssh2_jll — in that case, it would be like RemoteREPL acting as its own ssh server, though I’m not sure how much work that would be.

Topic		Replies	Views
RemoteREPL cannot connect on Windows over SSH General Usage windows , ssh	1	527	February 14, 2022
Connecting to Remote REPL on Atom New to Julia question	0	315	October 14, 2021
Call for help - Network latency of RemoteREPL Specific Domains repl , latency , network	0	181	June 21, 2023
Manual port forwarding to use REPL New to Julia juno , repl	10	1209	July 26, 2020
Embedding the REPL New to Julia question , embedding	2	278	February 13, 2024

[ANN] RemoteREPL.jl

Demo:

Related topics