[Help Wanted] gRPCClient2.jl: Production Grade gRPC Client

Hello Julia Community,

I have been working on a new gRPC client with an emphasis on production-grade performance and reliability. The client is already thread/async safe and uses the fast and up-to-date 1.0 version of ProtoBuf.jl. There is zero extra memory copying or buffering between the client and libCURL, and there are optimizations to reduce the overhead of many small streams over multiplexed HTTP/2.

Repo: https://github.com/csvance/gRPCClient2.jl
Docs: gRPCClient2.jl · gRPCClient2

The name of the package is just a placeholder while it is under rapid development. The client borrows some code from gRPCClient.jl and Downloads.jl, so thanks to the maintainers/contributors of those packages for helping bootstrap this effort.

Looking for collaborators in general, but right now I need the following:

  • general usage testing / feedback on interfaces and API
  • more test coverage for streaming / test against more gRPC servers than just Python

Of course I am working through most of these myself, but I would appreciate any help if you are interested in having a production-grade gRPC client in Julia.

The latency / overhead / resource usage is currently quite minimal. Some benchmarks below (API not final):

// Benchmark a unary RPC with small protobufs to demonstrate overhead per request
// subset of grpc_predict_v2.proto for testing

syntax = "proto3";
package inference;

message ServerReadyRequest {}

message ServerReadyResponse
{
  // True if the inference server is ready, false if not ready.
  bool ready = 1;
}

service GRPCInferenceService
{
  // The ServerReady API indicates if the server is ready for inferencing.
  rpc ServerReady(ServerReadyRequest) returns (ServerReadyResponse) {}
}

using ProtoBuf
using BenchmarkTools
using gRPCClient2
using Base.Threads

include("grpc_predict_v2_pb.jl")

const grpc = gRPCCURL()

function bench_ready(n)
    @sync begin
        # Issue all n requests without waiting for any responses
        requests = Vector{gRPCRequest}()
        for i in 1:n
            request = ServerReadyRequest()
            # once we generate bindings from the service definition this will be much cleaner
            req = grpc_unary_async_request(grpc, "grpc://rpctest.local:8001/inference.GRPCInferenceService/ServerReady", request)
            push!(requests, req)
        end

        # Await and decode each response in order
        for req in requests
            response = grpc_unary_async_await(grpc, req, ServerReadyResponse)
        end
    end
end

# Sync usage (must wait for response before sending next request)
@benchmark bench_ready(1)

BenchmarkTools.Trial: 6821 samples with 1 evaluation per sample.
 Range (min … max):  370.152 μs …   6.193 ms  ┊ GC (min … max): 0.00% … 0.00%
 Time  (median):     520.608 μs               ┊ GC (median):    0.00%
 Time  (mean ± σ):   722.812 μs ± 671.093 μs  ┊ GC (mean ± σ):  0.00% ± 0.00%

  ▂▆█▆▅▄▃▂▂▁                                                    ▁
  ███████████▇▆▅▆▄▄▅▅▄▆▄▄▅▆▆▆▅▂▄▅▅▅▃▅▅▄▅▅▅▅▄▅▄▄▆▆▆▆▆▆▇▇▇▆▅▄▄▄▄▂ █
  370 μs        Histogram: log(frequency) by time        3.9 ms <

 Memory estimate: 5.36 KiB, allocs estimate: 109.

# Async usage (send all requests as fast as possible and then wait for all responses)
@benchmark bench_ready(1000)

BenchmarkTools.Trial: 32 samples with 1 evaluation per sample.
 Range (min … max):  149.132 ms … 181.694 ms  ┊ GC (min … max): 0.00% … 0.00%
 Time  (median):     157.729 ms               ┊ GC (median):    0.00%
 Time  (mean ± σ):   160.711 ms ±   7.917 ms  ┊ GC (mean ± σ):  0.00% ± 0.00%

       ▁        ▄█▁    ▁                 ▁
  ▆▁▁▆▁█▆▁▆▆▁▁▁▆███▆▁▁▆█▆▆▁▁▁▆▁▁▆▁▁▁▁▁▆▁▁█▁▁▆▁▆▁▁▁▁▁▁▁▁▁▆▁▁▁▁▁▆ ▁
  149 ms           Histogram: frequency by time          182 ms <

 Memory estimate: 2.37 MiB, allocs estimate: 54179.

Dividing by 1000 gives a mean of 160.711 μs per request, down from 722.812 μs in the sync case: around a 4.5x speedup from not having to wait for a response before sending the next request. The ICMP RTT to this server is ~300 μs from my computer on the LAN.
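
To make the arithmetic explicit, here is a quick sanity check using the numbers from the two trials above:

sync_mean_us  = 722.812                  # mean per request in the sync case (μs)
async_mean_us = 160.711e3 / 1000         # async: 160.711 ms per 1000 requests, i.e. 160.711 μs each
speedup = sync_mean_us / async_mean_us   # ≈ 4.5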

9 Likes

Opened a pull request to add support for RPC client stub code generation with ProtoBuf.jl. Depending on how fast I'm able to get services support fully worked out and merged, we could have the v0.1 release in the next few weeks. Tests and CI/precompile infrastructure have been set up, and considerable work has been done to smooth out rough edges: useful exception messages, fixing memory/handle leaks, etc.

I also cleaned up the public interface / API:

using gRPCClient2

# Include the protobuf definitions and RPC client stubs
include("gen/proto/test_pb.jl")

# Initialize the gRPC package - grpc_shutdown() does the opposite for use with Revise.
grpc_init()

# Create a client from the generated client stub
client = TestService_TestRPC_Client("localhost", 8001; secure=false)

# Sync API
test_response = grpc_sync_request(client, TestRequest(1))

# Async API
requests = Vector{gRPCRequest}()
for i in 1:10
    push!(
        requests, 
        grpc_async_request(client, TestRequest(1))
    )
end

for request in requests
    response = grpc_async_await(client, request)
end

Now that the API is relatively stable I'm going to start writing documentation, and I'll continue to stress test the client and fix any remaining undiscovered bugs.

1 Like

Looks like some neat work!

As someone who is specifically not a fan of xyz2, xyz3 packages, are you interested in talking to the gRPCClient.jl folks about potentially replacing it with your package?

2 Likes

Nice to see some interest and movement in gRPC support for Julia. This has long been a stumbling block when integrating Julia services into larger projects, and there aren't that many alternatives really; e.g. there is also no AMQP v1 support in the ecosystem (although I did make a start there, maybe I need to request help there too). So it is nice to see that someone is willing to push this domain further. Maybe someday we'll also have a gRPC server in Julia.

Nice to see some basic integration testing set up as well. Maybe I could help (if I get some spare time, lol) with setting up test servers in a few more languages to test against. At least JS/TS and Go would be nice to catch any inconsistencies in implementations (I heard that there can be some).

2 Likes

Good idea. Once we are a little farther along with documentation and testing we can open the discussion. I just didn't want to bother them until it was clear how serious this effort is :sweat_smile:

Indeed, when I was first trying to adopt Julia for a project at work, this ended up being such a large roadblock that I almost gave up on the language. So it will be good if no one else ever has to go through that again :sweat_smile:

A gRPC server in Julia is on my radar; it may be possible to build with nghttp2, which already has a JLL package. Once the client initiative is complete I will look into it more.

That would be much appreciated! Get me some spare time too :grinning_face_with_smiling_eyes:

1 Like

I just merged streaming RPC support. Request, response, and bidirectional streaming are all supported. Test coverage for streaming RPC isn't nearly as comprehensive as it is for unary RPC yet, but the basics are working.
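
For a rough picture of usage, here is a purely hypothetical sketch of a bidirectional stream. Every grpc_stream_* name below is a placeholder for illustration, not the package's actual interface; see the repo for the real API.

# Hypothetical placeholder names, not the package's actual API
stream = grpc_stream_open(client)             # open a bidirectional stream

for i in 1:10
    grpc_stream_send(stream, TestRequest(i))  # push requests as they are produced
end
grpc_stream_close_send(stream)                # signal the end of the request stream

# consume responses as the server produces them
while (response = grpc_stream_recv(stream)) !== nothing
    # handle each response here
end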

This should bring us to feature parity with gRPCClient.jl now. I also reached out to the maintainers to ask about doing a 1.X version release with the new codebase.

3 Likes

Streaming RPC should be stable now after a decent amount of stress testing and bugfixing. We are also now outperforming Python's gRPC client, which tends to have pretty solid performance in my experience.

I ran some benchmarks using the Python gRPC client to compare the overhead between Python's grpcio package and gRPCClient2.jl. I gave it 24 threads, the same as julia -t auto on my system when I run benchmark_workload_smol() from workloads.jl in the gRPCClient2.jl repo. I'm aware the GIL is a thing, but calling into grpcio releases the GIL, so it shouldn't be a significant bottleneck. Pretty much none of grpcio is written in Python, which is why it's fast :sweat_smile:

 % uv run grpc_test_client.py 30
average: 7019.47 RPS
std: 202.31 RPS
min: 6153.18 RPS
max: 7278.98 RPS

In Julia, running the same benchmark with all the recent changes I get the following (keep in mind this is doing 1000 requests per trial, so we will have to divide the results by 1000).

julia> benchmark_workload_smol()
BenchmarkTools.Trial: 41 samples with 1 evaluation per sample.
 Range (min … max):  108.345 ms … 135.084 ms  ┊ GC (min … max): 0.00% … 7.75%
 Time  (median):     123.482 ms               ┊ GC (median):    0.00%
 Time  (mean ± σ):   122.444 ms ±   6.091 ms  ┊ GC (mean ± σ):  0.33% ± 1.41%

                               █   ▃▃▃█ █  █ ▃
  ▇▁▇▁▁▁▁▇▇▁▁▁▇▁▁▁▁▇▇▇▇▇▁▇▁▇▇▁▁█▁▇▁████▁█▇▁█▇█▁▁▁▁▁▇▇▇▁▁▁▁▇▁▁▁▇ ▁
  108 ms           Histogram: frequency by time          135 ms <

 Memory estimate: 4.27 MiB, allocs estimate: 93559.

There are 1000 requests per trial, so divide the mean by 1000 and convert to RPS:

julia> 1 / (0.122444 / 1000)
8166.998791284178

We are beating Python by over 1000 RPS on average.

8 Likes

Does anyone care if we start support at Julia 1.12? It looks like there are some issues specifically with supporting streaming on 1.10/1.11 that were resolved in 1.12. I don't really have time to dig deeper into it, but as things stand now I plan on supporting all Julia versions >= 1.12. CI has been updated to test against 1.12 and nightly. I'm also testing as part of a production system that uses gRPC, so far so good.

Sometime in the next few weeks the package will be submitted for registration. I'm hoping that as part of the registration process I can get in contact with the people I need to about the package name and upstreaming gRPC codegen into ProtoBuf.jl.

2 Likes

Thank you very much for taking the time to work on this. The lack of proper gRPC support in Julia was severely limiting, IMO.

2 Likes

@csvance

I'd be super interested in a gRPC server for Julia.

When you get this to a decent point I'd love to experiment with targeting this as an alternative backend for Oxygen.jl. The package already performs introspection on all inputs and outputs of its handlers, so I could potentially just generate the proto schema & files and hook them into your gRPC server.

I could even find some way to enable both gRPC and HTTP servers to run in the same application, to give people multiple ways to connect to their app.

In theory the handlers could look something like:

# Hypothetical macro for gRPC handlers (auto-generates proto and hooks into some grpc server)
@grpc function add(request::MathRequest)
    result = request.a + request.b
    return MathResponse(result)
end

@grpc function create_person()
    person = Person("joe", 20)
    return PersonResponse(person)
end

# ...existing code (e.g., @get "/add/{a}/{b}" remains for HTTP)...

serve(port=8080)  # HTTP server; gRPC would run separately or integrated

Below is the theoretical schema that could be generated from those handlers:

syntax = "proto3";

package oxygen_example;

import "google/protobuf/empty.proto";

message MathRequest {
  double a = 1;
  double b = 2;
}

message MathResponse {
  double result = 1;
}

message Person {
  string name = 1;
  int32 age = 2;
}

message PersonResponse {
  Person person = 1;
}

service OxygenService {
  rpc Add(MathRequest) returns (MathResponse);
  rpc GetPerson(google.protobuf.Empty) returns (PersonResponse);
}

1 Like

The gRPC server is in development, but I'm not making any promises on the timeline yet. The approach I'm currently in favor of is to use nghttp2 together with Julia TCP sockets, though I have no idea how much worse that would perform compared to working directly with Julia's libuv interface. The reason I like this approach is that it could produce something that's actually useful to many people even before it's written in a completely optimal way, while not requiring a complete rewrite to take it the rest of the way.
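
As a rough illustration of that layering (a minimal sketch under my own assumptions; handle_h2_session! is just a placeholder, and none of this is real package code):

using Sockets

# Accept TCP connections with the Sockets stdlib and hand each one off to a
# task; the task would drive an nghttp2 session over the raw bytes.
function serve_grpc(port::Integer)
    server = listen(IPv4(0), port)
    while true
        sock = accept(server)
        errormonitor(@async handle_h2_session!(sock))
    end
end

# Placeholder: a real implementation would feed these bytes into nghttp2's
# frame parser and dispatch decoded gRPC calls to user handlers.
function handle_h2_session!(sock::TCPSocket)
    try
        while !eof(sock)
            bytes = readavailable(sock)  # raw HTTP/2 frames for nghttp2 to parse
        end
    finally
        close(sock)
    end
end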

Update on gRPCClient2.jl

There is now an active pull request / code review in progress for using gRPCClient2.jl as the 1.0.0 release of gRPCClient.jl. Not sure exactly when it will be done, but things are looking good so far!

4 Likes

gRPCClient2.jl is now gRPCClient.jl with a new home in JuliaIO :tada:

As for actually registering the 1.0.0 release, all that remains is to finish upstreaming code generation support into ProtoBuf.jl.

The test server was rewritten in Go, as it turns out some of the benchmarks were bottlenecked by Python's GIL; throughput doubled in cases with very small messages. @atthom reworked the benchmark scripts to use PrettyTables.jl with proper units, presenting everything together nicely.

julia -t auto

╭──────────────────────────────────┬─────────┬────────┬─────────────┬──────────┬────────────┬──────────────┬─────────┬──────┬──────╮
│                        Benchmark │       N │ Memory │ Allocations │ Duration │ Throughput │ Avg duration │ Std-dev │  Min │  Max │
│                                  │   calls │    MiB │             │        s │    calls/s │           μs │      μs │   μs │   μs │
├──────────────────────────────────┼─────────┼────────┼─────────────┼──────────┼────────────┼──────────────┼─────────┼──────┼──────┤
│                    workload_smol │   91000 │   3.75 │       85123 │     5.03 │      18079 │           55 │    3.96 │   48 │   67 │
│        workload_32_224_224_uint8 │    2900 │  63.78 │        9188 │     5.01 │        579 │         1728 │   97.86 │ 1614 │ 1899 │
│       workload_streaming_request │ 1841000 │   0.89 │        6482 │     4.99 │     368669 │            3 │    1.35 │    2 │   21 │
│      workload_streaming_response │  330000 │   13.0 │       27838 │     5.02 │      65771 │           15 │     5.2 │    6 │   37 │
│ workload_streaming_bidirectional │  405000 │   1.48 │       25672 │      5.0 │      80948 │           12 │    8.52 │    3 │   62 │
╰──────────────────────────────────┴─────────┴────────┴─────────────┴──────────┴────────────┴──────────────┴─────────┴──────┴──────╯