[ANN] ContextTracking.jl - writing context-aware applications

Hi everyone,

Today, I would like to announce ContextTracking.jl, a package that helps you keep track of contextual information during program execution.

A quick example

Suppose that you have 4 functions: A calls B, B calls C, and C calls D. Normally, if you have gathered some data in A and want to access it in D, you would have to pass the data downstream via function arguments. With ContextTracking, you just access it directly in D.

Unlike global variables, context information is kept only for the lifetime of the execution call chain. The data is maintained in a stack structure: when a function returns, its data is cleaned up and removed.

How is it useful?

Just taking a use case description from the project’s README:

Suppose that we are processing a web request. We may want to create a correlation id to keep track of the request and include the request id whenever we write anything to the log file during any part of the processing of that request.

It may seem somewhat redundant to log the same data multiple times, but it is invaluable when debugging production problems. Imagine that two users hit the same web service at the same time. In the log file, their entries could be interleaved, and it would be quite confusing without the context.
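That correlation-id pattern might be sketched like this. The function names `handle_request` and `query_database`, the `:request_id` key, and the use of the `UUIDs` stdlib to generate an id are all illustrative assumptions, not part of the package; only `@ctx`, `@memo`, and `context` come from ContextTracking:

```julia
using ContextTracking
using UUIDs  # stdlib; used here to mint a hypothetical correlation id

# Hypothetical entry point: memoize an id once at the top of the request.
@ctx function handle_request()
    @memo request_id = string(uuid4())
    return query_database()
end

# Deeper in the call chain, no arguments were passed, yet the id is
# available for every log line written while handling this request.
@ctx function query_database()
    c = context()
    @info "running query" c.data[:request_id]
    return c.data[:request_id]
end
```

Any log statement along the chain can attach the same id, so interleaved entries from concurrent requests can be told apart.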

As context data is stored in a stack structure, you naturally gain more “knowledge” as you go deeper into the execution stack. Then, you naturally “forget” those details when the execution stack unwinds. With this design, you can memoize just the knowledge that is most valuable in the log file.
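The gain-then-forget behavior can be sketched with two nested functions (`outer`, `inner`, and the keys `:a`/`:b` are made-up names for illustration):

```julia
using ContextTracking

@ctx function outer()
    @memo a = 1
    inner_saw_a = inner()
    # `inner` memoized :b, but the stack unwound when it returned,
    # so :b has been "forgotten" here.
    b_still_here = haskey(context().data, :b)
    return (inner_saw_a, b_still_here)
end

@ctx function inner()
    @memo b = 2
    # Data memoized by the caller is visible deeper in the stack.
    return haskey(context().data, :a)
end
```

Calling `outer()` should show that `inner` can see `:a`, while `:b` does not survive `inner` returning.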

Demo

using ContextTracking

@ctx function foo()
    @memo x = 1
    bar()
end

@ctx function bar()
    c = context()
    @info "context data" c.data
end

Result:

julia> foo()
┌ Info: context data
│   c.data =
│    Dict{Any,Any} with 2 entries:
│      :_ContextPath_ => [:foo, :bar]
└      :x             => 1

Love this. I’ve needed it quite a few times. Thank you.

Any ideas/stats on the cost of using (or not using) this, in terms of time and memory, for a few examples?

Maybe relevant in this context (no pun intended :slight_smile: ):

https://discourse.julialang.org/t/propagation-of-available-assigned-worker-ids-in-hierarchical-computations


It’s not about exactly the same thing, obviously, but related in spirit.

Memory utilization should scale with how much data you want to track.

From a performance perspective, you may want to avoid using @ctx on a hot function that is called in a tight loop, because of the overhead of maintaining the stack. However, you can still use the context function to access previously recorded data.
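That advice might look like this in practice. `process_items`, `process_one`, and the `:job_id` key are hypothetical names; the point is only that the hot inner function carries no `@ctx` annotation yet can still read the context:

```julia
using ContextTracking

@ctx function process_items(items)
    @memo job_id = "job-1"
    # The hot inner function is NOT annotated with @ctx, so there is no
    # per-call save/restore of the context stack inside the loop.
    return sum(process_one, items)
end

function process_one(x)
    # Plain function: no stack maintenance, but previously memoized
    # data is still accessible via context().
    jid = context().data[:job_id]
    return x + length(jid)
end
```

Here only `process_items` pays the `@ctx` bookkeeping cost once, while `process_one` reads the memoized value on every iteration for free.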

I have used this package in a production setting for a data engineering process, and memory/performance has not been a concern for my use case. The context data that I need to track is minuscule compared to the data that I need to process, so I haven’t done much testing of the memory/performance overhead.

I would be delighted to hear whether it works well for other use cases, if you want to give it a try! :slight_smile:
