How efficient is dictionary lookup in Julia?

manuelma · July 30, 2019, 2:48pm

I have an application where I have to lookup keys in a dictionary several times, and depending on the particular case it could be that I lookup the same key hundreds of times. How much overhead should I expect?

In Python dictionary lookup complexity is O(1) or so I’ve heard. What about Julia?

zgornel · July 30, 2019, 3:16pm

Its pretty efficient.

using BenchmarkTools
using Random

n=1_000_000
mydict = Dict(i=>randstring(3) for i in 1:n);

@btime mydict[rand(1:n)]   # counts the random generation as well
# 117.599 ns (1 allocation: 47 bytes)
@btime mydict[433]
# 20.692 ns (0 allocations: 0 bytes)

fborda · July 30, 2019, 3:29pm

Yes, Julia Dictionaries are implemented as Hash Tables just like in Python, so it has O(1) lookup complexity (outside of cases with lots of collisions - when many keys have the same hash, which is usually handled internally as a linked list). I think it’s mostly functional languages that occasionally implements Dicts/Maps as balanced binary trees, which gives O(log n) lookup, but plays very nice with the concept of immutable data structures.

Tamas_Papp · July 30, 2019, 4:21pm

You could write a caching wrapper.

mutable struct CachingLookup{S,T}
    last_key::S
    last_value::T
    dict::Dict{S,T}
end

CachingLookup(d::Dict) = CachingLookup(first(d)..., d)

function Base.getindex(c::CachingLookup, key)
    c.key == key && return c.last_value
    c.last_value = c.dict[key]
end

(this is an untested sketch, does not handle corner cases and could be made more elegant)

foobar_lv2 · July 30, 2019, 4:56pm

Compared to python, dictionary lookup is very fast.

Compared to specialized implementations, it’s not so fast. Part of it is that hashing and equality are complicated in julia (e.g. 0x01==1 and therefore both must have the same hash). Another part is that default hashing is not very specialized:

julia> struct x
       x::Int
       end
julia> @btime hash($x(1));
  33.641 ns (0 allocations: 0 bytes)

This is ridiculously slow, because it involves a non-inlined runtime call to jl_object_id. At some point in the future it will probably get faster (someone writes an if @generated fallback that treats objects as blobs of memory).

manuelma · July 30, 2019, 7:23pm

So if I want a fast implementation in theory I could provide my own key type and then implement a fast version of hash for it?

JeffreySarnoff · July 30, 2019, 11:32pm

Yes, or consider using https://github.com/andrewcooke/AutoHashEquals.jl

Topic		Replies	Views
Updating values in a dictionary with a minimum of lookups General Usage question , dictionary	2	381	February 21, 2024
Really Fast Hash Set and Map Internals & Design	20	6244	August 29, 2018
Fastest way to do memoization? Performance performance	11	663	February 23, 2023
Different Dicts hashing to the same value General Usage question	6	1258	January 13, 2017
Dictionary with custom hash function General Usage question , dictionary , hash	7	2503	November 23, 2023

How efficient is dictionary lookup in Julia?

Related topics