A faster way of getting image size from jpg files?

rafelafrance · August 3, 2021, 8:31pm

I was trying to figure out why my Python script was well over 1000 times faster than my translation to Julia and the profiler narrowed it down to a single line, the loading of the images to get image dimensions for clipping bounding boxes.

The slowness is probably due to the way that I’m using the Images.load() function, or not cleaning up after.

The question is, What am I doing wrong here?

Slow version

function init_subjects(by_subject, image_dir)::Vector{Subject}
    # <<snip>>

    for old_sub in ProgressBar(by_subject)
        image_file = old_sub.subject_Filename

        image = try
            load("$image_dir/$image_file")
        catch
            @warn "Could not load: $image_file"
            continue
        end

        image_size = size(image)
        # <<snip>>

Fast version

function init_subjects(by_subject, image_dir)::Vector{Subject}
    PILImage = pyimport("PIL.Image")

   # <<snip>>

    for old_sub in ProgressBar(by_subject)
        image_file = old_sub.subject_Filename

        image = try
            PILImage.open("$image_dir/$image_file")
        catch
            @warn "Could not load: $image_file"
            continue
        end

        image_size = reverse(image.size)
        image.close()
        # <<snip>>

I suppose that I can keep the pycall version, but I’d prefer to not have to juggle virtual environments in Julia.

sampope · August 3, 2021, 8:42pm

While still learning myself, I’ve found try/catch/finally/end are slow. It looks like you’re using try/catch because you’re not sure if the image file is there or PILImage.open will fail. You could use isfile() as a quick test to confirm the file is there. And use filesize() to get the files as an Int.

sampope · August 3, 2021, 8:44pm

Sorry, you wanted image size, not filesize. Sounds like images.jl does what you want.

stillyslalom · August 3, 2021, 9:05pm

I think PIL is loading the images lazily (ref):

This is a lazy operation; this function identifies the file, but the file remains open and the actual image data is not read from the file until you try to process the data (or call the load() method).

PIL is probably identifying the image size from the jpeg header, avoiding the need to load the entire image. You could probably do the same thing via LibJpeg along the lines of this post: Get DCT coefficients of jpeg image - #5 by stevengj

rafelafrance · August 3, 2021, 9:13pm

I think your analysis about why the speed difference is occurring is right.

But Images must be reading the header info too to properly render the image. I guess that I should look into short-circuiting the full load either with the library you mention or in the Images package itself.

thanks.

contradict · August 3, 2021, 9:32pm

If you are willing to use ImageMagick.jl, this seems substantially faster on my machine:

using ImageMagick

function imagesize(filename)
       wand = ImageMagick.MagickWand()
       ccall((:MagickPingImage, ImageMagick.libwand), Bool, (Ptr{Cvoid}, Ptr{UInt8}), wand, filename)
       size(wand)
end

rafelafrance · August 3, 2021, 10:00pm

I’m already using ImageMagick.jl so that’s easy. The code works as fast as the PIL.Image solution. I just needed to reverse(size(wand)) for my particular case.

Thank you very much.

Edit: This points to a very useful strategy of calling into libraries directly. TIL.

contradict · August 3, 2021, 10:07pm

That’s great, glad it helped. It is probably worth checking the return value of the :MagickPingImage if you use this in real code, it is just a boolean indicating success or failure.

Good catch about reversing the order of size I didn’t notice that.

Topic		Replies	Views
Loading Images is very slow, is there a work around? Performance question , images	2	1913	October 31, 2019
Why is my Julia code so slow? New to Julia performance	11	5369	March 3, 2017
Very slow loading speed of TIFF images New to Julia	15	1121	October 3, 2023
Loading 60k images from a folder. Python code is way faster than Julia New to Julia question , images , speed-optimization	11	499	June 14, 2025
Efficiently Loading and Processing Large Number of Images New to Julia question , images	4	1601	March 31, 2021

A faster way of getting image size from jpg files?

Related topics