I guess Tesseract is a good tool to begin at least (from version 4.0, it has very good results in images with good enough resolution, even noisy ones). Then you can start thinking on adding good pre-processing steps to improve the image or even better ML models for the OCR task.
As far as I know, there are no Julia frameworks for OCR tasks as you asked. I’ve developed OCReract.jl to be just a simple wrapper for Tesseract to allow retrieving results in a Julia session and, as @uwechsler mentioned, it was not maintained. But recently, I’ve released a stable version with proper documentation for usage so you can try it to see if that meets your needs.
Repo: GitHub - leferrad/OCReract.jl: A simple Julia wrapper for Tesseract OCR
Doc: Home · OCReract.jl
5 Likes