I’m trying to use Tesseract OCR in Foundry, and I need to know the specific steps to install Tesseract in a containerized transformation. Can anyone guide me on how to add language packs in the Docker file and make pytesseract recognize Tesseract in Python scripts?
You probably don’t need a containerized transform. Here is an old answer that I gave on StackOverflow, I would recommend to check if this still works:
https://stackoverflow.com/a/72197666/3652805
Thank you for your answer.
How can I install tesseract in code repository environment?
Add package tesseract as dependency, it’s available on conda-forge:
https://anaconda.org/conda-forge/tesseract
Thank you for your reply.