Fastest way to install Tesseract on EC2
Tesseract is a great open-source library that provides free OCR for multiple languages. You can use it directly from the command line, or in your own software through the libraries it ships with.
Tesseract lives in the EPEL repository, but simply enabling that repository is not enough, because the packages in the amzn-main repository seem to override those in EPEL. Tesseract also needs the libwebp package, so install libwebp from EPEL first with amzn-main disabled, then install Tesseract itself:
yum --enablerepo=epel --disablerepo=amzn-main install libwebp
yum --enablerepo=epel install tesseract
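
Once both commands finish, a quick sanity check can confirm that the binary is on the PATH and can read an image. The following is only a minimal sketch: the helper `have_cmd` and the file name `sample.png` are placeholders made up for illustration, not part of Tesseract or of the steps above.

```shell
#!/bin/sh
# Hypothetical helper: succeed if the given command is on PATH.
have_cmd() {
    command -v "$1" >/dev/null 2>&1
}

if have_cmd tesseract; then
    # Print the installed version and the image libraries it was built with.
    tesseract --version
    # OCR a sample image if one is present; "sample.png" is a placeholder.
    # Tesseract writes the recognized text to out.txt.
    if [ -f sample.png ]; then
        tesseract sample.png out
    fi
else
    echo "tesseract not found on PATH; check the yum output for errors" >&2
fi
```

If the version banner prints, the install worked; if the OCR step runs, the recognized text should end up in `out.txt` next to the image.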