-
Notifications
You must be signed in to change notification settings - Fork 76
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add benchmark - ( 312.OCR ) #219
base: master
Are you sure you want to change the base?
Conversation
Important Review skippedDraft detected. Please check the settings in the CodeRabbit UI or the You can disable this status message by setting the Thank you for using CodeRabbit. We offer it for free to the OSS community and would appreciate your support in helping us grow. If you find it useful, would you consider giving us a shout-out on your favorite social media? TipsChatThere are 3 ways to chat with CodeRabbit:
Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments. CodeRabbit Commands (invoked as PR comments)
Additionally, you can add CodeRabbit Configuration File (
|
@prajinkhadka Thank you! One comment from my side: can you please estimate how large is the installation of tesseract? All libraries and binaries? |
Ideas how to build tesseract and include with libraries: |
I have added a new benchmark: 312.OCR
-> Here, I have used Tesseract for the OCR.
-> One of the issues is, that we use Pytesseract ( a wrapper for Tesseract). Tesseract needs to be installed so that py-tesseract finds the tesseract, and I have not found any prebuilt static binary of Tesseract. One of the ways is to use init.sh to install the tesseract according to OS package manager but this will need discussion and extensive testing for all the platforms.
To Do: