Tesseract Open Source OCR Engine (main repository)
Updated Jun 11, 2024 - C++
Shared services provides ready-made solutions for most of the code snippets required for back-end service development in Spring Boot.
Build and deploy a fully-featured, observable, user-facing RAG backend in minutes.
Leverage deep learning to digitize old Vietnamese handwriting for historical document archiving (made with national pride in every single line of code): https://www.kaggle.com/datasets/quandang/nomnaocr
Ruby gem for communicating with the Veryfi OCR API.
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
A community-supported supercharged version of paperless: scan, index and archive all your physical documents
OpenAPI definitions of Regula Document Reader web application
Retrieve files from Hydrus Network and run them through OCR.
Node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!
Repository for the MA Digital Text Analysis thesis.
A privacy-first, self-hosted, fully open source personal knowledge management software, written in TypeScript and Go.
Open-source infrastructure and data orchestration platform for risk decisioning
App deployed on Google Cloud Platform allowing for OCR and note summaries, developed under the supervision of Google
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.