This is an OCR Solution, which uses the Modern Neural Network for Word Detection and Classical OCR techniques to Read the text from any image file. The complete solution works in 2 Stages:
- Detection of individual words from an Image.
- Passing the detected words to OCR Engine to identify the characters in the word
+We first detect the Bounding Boxes around the Words, using the Deep Learning Model EAST: An Efficient and Accurate Scene Text Detector.
- Using the Bounding Boxes, individual Words are also saved as seperate Words. The words are saved in the Order of Top-to-Bottom, Right-to-Left.
+Using one of the most famous OCR engine Tesseract, the characters are recognized from the individual Words from the Word Detection method. +The individual characters are appended to a txt file which is saved after the characters are recognized from all the individally detected words
- Models trained on ICDAR 2013 (training set) + ICDAR 2015 (training set): BaiduYun link GoogleDrive
- Resnet V1 50 provided by tensorflow slim: slim resnet v1 50
- Push my Commits to GitHub
- Update Readme for working of the Solution
- Original Implementation of EAST Detector
- PyTesseract