Custom OCR for Identity Documents:OCRXNet
DOI:
https://doi.org/10.34306/att.v2i2.87Keywords:
Text Detector, Tesseract, Yolo, CRAFT, Noise Removal.Abstract
Recent advancements in the area of Optical Character Recognition (OCR) using deep learning techniques made it possible to use for real world applications with good accuracy. In this paper we present a system named as OCRXNet. OCRXNetv1, OCRXNetv2 and OCRXNetv3 are proposed and compared on different identity documents. Image processing methods and various text detectors have been used to identify best fitted process for custom ocr of identity documents. We also introduced the end to end pipeline to implement OCR for various use cases.
References
[2] Brzeski, Adam, et al. "Evaluating performance and accuracy improvements for attention-OCR." IFIP International Conference on Computer Information Systems and Industrial Management. Springer, Cham, 2019.
[3] Saluja, Rohit, et al. "OCR On-the-Go: Robust End-to-end Systems for Reading License Plates & Street Signs." 15th IAPR International Conference on Document Analysis and Recognition (ICDAR). 2019.
[4] Achanta, Rakesh, and Trevor Hastie. "Telugu OCR framework using deep learning." arXiv preprint arXiv:1509.05962 (2015).
[5] Namysl, Marcin, and Iuliu Konya. "Efficient, lexicon-free OCR using deep learning." arXiv preprint arXiv:1906.01969 (2019).
[6] Wei, Tan Chiang, U. U. Sheikh, and Ab Al-Hadi Ab Rahman. "Improved optical character recognition with deep neural network." 2018 IEEE 14th International Colloquium on Signal Processing & Its Applications (CSPA). IEEE, 2018.
[7] Kundaikar, Teja, and Jyoti D. Pawar. "Multi-font Devanagari Text Recognition Using LSTM Neural Networks." First International Conference on Sustainable Technologies for Computational Intelligence. Springer, Singapore, 2020.
[8] Chandrakala, H. T., and G. Thippeswamy. "Deep Convolutional Neural Networks for Recognition of Historical Handwritten Kannada Characters." Frontiers in Intelligent Computing: Theory and Applications. Springer, Singapore, 2020. 69-77.
[9] Ali, Hazrat, et al. "Pioneer dataset and automatic recognition of Urdu handwritten characters using a deep autoencoder and convolutional neural network." SN Applied Sciences 2.2 (2020): 152.
[10] Springmann, Uwe, et al. "OCR of historical printings of Latin texts: problems, prospects, progress." Proceedings of the First International Conference on Digital Access to Textual Cultural Heritage. 2014.
[11] Wang, Jianfeng, and Xiaolin Hu. "Gated recurrent convolution neural network for ocr." Advances in Neural Information Processing Systems. 2017.
[12] Ahmed, Saad Bin, et al. "Deep learning based isolated Arabic scene character recognition." 2017 1st International Workshop on Arabic Script Analysis and Recognition (ASAR). IEEE, 2017.
[13] Naseer, Asma, and Kashif Zafar. "Meta features-based scale invariant OCR decision making using LSTM-RNN." Computational and Mathematical Organization Theory 25.2 (2019): 165-183.
[14] Maalej, Rania, and Monji Kherallah. "Improving MDLSTM for offline Arabic handwriting recognition using dropout at different positions." International conference on artificial neural networks. Springer, Cham, 2016.
[15] Breuel, Thomas M. "High performance text recognition using a hybrid convolutional-lstm implementation." 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR). Vol. 1. IEEE, 2017.
[16] Liu, Wei, et al. "Ssd: Single shot multibox detector." European conference on computer vision. Springer, Cham, 2016.
[17] Zhou, Xinyu, et al. "EAST: an efficient and accurate scene text detector." Proceedings of the IEEE conference on Computer Vision and Pattern Recognition. 2017.
[18] Baek, Youngmin, et al. "Character region awareness for text detection." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2019.
[19] Kettunen, Kimmo, and Mika Koistinen. "Open Source Tesseract in Re-OCR of Finnish Fraktur from 19th and Early 20th Century Newspapers and Journals-Collected Notes on Quality Improvement." DHN. 2019.
Published
How to Cite
Issue
Section
License
Copyright (c) 2020 Kawal Arora, Ankur Singh Bist, Roshan Prakash, Saksham Chaurasia
This work is licensed under a Creative Commons Attribution 4.0 International License.
This journal permits and encourages authors to post items submitted to the journal on personal websites while providing bibliographic details that credit its publication in this journal.
Authors are permitted to post their work online in institutional/disciplinary repositories or on their own websites. Pre-print versions posted online should include a citation and link to the final published version in Journal of Librarianship and Scholarly Communication as soon as the issue is available; post-print versions (including the final publisher's PDF) should include a citation and link to the journal's website.