A custom-built deep learning approach for text extraction from identity card images

Geerish Suddul, Jean Fabrice Laurent Seguin


Information found on an identity card is needed for different essential tasks and manually extracting this information is time consuming, resource exhaustive and may be prone to human error. In this study, an optical character recognition (OCR) approach using deep learning techniques is proposed to automatically extract text related information from the image of an identity card in view of developing an automated client onboarding system. The OCR problem is divided into two main parts. Firstly, a custom-built image segmentation model, based on the U-net architecture, is used to detect the location of the text to be extracted. Secondly, using the location of the identified text fields, a (CRNN) based on long short-term memory (LSTM) cells is trained to recognise the characters and build words. Experimental results, based on the national identity card of the Republic of Mauritius, demonstrate that our approach achieves higher accuracy compared to other studies. Our text detection module has an intersection over union (IOU) measure of 0.70 with a pixel accuracy of 98% for text detection and the text recognition module achieved a mean word recognition accuracy of around 97% on main fields of the identity card.


Computer vision; Data augmentation; Deep learning; Optical character recognition; Text detection and recognition

Full Text:


DOI: http://doi.org/10.11591/ijict.v13i1.pp34-41


  • There are currently no refbacks.

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

The International Journal of Informatics and Communication Technology (IJ-ICT)
p-ISSN 2252-8776, e-ISSNĀ 2722-2616
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).

Web Analytics View IJICT Stats