Full Text (PDF)
Original Article

Offline Script Identification from Handwritten Gujrati Script Documents

Akash Sharma, Chhote Raja Patle, Anuwanshi Sharma, Anita Yadav

Author Information

Licence:
Attribution-Non-commercial 4.0 International (CC BY-NC 4.0)

This license enables reusers to distribute, remix, adapt, and build upon the material in any medium or format for noncommercial purposes only, and only so long as attribution is given to the creator.


Journal of Clinical Forensic Sciences 2(2):p 59-64, 2024-12-01. | DOI: N\A
How Cite This Article:
Akash Sharma, Chhote Raja Patle, Anuwanshi Sharma et al. Offline Script Identification from Handwritten Gujrati Script Documents. Int Jr of Forensic Sci. 2024;2(2):59–64.

Received : August 09, 2023         Accepted : November 01, 2023          Published : December 01, 2024

Abstract

This study focuses on offline script identification of handwritten Gujarati script documents using Optical Character Recognition (OCR) techniques. The goal is to develop an efficient system capable of accurately identifying the Gujarati script from handwritten documents. The process begins with the collection of a diverse dataset of offline handwritten Gujarati script documents. The dataset includes various handwriting styles to ensure the model's adaptability. Ground truth labels are annotated for training and evaluation purposes. Preprocessing techniques are employed to enhance the image quality of the handwritten documents. These techniques involve noise removal, image resizing, and normalization, resulting in clearer and standardized input for the subsequent steps. OCR techniques are then applied to perform the script identification task. These techniques involve the extraction of features and patterns specific to the Gujarati script from the pre-processed images. Machine learning algorithms, such as Support Vector Machines (SVM) or Convolutional Neural Networks (CNN), are trained on the extracted features to learn the script identification patterns. The trained model is evaluated using standard performance metrics, including accuracy, precision, recall, and F1 score. The dataset is divided into training and testing sets to assess the model's effectiveness in identifying the Gujarati script. Once the model is trained and evaluated, it can be deployed for practical use. Given an input handwritten document, the OCR system utilizes its learned patterns to accurately identify and classify the Gujarati script. Overall, this study presents a concise approach to offline script identification of handwritten Gujarati script documents using OCR techniques. The proposed system shows promise in accurately reorganizing the Gujarati script, paving the way for further advancements in this field.


References
  • 1.   Prasad JR, Kulkarni UV, Prasad RS. Offline handwritten character recognition of Gujrati script using pattern matching. In 2009 3rd international conference on anti-counterfeiting, security, and identification in communication 2009 Aug 20 (pp. 611-615). IEEE.
  • 2.   Goswami MM, Mitra SK. Offline handwritten Gujarati numeral recognition using low-level strokes. International Journal of Applied Pattern Recognition. 2015;2(4):353-79.
  • 3.   h t t p s : / / a r x i v . o r g / f t p / a r x i v / papers/2009/2009.07435.pdf accessed on 20/07.2023
  • 4.   https://www.researchgate.net/profile/VinayakVi na y/ pu bl ic ation / 36 0946 905 _L an gu ag e_ Translation_for_Impaired_People_using_NLP_ Semantics/links/6294e4d2c660ab61f852a211/ Language-Translation-for-Impaired-People-usingNLP-Semantics.pdf accessed on 23/07/2023.
  • 5.   Savani M, Vadera D, Limbachiya K, Sharma A. Character Segmentation from Offline Handwritten Gujarati Script Documents. In Information and Communication Technology for Competitive Strategies (ICTCS 2021) ICT: Applications and Social Interfaces 2022 Jun 23 (pp. 61-70). Singapore: Springer Nature Singapore.
  • 6.   Pareek J, Singhania D, Kumari RR, Purohit S. Gujarati handwritten character recognition from text images. Procedia Computer Science. 2020 Jan

Funding


Author Information

Authors and Affiliatione

  • Akash Sharma
    Sanjeev Agrawal Global Educational University, India
  • Chhote Raja Patle
    Sanjeev Agrawal Global Educational University, India
  • Anuwanshi Sharma
    Galgotias University, India
  • Anita Yadav
    Sanjeev Agrawal Global Educational University, India

Conflicts of Interest

Supplementary Information

Below is the link to the supplementary material.


Rights and Permissions



About this article


Cite this article

Akash Sharma, Chhote Raja Patle, Anuwanshi Sharma et al. Offline Script Identification from Handwritten Gujrati Script Documents. Int Jr of Forensic Sci. 2024;2(2):59–64.


Licence:
Attribution-Non-commercial 4.0 International (CC BY-NC 4.0)

This license enables reusers to distribute, remix, adapt, and build upon the material in any medium or format for noncommercial purposes only, and only so long as attribution is given to the creator.


Download citation

Received Accepted Published
August 09, 2023 November 01, 2023 December 01, 2024
DOI: N\A
Keywords Offline Script IdentificationHandwritten; Gujarati ScriptDocument; OCR; Optical Character RecognitionDatasetPreprocessingFeature ExtractionMachine LearningSupport Vector MachinesSVMConvolutional Neural NetworksCNNPerformance EvaluationAccuracyPrecisionRecallF1 Score.

Article Level Metrics

Last Updated

Sunday 08 June 2025, 11:26:00 (IST)


208

Accesses

00
0
00

Citations


22
11
23

View full article metrics including social shares, article views and publishing history


Article Keywords


Keyword Highlighting

Highlight selected keywords in the article text.


Timeline


Received August 09, 2023
Accepted November 01, 2023
Published December 01, 2024

licence


Attribution-Non-commercial 4.0 International (CC BY-NC 4.0)

This license enables reusers to distribute, remix, adapt, and build upon the material in any medium or format for noncommercial purposes only, and only so long as attribution is given to the creator.


Access this article

Open access


Share