layer_1--2
layer_1--2
1 Introduction
Character recognition is winding up increasingly vital and huge in the advanced
world. It encourages people to carry out their jobs effortlessly.Handwritten char-
acter recognition (HCR) is most challenging and demanding research area as far
as image processing is concerned. Goal of this research is to facilitate automation
in order to minimize human efforts. Handwritten character recognition (HCR)
mainly divided into two classes: On-line HCR and Off-line HCR. In off-line HCR
method the image of the written character is detected from a paper by optical
scanning called scanner. In on-line HCR method input is detected by movement
of pen tip called Digitizer. Handwritten character recognition is very complex
c Springer Nature Singapore Pte Ltd. 2019
K. C. Santosh and R. S. Hegadi (Eds.): RTIP2R 2018, CCIS 1037, pp. 37–48, 2019.
https://doi.org/10.1007/978-981-13-9187-3_4
38 S. P. Deore and A. Pravin
due number of reasons [20]. Firstly, shape similarity between characters. Sec-
ondly, handwriting of a person is different in various circumstances. Thirdly,
noise present while collection of data. With this we also have to consider large
variations in stroke primitives due to different handwriting styles. In English
language there are only 26 characters but Indian scripts consist of vowels and
consonants total as well as Compound characters so recognition of Indian lan-
guages is challenging as compared to English language. The proposed work done
for Devanagari scripts mainly adopted for different languages such as Marathi,
Hindi, Sanskrit, Nepali etc. Devanagari scripts consist of 12 vowels and 36 basic
consonants and 10 numeral characters. So there is challenge to create a software
system that will recognize handwritten Devanagari characters of different font
size, font style and shape.
In Fig. 1 shows different samples of Devanagari Handwritten characters. We
can observe that for same characters also there exist variations in each sam-
ple due different handwriting styles and their number of strokes. We can also
observed shape similarity for some characters.
The paper covers literature review in Sect. 2 and proposed work in Sect. 3.
Section 4 is about Data Collection followed by the Preprocessing in Sect. 5.
Section 6 describes methods for feature extraction followed by classifier used for
the recognition in Sect. 7. Section 8 elaborates the results followed by conclusion
in Sect. 9.
2 Literature Review
Digitizers are mostly electromagnetic-electrostatic tablets, which send the coordi-
nates of the pen tip to the computer at regular intervals. The on-line handwriting
On-Line Devanagari Handwritten Character Recognition System 39
3 Proposed Work
We implemented a system which is used to recognize 12 vowels of Handwritten
Devanagari character. In India the Devanagari script used by masses especially in
Gujarat, Maharashtra, North India, Madhya Pradesh and West Bengal. Devana-
gari script is read and written from left to right direction. Here input image is
captured using a pen tablet and while writing we have recorded its x and y coor-
dinates and pressure information. Then captured on-line information of image
is stored in text file. This information is passed to feature extraction phase for
calculating features of different characters, which is implemented using various
non-orthogonal Moments.Then neural network is applied to classify characters.
Figure 2 depicts architecture of proposed Character Recognition system.
4 Dataset Creation
On-line handwritten data is inputted with a digitizing pen tablet and special
stylus. These pen movements gives us (x, y) pixel coordinates of a character
image. These coordinates, along with the pressure information are stored in text
file. Out of which only x and y coordinates information is used by our on-line
HCR system. Figure 3 shows GUI of our developed application and Fig. 4 shows
contents of text file. No restriction was forced on the content or style of writing,
the only constraint is the requirement of isolated characters. We have collected
100 samples of each 12 characters from different persons. We have created our
own on-line data set of 1200 characters.
On-Line Devanagari Handwritten Character Recognition System 41
5 Preprocessing
Preprocessing is one of the essential phase of handwriting recognition. It removes
the noise introduced due to software and device constraints thus improving over-
all recognition rate. When user writes a character its on-line information (i.e. x
and y coordinate information) is captured in timely basis form. We are storing
N no. of samples in text file. We are taking on-line data for feature calculation
in which very less noise is present. So that not much preprocessing is required
for on-line data set. We have done only sample normalization and thinning oper-
ation on image. Figure 5(a) shows original input character and (b) shows same
processed character.
6 Feature Extraction
In the process of handwriting recognition, it is important to identify correct feature
set. Feature extraction is very important phase for efficient data representation and
for further processing. To differentiate between one class to another a set of char-
acteristics are mined. This would minimize the intricacies and improve precision
of algorithm. Moments are shape descriptors used to characterize the shape and
size of the image. Moments provide important properties of image like area, cen-
troid, orientation etc. Main advantage of moment feature is their ability to provide
invariant measures of shape. We have implemented recognition system by using 1-
D Moments as feature extraction method using on-line data set.
Where i = 1..N, X(i) denotes the x pixel array, Y(i) is y pixel array, N denotes
number of samples and p denotes order of moment. We calculated 4 features of
complex moment.
Where i = 1..N, X(i) denotes the x pixel array, Y(i) is y pixel array, N denotes
number of samples and p denotes order of moment. Where,
m10 m01
X̄ = and Ȳ = (8)
m00 m00
To handle deviation in size of character image the central moments is normalized
using below expression:
μpq p+q
ηpq = and γ= +1 (9)
μ00 γ 2
For p, q = 0,1,.. and for (p+q) = 2,3,..
As per above transformation done in central moment, Hu’s moments can be
derived as follows [12]:
φ1 = η20 + η02 (10)
φ2 = (η20 − η02 )2 + 4η11
2
(11)
φ3 = (η30 − 3η12 )2 + (3η21 + η03 )2 (12)
2 2
φ4 = (η30 + η12 ) + (η21 + η03 ) (13)
2 2
φ5 = (η30 − 3η12 ) (η30 + η12 ) (η30 + 3η12 ) − 3 (η21 + η03 )
2 2
+ (3η21 − η03 ) (η21 + η03 ) 3 (η30 + η12 ) − (η21 + η03 ) (14)
φ6 = (η20 − η02 )[(η30 + η12 )2 − (η21 + η03 )2 ] + 4η11 (η30 + η12 )(η21 + η03 )(15)
N
(f eature(i) − mean(i))
[ ] (17)
i=1
σ(i)
Data used for the present work were collected from different individuals. Class
1 consists of samples of character of 100 different peoples; class 2 consists
of samples of character and so on. Like this we have total 12 classes for
12 Devanagari characters so we have total 1200 samples of Devanagari basic
characters (only vowels) for the experiment of the proposed work. Table 1 show
On-Line Devanagari Handwritten Character Recognition System 45
9 Conclusion
Implemented HCR system which will recognize handwritten Devanagari charac-
ters using on-line data and moments as feature. We collected 100 samples of each
12 characters from different persons so our data set consists of total 1200 charac-
ters. Out of 100 samples we used 80 samples for training Neural Network and 20
samples for testing. We observed that: (i) Central Moment (CM) gives the high-
est recognition rate 71% and Complex Moment (COM) gives lowest recognition
rate 18% for on-line features compared to other moments when each moment is
applied separately (ii) Combination of any two moments improves the recogni-
tion rate. (iii) Combination of all 1-D moments gives recognition rate to 90%
(iv) 1-D Moments can also use as feature for on-line system which is not influ-
enced by stroke sequence. For future work we plan to recognize the character set
On-Line Devanagari Handwritten Character Recognition System 47
References
1. Shelke, S., Apte, S.: A fuzzy based classification scheme for unconstrained hand-
written Devanagari character recognition. In: International Conference on Commu-
nication, Information and Computing Technology (ICCICT), pp. 1–6. IEEE Press,
Mumbai (2015). https://doi.org/10.1109/ICCICT.2015.7045738
2. Wang, Y., Ding, X.: Topic language model adaption for recognition of homologous
on-line handwritten Chinese text image. IEEE Signal Process. Lett. 21, 550–553
(2014). https://doi.org/10.1109/LSP.2014.2308572
3. Deore, S., Ragha, L.: Moment based online and online handwritten character recog-
nition. CiiT Int. J. Biometrics Bioinf. 3, 111–115 (2011). BB032011004
4. Omer, M., Ma, S.: Online Arabic handwriting character recognition using matching
algorithm. In: 2nd International Conference on Computer and Automation Engi-
neering (ICCAE), pp. 259–262. IEEE Press, Singapore (2010) . https://doi.org/
10.1109/ICCAE.2010.5451492
5. Prasad, J., Kulkarni, U., Prasad, R.: Offline handwritten character recognition of
Gujarati script using pattern matching. In: 3rd International Conference on Anti-
counterfeiting, Security, and Identification in Communication, pp. 611–615, IEEE
Press, Hong Kong (2009). https://doi.org/10.1109/ICASID.2009.5276999
6. Radhika, K., Venkatesha, M., Shekar, G.: On-line signature authentication using
Zernike moments. In: 3rd International conference on Biometrics: Theory, applica-
tions and systems, pp. 109–112. IEEE Press, Washington (2009). https://doi.org/
10.1109/BTAS.2009.5339022
7. Sharma, A., Kumar, R.: On-line handwritten Gurmukhi character recognition
using elastic matching. In: Congress on Image and Signal Processing (CISP), pp.
391–396. IEEE Press, Sanya (2008). https://doi.org/10.1109/CISP.2008.297
8. More, V., Rege, P.: Devnagari handwritten numeral identification based on Zernike
moments. In: IEEE Region 10 Conference (TENCON), pp. 1–6. IEEE Press, Hyder-
abad (2008). https://doi.org/10.1109/TENCON.2008.4766863
9. Shu, H., Luo, L.: Moment-based approaches in imaging part 1 basic features. IEEE
Eng. Med. Biol. Mag. 26, 70–74 (2007)
10. Connell, S.D., Sinha, R., Jain, A.: Recognition of unconstrained on-line Devanagari
characters. In: 15th International Conference on Pattern Recognition, pp. 368–371.
IEEE Press, Barcelona (2000). https://doi.org/10.1109/ICPR.2000.906089
11. Liao, S., Pawlak, M.: On image analysis by Moments. IEEE Trans. Pattern Anal.
Mach. Intell. 18, 254–266 (1996). https://doi.org/10.1109/34.485554
12. Agui, T., Takahashi, H., Nagahashi, H.: Recognition of handwritten katakana in a
frame using moment invariants based on neural network. In: IEEE International
Joint Conference on Neural Networks, pp. 659–664. IEEE Press, Singapore (1991).
https://doi.org/10.1109/IJCNN.1991.170475
13. Santosh, K., Nattee, C., Lamiroy, B.: Relative positioning of stroke based cluster-
ing: a new approach to on-line handwritten Devangari character recognition. Int. J.
Image Graph. (IJIG) 12, 1–24 (2012). https://doi.org/10.1142/S0219467812500167
14. Santosh, K., Iwata, E.: Stroke-Based Cursive Character Recognition. IntechOpen
(2012). https://doi.org/10.5772/51471
48 S. P. Deore and A. Pravin
15. Santosh, K., Nattee, C., Lamiroy, B.: Spatial similarity based stroke number and
order free clustering. In: 12th International Conference on Frontiers in Handwriting
Recognition (ICFHR), pp. 652–657. IEEE Press, Kolkata (2010). https://doi.org/
10.1109/ICFHR.2010.107
16. Santosh, K.: Character recognition based on DTW-radon. In: 11th International
Conference on Document Analysis and Recognition (ICDAR), pp. 264–268. IEEE
Press, Beijing (2011). https://doi.org/10.1109/ICDAR.2011.61
17. Santosh, K., Wendling, L.: Character recognition based on non-linear multi-
projection profiles measure. Front. Comput. Sci. 9, 678–690 (2015). https://doi.
org/10.1007/s11704-015-3400-2
18. Santosh, K.C., Nattee, C.: Stroke number and order free handwriting recogni-
tion for Nepali. In: Yang, Q., Webb, G. (eds.) PRICAI 2006. LNCS (LNAI), vol.
4099, pp. 990–994. Springer, Heidelberg (2006). https://doi.org/10.1007/978-3-
540-36668-3 120
19. Deore, S.P., Pravin, A.: Ensembling: model of histogram of oriented gradient based
handwritten Devanagari character recognition system. Traitement du Signal 34,
7–20 (2017). https://doi.org/10.3166/ts.34.7-20
20. Jagtap, A.B., Hegadi, R.S.: Offline handwritten signature recognition based on
upper and lower envelope using eigen values. In: World Congress on Computing
and Communication Technologies (WCCCT), pp. 223–226. IEEE (2017)