International Journal of Engineering Research & Technology (IJERT)
ISSN: 2278-0181
Vol. 4 Issue 06, June-2015
Segmentation and Detection of Text in Natural Scene Images

Meghana Thodaskar N R, M.Tech, ISE, PESIT, Bangalore, VTU, India
Rama Devi P, Assistant Professor, ISE, PESIT, Bangalore, VTU, India
Abstract— Text present in natural scene images carries useful information, such as text-based landmarks. Extraction of text from scene images involves several stages, and every stage is equally important for efficient results: detecting the text, localising the text, segmentation, and recognition are the key steps. Extraction of text from scene images is difficult because size, orientation, and alignment vary from one image to another, which makes the task challenging. Moreover, text in such images is not confined to any page layout, and its location within the image is random in nature. In addition, motion blur, non-uniform illumination, skew, occlusion, and scale-based degradations increase the complexity of locating and recognizing the text in a scene/born-digital image. In our proposed method we use the Otsu binarization technique, applied to the separated R, G, and B channels; the text is then extracted based on connected component information. The method is implemented to handle both daylight and night scene images.

Keywords— Connected Components, Dilation, Otsu Method, Logical "And" Operation, Morphological Operations, RGB Channels

I. INTRODUCTION

In the field of computer vision, document analysis and recognition (DAR) has a history of more than four decades; this duration itself indicates the complexity involved in the field. DAR has expanded rapidly by frequently adding new sub-fields for innovative research, which keeps its evolution interesting. The field deals with documents produced by photocopying, printing, scanning and image-capturing technologies. Completely recognizing the content of a document is an appealing idea, yet it remains a task that is difficult for a machine to achieve, unlike the human brain. Researchers have developed a large number of applications as technology has advanced; photocopying (Xerox), for example, revolutionized the field of documents, yet the photocopying machine does not possess this recognition capability. Today, optical character recognition (OCR) engines perform this task reasonably well by creating a soft copy of the document.

Along with its many applications, the scanner has been of great use. It was invented to convert printed, handwritten and historical documents into digital format, which helps the conversion process in archives. Depending on the required scanning resolution, a single scanned image can consume a large amount of storage space; there is a huge difference between the storage space consumed by a scanned image and the same information stored as coded text. This has motivated many researchers to take up this topic for their research work. The main goal is to work on methods that recognize the textual content of a scanned image. Pattern recognition revealed the complexities of recognizing Roman characters, and digital documents gave birth to the analysis of their actual content, such as text or data mining, information retrieval, and relations between documents.

In today's world nearly everybody has a mobile phone. With the availability of low-cost cameras and camera phones, anyone can create camera-captured documents. Portable handheld devices overcome the limitations of a scanner and also increase the range of documents that can be captured. Imaging the text on a notice board behind a glass frame is an example of a scanner's limitation that can be overcome by using a mobile device; the mobility of cameras makes it possible to capture the contents of a notice board from any reasonable distance. Billboards and signboards are further examples of camera-captured documents. Usually a captured image contains less text than scene content, and text may or may not be present in a given scene. Therefore, detecting the presence or absence of text in a scene image is a major problem, called the text detection problem; it is almost like asking a blind person to analyze the scene. To reduce this complexity, the problem itself is broken down into parts such as text localization and recognition.
Figure 1: Sample born-digital/scene images from the KAIST dataset.

II. RELATED WORK

Hongliang et al. [1] describe an efficient technique for locating and extracting a license plate and recognizing each segmented character [4]. The proposed model is subdivided into four parts: digitization of the image, edge detection, separation of characters, and template matching. Morphological operations with a structuring element (SE) are used to eliminate non-license-plate regions and enhance only the plate region. Character segmentation is done using connected component analysis, and a correlation-based template matching technique is used for recognition of the characters.

Bai et al. [2] present a method for Chinese text recognition in images/videos. The method differs from existing ones, which binarize text images, feed the binarized image to an OCR engine and get the recognized results. The proposed scheme performs recognition directly on the gray pixels, followed by segmentation, building a recognition graph, Chinese character recognition and beam-search determination. The advantages are that it does not depend on the performance of binarization, which is imperfect in practice and thus degrades OCR performance, and that the grayscale image carries more information about the text, which in turn helps improve recognition rates.

Neha et al. [3] employ a discrete wavelet transform (DWT) method to extract text information from complex backgrounds. The input can be a color or a grayscale image. The Sobel edge detection method is used to find the edges in each sub-image, and the obtained results form an edge map. In the next step, morphological operations are applied to the edge map, and thresholding is further applied to improve performance.

Dutta et al. [4] base their method on gradient information and edge map selection. As an initial step, the algorithm finds the gradient of the image and then enhances the gradient information. Next, the enhanced gradient image is binarized, and the edges are selected by taking the intersection of the edge map with the binary information of the enhanced gradient image; the edge map is generated with the Canny edge detector. The selected edges are then morphologically dilated and opened using suitable structuring elements and used as text regions. To identify the boundaries of the text regions, projection profile analysis is performed.

Sivasankaran et al. [5] use grayscale transformation and smoothing with a median filter as preprocessing steps. Canny edge detection and a Gaussian filter are used to remove weak edges. The text part is then extracted with dilation and connected component labelling techniques.

Angadi et al. [6] propose a method based on texture analysis using the discrete cosine transform (DCT). A high-pass filter is used to remove uniform background; the resulting texture features are then computed on each 50×50 block of the input, and strong text blocks are identified using discriminant functions. At last, the detected text blocks are merged to obtain the extracted text regions.

Wei et al. [7] use a pyramidal approach to detect text in video images with variations of background, text size, font and colour. In the first step, two down-sized images are obtained from the original image. Then the gradient difference is calculated for the three differently sized images, and k-means clustering is applied to separate the text pixels. Next, the boundaries of candidate text regions are determined using projection profile analysis. Finally, text candidates are identified using two verification phases, one based on geometric properties and another based on the DWT. Principal component analysis is used to reduce the dimensionality of these features, and an SVM classifies text versus non-text.

III. PROPOSED METHOD

The algorithm and flowchart of the proposed method using Otsu binarization are shown below.

Algorithm 1
Step 1: Read the input image.
Step 2: Resize the input image to 530×600.
Step 3: Separate the R, G, B channels.
Step 4: Apply Otsu binarization on each individual plane.
Step 5: Use the complement form to identify text of inverse polarity on each binarized plane.
Step 6: Perform a logical "and" operation to combine the results of Steps 4 and 5.
Step 7: Use morphological operations for segmentation.
Step 8: Binarize the edge image, enhancing only the text regions against a plain black background.
Step 9: Apply a bounding box to localize the text regions using connected components.
Step 10: Final result.
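To make the early steps concrete, the following is a minimal Python/OpenCV sketch of Steps 1-5 under our own assumptions: the file name scene.jpg is a placeholder, and OpenCV is our illustrative choice of library, not something prescribed by the paper. The later steps are detailed in the sections that follow.

import cv2

# Sketch of Steps 1-5 of Algorithm 1 (illustrative only).
img = cv2.imread("scene.jpg")              # Step 1: read input image
img = cv2.resize(img, (530, 600))          # Step 2: dsize is (width, height)
b, g, r = cv2.split(img)                   # Step 3: OpenCV stores images as BGR

planes, complements = [], []
for chan in (r, g, b):
    # Step 4: Otsu binarization on each individual plane; passing
    # THRESH_OTSU makes OpenCV pick the threshold automatically.
    _, binarized = cv2.threshold(chan, 0, 255,
                                 cv2.THRESH_BINARY + cv2.THRESH_OTSU)
    planes.append(binarized)
    # Step 5: complement form, to capture text of inverse polarity.
    complements.append(cv2.bitwise_not(binarized))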
Figure 2: Flow chart of the proposed method: pre-processing (extract RGB channels) -> Otsu thresholding -> complement form to identify text of inverse polarity -> logical "and" operation -> morphological operations -> apply bounding box using connected components to locate the text.

A. Preprocessing

The proposed method is based on color-based character extraction. Color information is important because related characters in a text usually have almost the same color for a given instance encountered in the scene. The color of each pixel is determined by the combination of the red, green, and blue intensities stored in each color plane at the pixel's location. In the RGB model, an image consists of three independent image planes, one for each of the primary colors: red, green and blue. The pixel values of the three (red, green and blue) planes are used as feature vectors.

Algorithm 2
Step 1: Read the input image.
Step 2: Resize the input image to 550×600.
Step 3: Separate the R, G, B channels.

The extracted R, G, B channels are shown in Figure 3. We apply the Otsu binarization technique on each extracted R, G, B plane; this step is explained in detail in Section B.

Figure 3: (a)(b)(c) Extracted R, G, B channels.

B. Otsu Thresholding

Each plane is separately segmented using Otsu global thresholding. Otsu binarization is an effective method for segmentation when the variations in lighting and colour are minimal. The histogram of the image is used to arrive at the threshold that maximizes the discrimination value; the values in the histogram are normalized before the discrimination value is calculated. At each gray value, the histogram is split into two parts, and the mean and weight of each histogram part are calculated, along with the discrimination value. The gray value at which the discrimination value peaks is used as the global threshold. The weighted sum of variances at a candidate threshold t is calculated using the formula

    σw²(t) = w0(t)·σ0²(t) + w1(t)·σ1²(t)

where w0, w1 are the weights and σ0², σ1² the variances of the two histogram parts obtained by splitting at t.

The results of Otsu's binarization of the colour channels are shown below.

Figure 4: Results of Otsu's binarization of the colour channels for one of the sample images in Figure 1: (a) binarized red plane, (b) binarized green plane, (c) binarized blue plane.
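For reference, here is a minimal NumPy sketch of the threshold search just described: it scans all gray values and keeps the one that minimizes the weighted sum of within-class variances, which is equivalent to maximizing the discrimination value. The helper name otsu_threshold is our own; in practice a library call such as OpenCV's THRESH_OTSU would be used instead.

import numpy as np

def otsu_threshold(channel: np.ndarray) -> int:
    """Exhaustive Otsu search over a 2-D uint8 channel: at each candidate
    threshold t the histogram is split in two, the weight, mean and
    variance of each part are computed, and the t minimizing the
    weighted sum of within-class variances is returned."""
    hist = np.bincount(channel.ravel(), minlength=256).astype(float)
    hist /= hist.sum()                       # normalized histogram
    levels = np.arange(256)
    best_t, best_score = 0, np.inf
    for t in range(1, 256):
        w0, w1 = hist[:t].sum(), hist[t:].sum()
        if w0 == 0 or w1 == 0:               # one class empty: skip
            continue
        mu0 = (levels[:t] * hist[:t]).sum() / w0
        mu1 = (levels[t:] * hist[t:]).sum() / w1
        var0 = (((levels[:t] - mu0) ** 2) * hist[:t]).sum() / w0
        var1 = (((levels[t:] - mu1) ** 2) * hist[t:]).sum() / w1
        score = w0 * var0 + w1 * var1        # sigma_w^2(t) from Section B
        if score < best_score:
            best_t, best_score = t, score
    return best_t

# Usage (illustrative): t = otsu_threshold(red_plane)
#                       binary = (red_plane > t).astype(np.uint8) * 255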
We observed that applying Otsu separately to the RGB components often recovers lost text efficiently. Otsu's method as described above converts several pixels from foreground to background and vice versa.

C. Complement form to identify the text of inverse polarity

A morphological open operation is applied to each binarized plane, and the complement of these images is taken to eliminate image noise. The complement version of the results of Figure 5 is shown in Figure 6. It is observed that the part of the text that was merged with the background becomes foreground in the complement image and can be successfully captured. In the complement of an RGB image, each pixel value is subtracted from the maximum pixel value and the difference is used as the pixel value in the output image; dark areas become lighter and vice versa.

Figure 5: (a)(b)(c) Dilation on the binarized red, green and blue channels. Text that is lost in binarization in one color plane is captured in some other color plane.

Figure 6: (a)(b)(c) Complement versions of the results in Figure 5(a)(b)(c). Part of the text that was merged with the background becomes foreground in the complement image and can be successfully captured.

D. Logical operations

As a next step we apply the "and" operation: each binarized plane is combined with its complement version. This step identifies the characters as they appear in the original image, and is done by multiplying the resultant image (Figure 5) with the binary-converted complement image (Figure 6). In this method, pixels having value 1 represent text and pixels having value 0 represent background; however, the final image may still contain some non-text parts. The final result is white text on a black background or vice versa, depending on the original image. Hence, the text present in an image is well segmented from the background. Segmentation of a scene image into text and background is usually referred to as binarization, where grayscale intensities are classified into two groups, foreground white (text) pixels and background black pixels [13] [14]. The result is shown in Figure 7.

Figure 7: Combination of each binarized plane and its complement version after connected component analysis.

E. Morphological operations and detection of text in the image

Using the morphological dilation operation we can localize text in scene images. The dilation of an image a by an SE b produces a new binary image k = a ⊕ b with ones at all locations (x, y) of the SE origin at which the SE hits the input image a, i.e., k(x, y) = 1 if b hits a at (x, y) and 0 otherwise, repeating for all coordinates (x, y) [12]. Let a^c denote the complement of image a, i.e., the image produced by replacing 1 with 0 and vice versa. Formally, the duality between dilation and erosion is written as:

    (a ⊕ b)^c = a^c ⊖ b_rot        (2)

where b_rot is the SE b rotated by 180°. If an SE is symmetrical with respect to rotation, then b_rot does not differ from b [8]. Morphological dilation is used for this purpose because dilation adds pixels to the boundaries of objects in an image, thereby thickening the objects; the amount of thickening is determined by the type and size of the SE. A properly sized SE should be chosen so that as little non-text area as possible is clustered in. Here, a "line" structuring element (a line at an angle of 150 degrees) is used. The localized text regions obtained are shown below.

Figure 8: Localization of text.
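As a sanity check of the dilation definition and the duality in Eq. (2), the following self-contained NumPy sketch implements textbook dilation and erosion on a small random binary image and verifies that (a ⊕ b)^c equals a^c ⊖ b_rot. The small diagonal 3×3 "line" SE is an illustrative stand-in for the larger line element used in the paper.

import numpy as np

def dilate(a, b):
    """Textbook dilation a (+) b: output is 1 where the reflected SE,
    anchored at the pixel, hits at least one 1-pixel of a."""
    H, W = a.shape; h, w = b.shape
    cy, cx = h // 2, w // 2                  # SE origin at its centre
    out = np.zeros_like(a)
    for y in range(H):
        for x in range(W):
            for i in range(h):
                for j in range(w):
                    yy, xx = y + cy - i, x + cx - j
                    if b[i, j] and 0 <= yy < H and 0 <= xx < W and a[yy, xx]:
                        out[y, x] = 1
    return out

def erode(a, b):
    """Textbook erosion a (-) b: output is 1 where the SE, anchored at
    the pixel, fits entirely inside the 1-pixels of a. Pixels outside
    the image count as 1 here, matching dilation's zero padding of a,
    so the duality also holds at the borders."""
    H, W = a.shape; h, w = b.shape
    cy, cx = h // 2, w // 2
    out = np.ones_like(a)
    for y in range(H):
        for x in range(W):
            for i in range(h):
                for j in range(w):
                    if b[i, j]:
                        yy, xx = y + i - cy, x + j - cx
                        if 0 <= yy < H and 0 <= xx < W and not a[yy, xx]:
                            out[y, x] = 0
    return out

rng = np.random.default_rng(0)
a = (rng.random((12, 12)) > 0.7).astype(np.uint8)
b = np.array([[0, 0, 1],
              [0, 1, 0],
              [1, 0, 0]], dtype=np.uint8)    # small diagonal "line" SE
b_rot = b[::-1, ::-1]                        # SE rotated by 180 degrees

lhs = 1 - dilate(a, b)                       # (a (+) b)^c
rhs = erode(1 - a, b_rot)                    # a^c (-) b_rot
assert np.array_equal(lhs, rhs)              # the duality of Eq. (2)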
After localizing the text, a bounding box is used to detect the text regions; once the bounding boxes are placed, the text regions can easily be extracted. As expected, many of the extracted connected components do not actually contain text characters. At this point, simple rules are used to filter out the false detections: we use the aspect ratio and area size to decrease the number of non-character candidates [9]. Wi and Hi are the width and height of an extracted area, and Δx and Δy are the distances between the centers of gravity of each area. The aspect ratio is computed as width / height. These rules further eliminate, from all the detected connected components, those that do not actually correspond to text characters. The selected connected components form the segmented text at the pixel level, and the bounding box list provides the localisation of text in the given image. The detected text is placed in a green boundary; the result is shown in Figure 9. Hence, opening an object A with a linear structuring element B can effectively identify the horizontal line segments present in a connected component.

Figure 9: Detected text placed in a green boundary.

IV. RESULTS AND DISCUSSION

To evaluate the performance of the proposed text detection method, we used the publicly available KAIST [10] and SVT [11] datasets. Each image contains approximately four characters of different font styles and sizes. The results of the system for 500 images are given in Table 1.

TABLE 1: Summary of tested images (segmentation and detection).

Dataset   Number of Images   Segmented   Detected
KAIST     200                155         105
SVT       300                245         135

The experimental results are shown in Table 2. In the second row (second image), even though the text is well segmented from the image, the method fails to detect all text regions due to light reflection in the image; hence, the method is limited when extracting text on glass surfaces. On the rest of the failure cases, the algorithm either partially extracted the relevant text components or extracted text along with a few non-text components. High accuracy is obtained when an image has a clear background and normal font styles.

In summary, the precision and recall values of our algorithm, obtained on the present set of 500 images, are 69.8% and 71.2% respectively. The proposed algorithm works well even on slanted or curved English text components.
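The connected-component filtering and green bounding boxes described at the start of this section might be sketched as follows with OpenCV. The numeric aspect-ratio and area limits are illustrative guesses of ours, not the thresholds used by the authors.

import cv2
import numpy as np

def filter_text_components(binary: np.ndarray, scene: np.ndarray):
    """Keep connected components whose aspect ratio and area look like
    characters, and draw a green box around each survivor.
    'binary' is the segmented text image (white text on black)."""
    n, labels, stats, centroids = cv2.connectedComponentsWithStats(
        binary, connectivity=8)
    boxes = []
    for i in range(1, n):                    # label 0 is the background
        x, y, w, h, area = (int(v) for v in stats[i])
        aspect = w / float(h)                # aspect ratio = width / height
        if not 0.1 <= aspect <= 10.0:        # rule: plausible aspect ratio
            continue
        if not 20 <= area <= 0.5 * binary.size:   # rule: plausible area
            continue
        boxes.append((x, y, w, h))
        cv2.rectangle(scene, (x, y), (x + w, y + h), (0, 255, 0), 2)
    return boxes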
TABLE 2: Result images: original image, binarized image, and text-region detection.

CONCLUSION

The proposed method for text localization and segmentation of the image differs from the known algorithms. Text localisation is required for correct segmentation and detection. We used the SVT and KAIST datasets; our method was tested on more than 400 scene images and a detection rate of 83.75% was achieved. The implemented algorithm uses morphological operations and connected component analysis with spatial-domain features for the extraction and detection of text in natural scene images. However, due to the diversity of scene images, detection and segmentation perform well for images with simple font styles, medium intensity variance, and simple backgrounds. Achieving higher accuracy and addressing complex backgrounds and light intensity variance will be the scope of further research work.
REFERENCES
[1] Hongliang, Bai, and Liu Changping. "A hybrid license plate extraction method based on edge statistics and morphology." Proceedings of the 17th International Conference on Pattern Recognition (ICPR), Vol. 2, IEEE, 2004.
[2] Bai, Jinfeng, et al. "Chinese Image Character Recognition Using DNN and Machine Simulated Training Samples." Artificial Neural Networks and Machine Learning - ICANN, Springer International Publishing, 2014, pp. 209-216.
[3] Gupta, Neha, and V. K. Banga. "Image Segmentation for Text
Extraction." Proceedings of the 2nd International Conference on
Electrical, Electronics and Civil Engineering (ICEECE'2012),
Singapore, April 28-29. 2012.
[4] Dutta, A., Pal, U., Bandyopadhya, A., & Tan, C. L. (2009). Gradient based Approach for Text Detection in Video Frames.
[5] Sivasankaran, V., P. Chitra, and L. Roja. "Recognition of Text in
Mobile Captured Images Based on Edge and Connected Component
Hybrid Algorithm." International Journal of Advanced Research in
Computer Science and Electronics Engineering (IJARCSEE) 3.6
(2014): pp-358.
[6] Angadi, S. A., and M. M. Kodabagi. "Text region extraction from low
resolution natural scene images using texture features." Advance
Computing Conference (IACC), IEEE 2nd International. IEEE, 2010
[7] Wei, Yi Cheng, and Chang Hong Lin. "A robust video text detection
approach using SVM." Expert Systems with Applications 39.12 (2012):
10832-10840.
[8] Xiaoqing Liu and Jagath Samarabandu. "An edge-based text region extraction algorithm for indoor mobile robot navigation." Proceedings of the IEEE, July 2005.
[9] Xiaoqing Liu and Jagath Samarabandu. "Multiscale edge-based text extraction from complex images." IEEE, 2006.
[10] KAIST: http://www.iaprtc11.org/mediawiki/index.php/KAIST_Scene_Text_Database
[11] SVT: http://tc11.cvc.uab.es/datasets/SVT_1