Preview

Two-Stage Rejection Algorithm to Reduce Search Space for Character Recognition in Ocr

Powerful Essays
Open Document
Open Document
2858 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Two-Stage Rejection Algorithm to Reduce Search Space for Character Recognition in Ocr
Two-Stage Rejection Algorithm to Reduce Search Space for Character Recognition in OCR

Srivardhini Mandipati, Gottumukkala Asisha, Preethi Raj S, and Chitrakala S

Department of Computer Science and Engineering, Easwari Engineering College, Chennai, India

Abstract. Optical Character Recognition converts text in images into a form that the computer can manipulate. The need for faster OCRs stems from the abundance of such text. This paper presents a Two-Stage Rejection Algorithm for reducing the search space of an OCR. It is tacit that the reduction in search space expedites an OCR. Preprocessing operations are applied on the input and features are extracted from them. These feature vectors are clustered and the Two-Stage Rejection Algorithm is applied for character recognition. With about the same character recognition rate as other OCRs, an OCR reinforced with the Two-Stage Rejection Algorithm is considerably faster.

Keywords: Optical Character Recognition, Feature Extraction, K-means.
1 Introduction Optical character recognition has been an active area of research for many decades. The fact that OCRs have the potential to simplify data entry in the future adds value to research in this area. OCRs use various pattern matching techniques for character recognition. Most OCRs typically use classifiers like SVM or neural networks for character recognition. The training process for these classifiers is time consuming. Moreover, with an increase in the number of classes, the comparisons made increases and consequently the time taken for character recognition increases. Hence, they cannot be easily extended to recognize characters from additional languages. The proposed system uses a structural approach as opposed to statistical approach for feature extraction. The strength of the structural method over the statistical one is its representation of a pattern that is similar to the way human perceive it. The structural features help



References: [1]GWeijie Su, Xin Jin, “Hidden Markov Model with Parameter-Optimized K-means Clustering for Handwriting Recognition”, International Conference on Internet Computing and Information Services, pp:435-438, 2011 [2]Karthik Sheshadri, Pavan Kumar T Ambekar, Deeksha Padma Prasad and Dr.Ramakanth P Kumar, “An OCR system for Printed Kannada using K-means clustering”, International Conference on Industrial Technology ,pp:183-187, 2010 [3]Mu-King Tsay, Keh-Hwashyu, Pao-Chung Chang, “Feature Transformation with Generalized Learning Vector Quantization for Hand-Written Chinese Character Recognition”, IEICE Transactions on Information & System, Vol.E82-D, 1992 [4]B. Vijay Kumar, A. G. Ramakrishnan, “Radial Basis Function And Subspace Approach For Printed Kannada Text Recognition”, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp: V-321-4 vol.5, 2004 [5]Premnath Dubey, Wasin Sinthupinyo, “New Approach on Structural Feature Extraction for Character Recognition”, International Symposium on Communications and Information Technologies, pp:946-949, 2010 [6]Igor Kleiner, Daniel Keren, Llan Newman, Oren Ben-Zwi,“Applying property testing to an image partitioning problem”, IEEE Transactions On Pattern Analysis And Machine Intelligence, Vol. 33, No.2, 2011 [7]Sanghamitra Mohanty, Himadri Nandini Dasbebartta, Tarun Kumar Behera, “An Efficient Bilingual Optical Character Recognition(English-Oriya) System for Printed Documents”, Seventh International Conference on Advances in Pattern Recognition, pp: 398 – 401, 2009 [8]Oivind Due Trier, Anil K Jain, and Torfinn Taxt ,“Feature Extraction Methods For Character Recognition–A Survey ”, Pattern Recognition, Vol 29, pp 641-662, 1995 [9]Vuokko Vuori, Jorma Laaksonen , “A Comparison of Techniques for Automatic Clustering of Handwritten Characters”, 16th International Conference on Pattern Recognition, Vol 3, pp:168-171, 2002

You May Also Find These Documents Helpful

  • Powerful Essays

    Why Did You Kill Me?

    • 2033 Words
    • 9 Pages

    Layton, Julia. "How Handwriting Analysis Works." HowStuffWorks. Discovery Communications, Oct. 2008. Web. 07 Aug. 2012.…

    • 2033 Words
    • 9 Pages
    Powerful Essays
  • Good Essays

    Biometrics technology aims at utilizing major and distinctive characteristics such as behavioral or biological, for the sake of positively indentifying people. With the help of a combination of hardware and specific identifying sets of rules, a basic human attribute, automated biometric recognition mimics to distinguish and categorize other people as individual and unique. But the challenges surrounding biometrics are great as well.…

    • 1008 Words
    • 5 Pages
    Good Essays
  • Good Essays

    There is a massive disagreement in the present day concerning the biometric identification technology which is used to boost the security through travel. The research inside these technologies has been used to extend ways in how the individuals identify faces for detection and develop the similar strategy in a replicated mechanical system that will scan faces and conclude their likeness with those in a database.…

    • 990 Words
    • 4 Pages
    Good Essays
  • Good Essays

    Spatial Analysis

    • 453 Words
    • 2 Pages

    Text/graphics separation works very smoothly in this case (Fig. 18), as do vectorization and arcs detection (Fig. 19) and loop extraction.…

    • 453 Words
    • 2 Pages
    Good Essays
  • Satisfactory Essays

    (optical character recognition), and specify the language setting for OCR. Click OK to close the…

    • 305 Words
    • 2 Pages
    Satisfactory Essays
  • Powerful Essays

    How should Red Bull market its brand in the future? I think, although Red Bull has been extremely successful in the past, times have changed and the company and products should change with it, otherwise we probably lose market share to the tremendous increased number of competitors in no time.At the height of early mornings and late nights, Red Bull energy drink became the fuel of choice for people from all walks of life. So how is Red Bull marketing its brand to meet the changing needs and budgets of its customers? How will the privately owned Austrian company expand its product line beyond the silver-bullet beverage that "gives you wings"? My conclusion is that we should focus on direct marketing and use this to bring in a more diverse population of users.…

    • 1379 Words
    • 5 Pages
    Powerful Essays
  • Powerful Essays

    Decoding. Recognizing the pronunciation of printed words by applying the many correspondences between particular letters and phonemes (Neuman & Dickinson, 2003).…

    • 2364 Words
    • 10 Pages
    Powerful Essays
  • Powerful Essays

    Gcvmdbkjvdhf

    • 3380 Words
    • 14 Pages

    OCR Advanced Subsidiary GCE in Applied ICT: H115/H315 Candidate A: Unit G040: Using ICT to communicate…

    • 3380 Words
    • 14 Pages
    Powerful Essays
  • Better Essays

    Extinct Smilodon

    • 1325 Words
    • 6 Pages

    Biederman, Patricia. “Tar-Pit Bones Show Ailments of Extinct Cats” Los Angeles Times 11 June 1989. 20 April 2013. <http://articles.latimes.com/1989-06-11/news/we-2965_1_saber-toothed-cat-tar-pits-george-c-page-museum>>…

    • 1325 Words
    • 6 Pages
    Better Essays
  • Powerful Essays

    An optical scanner is a hardware input device that allows a user to take an image or text and convert it into a digital file, allowing the computer to read or display the scanned object. Features of the optical scanner include-its design, software compatibility, type of feeder, speed, resolution, etc. Some advantages of the optical scanner include :…

    • 2633 Words
    • 11 Pages
    Powerful Essays
  • Good Essays

    Question: Compare and contrast spelling (orthography) and phonetic transcriptions. Your terms should be well defined and your discussion should be well illustrated. Write full sentences in Standard English.…

    • 1393 Words
    • 6 Pages
    Good Essays
  • Powerful Essays

    In this document we analyse the two programs that are presently in operation in the Textprint…

    • 3135 Words
    • 22 Pages
    Powerful Essays
  • Satisfactory Essays

    R.O.G Sdn Bhd aims to become the market leader in hand-held text scanning consumer electronic products. The product and its future derivatives are aimed for rich variety of market niches including students, researchers, professionals, manager etc. providing innovation through product and service enhancements; including wireless capabilities and smartphone applications.…

    • 306 Words
    • 2 Pages
    Satisfactory Essays
  • Satisfactory Essays

    cs 322

    • 492 Words
    • 2 Pages

    An algorithm is a large-scale continuous study and research for the most time-convenient and resource-efficient mode of systematically doing things accurately. It predates the existence of computers. As such, algorithms arise more significantly as compared to computing technology.…

    • 492 Words
    • 2 Pages
    Satisfactory Essays
  • Satisfactory Essays

    Table of content Chapter 1 1.1 Introduction ………………………………………… page 5 Chapter 2 2.1.0 Trends of Industry …………… ……………………. 2.1.1 Increasing in the demand for petrochemical product ……page 6 2.1.2 Lesser price elasticity ………………………………………page 6 2.1.3 Oversupply ………………………………………………….page 6, 7 2.1.4 Lean and Mean ……………………………………………..…

    • 9659 Words
    • 39 Pages
    Satisfactory Essays