Patent number 8805093 is assigned to
The following quote was obtained by the news editors from the background information supplied by the inventors: "According to widely known methods of text pre-recognition, a bit-mapped image is parsed into text and/or non-text regions, and the text regions are further divided into objects such as strings, words, character groups, characters, etc.
"Some known methods uses preliminarily document type identification for narrowing a list of possible documents types, examined in an analysis of the document logical structure
"According to this group of methods the document type identification is an independent step of document analysis, forestalling logical structure identification. At that the document type and its properties list become defined up to the moment of defining the logical structure thereof. Or wise versa, a document structure identification may be an integral part of logical structure identification process. In this case the document type that fits closer the analyzed image is selected.
"A spatial orientation direction verification is present in a number of documents.
"In the U.S. Pat. No. 5,031,225 (
"The most reliably matching model indicates the orientation direction of the image.
"The method causes a mistake in the case of possible different directions of text orientation to be present in the document. It also may cause mistake if the character is not reliably recognized after converting into image state.
"In the U.S. Pat. No. 5,235,651 (
"The method can't work if various orientation directions of text can be present on the form.
"In the U.S. Pat. No. 5,471,549 (
"The method can't work if various orientation directions of text can be present on the form as in the previous example.
"In the U.S. Pat. No. 5,592,572 (
"The main shortcoming of the method lies in that the orientation estimation is performed along with recognition of text portions, thus reducing the method output.
"In the U.S. Pat. No. 6,137,905 (
"The shortcoming of the method is the low method output, depending greatly upon the recognition results.
"In the U.S. Pat. No. 6,169,822 (
"To achieve the reliable result via the said method the large number of text portions are to be recognized. That surely reduces the method output."
In addition to the background information obtained for this patent, VerticalNews journalists also obtained the inventors' summary information for this patent: "One or more objects on the form are assigned to compose a graphic image that unambiguously defines the form's direction of spatial orientation. The properties of the said graphic image comprise the description of a special model for defining the direction of spatial orientation. By identifying the image with the said model, the correct direction of the image's spatial orientation is defined. The said model properties are stored in a special data storage means, one embodiment of which is the form image model description.
"In the similar way one or more form objects are assigned thereon, composing graphic image, unambiguously defining its type. Additionally one or more form objects may be assigned, for the case of profound form type analysis, if two or more forms are close in appearance or in properties list. The graphic image properties comprise description of a special model for form type definition. The said model properties are stored in a special data storage means, one of the embodiment of which is a form model description.
"After converting the form image is parsed into regions containing text images, data input fields, special reference points, lines and other objects.
"The possible distortion, caused by the document conversion to electronic state, is eliminated from the image.
"Objects, comprising the graphic image for spatial orientation verification, are identified on the form image. The orientation direction accuracy is verified and corrected if necessary.
"The objects, comprising the graphic image for form type definition, are identified on the form image. The proper model is selected via identification of the said graphic image. In the case of multiple identification result, the profound analysis of the form type is performed. The profound analysis is performed in the similar way adding the supplementary objects to the graphic image and performing new identification.
"The profound analysis is performed automatically or fully or partly manually."
For the URL and more information on this patent, see: Zuev, Konstantin; Filimonova, Irina; Zlobin, Sergey. Method of Pre-Analysis of a Machine-Readable Form Image. U.S. Patent Number 8805093, filed
Keywords for this news article include:
Our reports deliver fact-based news of research and discoveries from around the world. Copyright 2014, NewsRx LLC