"The actual process of data cleansing may involve removing typographical errors or validating and correcting values against a known list of entities. The validation may be strict (such as rejecting any address that does not have a valid postal code) or fuzzy (such as correcting records that partially match existing, known records).
"A list of know entities can be provided by a data cleansing service provider configured for cleansing a specified type of data. For example, a data cleansing service can be configured to cleanse postal address or telephone numbers in
Supplementing the background information on this patent, VerticalNews reporters also obtained the inventors' summary information for this patent: "The present invention extends to methods, systems, and computer program products for comparing and selecting data cleansing service providers. In some embodiments, a reference data service provider is identified for cleansing. A sample source of data is mapped to a selected data domain. The data domain is associated with data elements having a specified arrangement of data. The sample source of data has known data inconsistencies.
"A list of a plurality of reference data service providers configured to cleanse data elements for data in the selected data domain. A selection of a subset of plurality of reference data service providers that are to be explored is received. The sample source of data is submitted to each reference data service provider in the subset of reference data service providers. Results of cleansing the sample source of data received back from each reference data service provider in the subset of reference data service providers. For each reference data service provider, the results include an allegedly cleansed sample source of data derived from the sample source of data.
"The results from each of the reference data service providers in the subset of the plurality of reference data service providers are profiled. Profiling includes determining how each reference data service provider dealt with the known data inconsistencies in the sample source of data. A comparison between the subset of the plurality of reference data service providers is displayed on a display device. The displayed comparison is based on the profiled results. A user selection of a reference data service provider is received from the displayed comparison. The selected reference data service provider is indicated as appropriate for cleansing further data in the data domain.
"This summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used as an aid in determining the scope of the claimed subject matter.
"Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by the practice of the invention. The features and advantages of the invention may be realized and obtained by means of the instruments and combinations particularly pointed out in the appended claims. These and other features of the present invention will become more fully apparent from the following description and appended claims, or may be learned by the practice of the invention as set forth hereinafter."
For the URL and additional information on this patent, see: Haiby, Neta; Ziklik, Elad; Hudis, Efim; Peleg, Gad. Comparing and Selecting Data Cleansing Service Providers. U.S. Patent Number 8510276, filed
Our reports deliver fact-based news of research and discoveries from around the world. Copyright 2013, NewsRx LLC