The patent's inventors are Haiby, Neta (Hertzelia, IL); Ziklik, Elad (Modiin, IL); Hudis, Efim (
This patent was filed on
From the background information supplied by the inventors, news correspondents obtained the following quote: "Computer systems and related technology affect many aspects of society. Indeed, the computer system's ability to process information has transformed the way we live and work. Computer systems now commonly perform a host of tasks (e.g., word processing, scheduling, accounting, etc.) that prior to the advent of the computer system were performed manually. More recently, computer systems have been coupled to one another and to other electronic devices to form both wired and wireless computer networks over which the computer systems and other electronic devices can transfer electronic data. Accordingly, the performance of many computing tasks is distributed across a number of different computer systems and/or a number of different computing environments.
"In some computing environments, various different types of data are stored in and accessed from databases. Some databases store data of specified types, such as, for example, customer addresses, employee information, etc. Often, data is manually entered into databases. Manual data entry is prone to human error. Further, many errors once entered become virtually undetectable, because once the data is entered there is little if any further human interaction with the data. Additionally, some errors, such as, for an extra digit in a telephone number, may be difficult to identify. Other data inconsistencies can be caused by different data dictionary definitions of similar entities in different stores or by transmission and/or storage errors. Thus over time, a database can accumulate various data errors and inconsistencies
"Administratively, incorrect or inconsistent data can lead to false conclusions and misdirected investments on both public and private scales. For example, the government may want to analyze population census figures to decide which regions require further spending and investment on infrastructure and services. In this case, it will be important to have access to reliable data to avoid erroneous fiscal decisions.
"In the business world, incorrect data can be costly. Many companies use customer information databases that record data like contact information, addresses, and preferences. If for instance the addresses are inconsistent, the company will suffer the cost of resending mail or even losing customers.
"Accordingly, from time to time, database administrators can 'cleanse' or 'scrub' data. Data cleansing or data scrubbing includes detecting and correcting (or removing) corrupt or inaccurate records from a record set, table, or database. Often used in databases, data cleansing or scrubbing can identify incomplete, incorrect, inaccurate, irrelevant etc. parts of data and then replace, modify or delete this 'dirty' data. After cleansing, a data set is (at least more) consistent with other similar data sets.