Used by a wide variety of companies such as Dow Chemical, Fairpoint Communications,
Multi-OCR processing - contentCrawler takes advantage of faster processing using multi-threading to optimize support for 4, 8, 16 and 32 CPU cores. For example, with 4 CPU core processing, contentCrawler will be able to OCR 1 page per second, or 85,000 pages per day. This represents a significant improvement over other OCR solutions and remains unique in its ability to OCR documents already stored in a DMS. 16 CPU core processing will be capable of OCR'ing 4 pages per second, or up to 350,000 pages per day!
Apply Advanced Search filters - New Advanced Search filters provide users with greater control over document types to be processed. Users can exclude certain document types from the search to decrease processing time, including those saved as email message attachments.
Easy administration and reporting
Set up Service email notifications - Users can establish various email notifications to report on the progress of the crawl and request that the Service Statistics and Error reporting be emailed to them.
Monitor progress status - Users can instantly see the progress status of individual documents being processed at the OCR stage. This information is displayed to the user as a percentage.
Document information display - Provides document information such as total page number and size of documents being processed, including an overall total size of documents requiring OCR.
Configurable Multilingual OCR - Users can easily configure multilingual OCR’ing across all services. contentCrawler supports over 180 languages.
Export Report - Users can export processing reports as CSV files for analysis and review.
Configurable minimum disk space limit - Users can specify minimum free space threshold for document cache directory.
20% of documents in content repositories are invisible to search
contentCrawler was developed to address the very real and serious issue of non-searchable content in enterprise content management systems. More than 20% of documents in a content repository are "invisible" to search technology. These documents are often profiled as a result of ingestion of legacy or litigation documents, saving emails with attachments, mobile technology and employee workarounds that bypass the OCR'ing process. Failure to produce documents on demand impacts the bottom line, workplace efficiency, regulatory compliance, and productivity, and exposes an organization to unnecessary risks.
Download the contentCrawler 2.1 trial to see how much non-searchable content is in your content repositories. Email firstname.lastname@example.org for more information.
contentCrawler integrates with Autonomy iManage, Autonomy TRIM, OpenText eDOCS DM, ProLaw, MS SharePoint as well as MS Windows file systems. Integration with OpenText Content Server and Worldox will be available soon.
DocsCorp provides document professionals who use enterprise content management systems with integrated, easy-to-use software and services that extend document processing, review, manipulation and publishing workflows inside and outside their environment to drive business efficiency and to increase the value of their existing technology investment. DocsCorp operates in all countries around the world with customers located throughout the
Most Popular Stories
- Slow Week Ahead of December FOMC Meeting
- Hispanics Seek to Grow School Board Members
- GM Bailout Saved 1.2 Million U.S. Jobs, Report Says
- Questions Remain in Jenni Rivera's Death
- 'Knockout Game': Myth or Menace?
- Bitcoin Used to Buy Tesla Car
- Banks Fret as Volcker Vote Approaches
- Paul Walker Fans Pay Respects
- U.S. Companies Eager for Iranian Business
- Yellen Set to Become One of World's Most Powerful Women