NetDocuments is your central library for documents, yet in our experience as many as 30% of documents may be invisible to NetDocuments search because they contain images of text rather than searchable text. Image-based documents in NetDocuments include TIFF and JPEG files as well as scanned PDFs. They can also appear as attachments to email messages.
Optical Character Recognition (OCR) adds a text layer to a document to make it text-searchable. contentCrawler uses OCR to convert image-based documents to compressed, text-searchable PDFs, ensuring that every document in NetDocuments is fully searchable and retrievable.
contentCrawler cloud itself carries no usage charge, but running it in production requires a subscription license purchased from DocsCorp. An audit can be run without a DocsCorp license, but Microsoft Azure will charge you for the hours the VM is running. The audit itself runs for 48 hours, and some additional hours may be needed to review the results.
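As a rough illustration of the audit cost described above, the Azure charge can be estimated as the VM's hourly rate multiplied by the 48 audit hours plus any review hours. The sketch below is only an estimate under stated assumptions: the function name, the hourly rate, and the review hours are hypothetical, and the actual rate depends on the VM size and region you choose in Azure.

```python
def estimate_audit_vm_cost(hourly_rate, review_hours=0.0, audit_hours=48.0):
    """Estimate the Azure VM charge for one audit run.

    hourly_rate  -- hypothetical VM price per hour (check Azure for the real rate)
    review_hours -- extra hours the VM stays up while results are reviewed
    audit_hours  -- duration of the audit itself (48 hours per the description)
    """
    if hourly_rate < 0 or review_hours < 0 or audit_hours < 0:
        raise ValueError("rate and hours must be non-negative")
    total_hours = audit_hours + review_hours
    return round(hourly_rate * total_hours, 2)


if __name__ == "__main__":
    # Hypothetical example: a VM billed at 0.50 per hour, kept up 2 extra hours.
    print(estimate_audit_vm_cost(0.50, review_hours=2))
```

Stopping the VM as soon as the results have been reviewed keeps the review hours, and therefore the charge, to a minimum.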
Keywords: Content Crawler Net Compression