How Cloud Capture Works
The problem with document images
When a document is scanned, faxed or photographed, it is stored as an image file such as a TIF, JPEG or image-only PDF. Because the content is stored as an image, its text is no longer accessible to most IT applications.
This is why business processes involving scanned or faxed images frequently take longer to complete and are more susceptible to errors.
Cloud Capture Basics
Cloud Capture solves the image-based document problem by automatically converting those images to text documents. The converted documents can then be delivered in a variety of ways, depending on the application.
By automating this process, Cloud Capture enables image-based documents like scanned images and faxes to move through workflows and business processes as efficiently as native text documents.
No human intervention required.
Convert .TIF, .PDF, .JPEG and .PNG Files
Set up an email inbox and Cloud Capture will scan inbound messages for images to process. You can create inboxes for individual recipients, groups or business processes. Once the images have been converted to searchable PDF, plain text or XML, they are delivered to the destinations you set up in Cloud Capture (keep reading for more information on destinations).
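As an illustration of what scanning an inbound message for images involves, the following minimal sketch uses Python's standard email library to list the attachments whose file extension matches one of the formats named above. It is not Cloud Capture's implementation. The function names, the sample addresses and the inclusion of the .tiff and .jpg spelling variants are assumptions made for this example.

```python
import os
from email import message_from_bytes
from email.message import EmailMessage
from email.policy import default

# Extensions treated as convertible. The .tiff and .jpg variants are an
# assumption added for this sketch; the text above names TIF, PDF, JPEG, PNG.
IMAGE_EXTENSIONS = {".tif", ".tiff", ".pdf", ".jpg", ".jpeg", ".png"}


def find_convertible_attachments(raw_message: bytes) -> list[str]:
    """Return the filenames of attachments that look like convertible images."""
    message = message_from_bytes(raw_message, policy=default)
    names = []
    for part in message.iter_attachments():
        filename = part.get_filename()
        if filename is None:
            continue
        # Compare extensions case-insensitively (".TIF" and ".tif" both match).
        extension = os.path.splitext(filename)[1].lower()
        if extension in IMAGE_EXTENSIONS:
            names.append(filename)
    return names


def build_sample_message() -> bytes:
    """Build a hypothetical inbound message with one scan and one other file."""
    msg = EmailMessage()
    msg["From"] = "sender@example.com"
    msg["To"] = "invoices@example.com"
    msg["Subject"] = "Scanned invoice"
    msg.set_content("Please process the attached scan.")
    msg.add_attachment(b"fake image bytes", maintype="image",
                       subtype="tiff", filename="invoice.TIF")
    msg.add_attachment(b"notes", maintype="application",
                       subtype="octet-stream", filename="notes.docx")
    return msg.as_bytes()


if __name__ == "__main__":
    print(find_convertible_attachments(build_sample_message()))
```

Filtering on the file extension keeps the sketch short; a production service would more likely inspect the content type or the file contents as well.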
Folders can be easily linked to Cloud Capture. Once linked, any images placed in the folder will be sent to the Cloud Capture service for processing. Once processing is finished, the converted document is delivered to the destination you set up.
Cloud Capture offers developers a flexible set of web services for submitting images to the service. Cloud Capture’s web services are available to anyone seeking to integrate document capture and conversion into other applications or business processes.
Capturing Text and Data
Full Page Capture
Full Page Capture finds, captures and converts all the text on the page. This method is ideal for capturing letters, printed emails, notes and any other documents in which the text is not constrained by boxes (as it is on forms).
Cloud Capture’s barcode recognition system allows you to capture 1D and 2D barcodes.
Index Field Capture
Index Fields is used for capturing specific text on known document types. A template is created for each document type. This template tells Cloud Capture where to find the target values. If Cloud Capture encounters a document for which it has no template, it reverts to Full Page Capture and extracts all the text on the document.
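The template lookup and its fallback can be sketched as follows. This is a simplified illustration, not Cloud Capture's actual template format. It assumes the page has already been recognized as lines of text and that a template locates each value by line number and character range. The FieldRegion class, the TEMPLATES table and the capture function are hypothetical names.

```python
from dataclasses import dataclass


@dataclass
class FieldRegion:
    """Hypothetical template entry: where on the page a target value sits."""
    name: str
    line: int   # zero-based line index on the recognized page
    start: int  # first character of the value (inclusive)
    end: int    # last character of the value (exclusive)


# Hypothetical templates, one per known document type.
TEMPLATES = {
    "invoice": [
        FieldRegion("invoice_number", line=0, start=9, end=17),
        FieldRegion("total", line=2, start=7, end=13),
    ],
}


def capture(document_type: str, page_lines: list[str]) -> dict[str, str]:
    """Use the template for known types; otherwise return all the text."""
    template = TEMPLATES.get(document_type)
    if template is None:
        # No template for this document type: fall back to full page capture.
        return {"full_text": "\n".join(page_lines)}
    return {
        region.name: page_lines[region.line][region.start:region.end].strip()
        for region in template
    }


if __name__ == "__main__":
    invoice = ["Invoice: INV-1001", "Date: 2024-01-05", "Total: 149.50"]
    print(capture("invoice", invoice))
    print(capture("letter", ["Dear Sir,", "Hello"]))
```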
Advanced Data Extraction
Advanced Data Extraction greatly expands on Index Fields by using Natural Language Processing (NLP) techniques within the capture process. NLP identifies the data you want to extract by finding patterns and word relationships in documents. This method enables target data points to be found regardless of where they appear in a document.
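To show what position-independent extraction means in practice, the sketch below finds the same two values in two differently laid out documents. It uses regular expressions, a far simpler technique than the NLP methods described above, and is offered only as a stand-in for the idea. The patterns, the field names and the extract function are assumptions made for this example.

```python
import re

# Hypothetical patterns: each one looks for a label followed by a value,
# wherever that pair occurs in the text.
PATTERNS = {
    "invoice_number": re.compile(
        r"invoice\s*(?:number|no\.?|#)?\s*[:\-]?\s*([A-Z]{2,}-\d+)",
        re.IGNORECASE,
    ),
    "total": re.compile(
        r"total(?:\s+due)?\s*[:\-]?\s*\$?(\d+(?:\.\d{2})?)",
        re.IGNORECASE,
    ),
}


def extract(text: str) -> dict[str, str]:
    """Return every field whose pattern matches somewhere in the text."""
    found = {}
    for name, pattern in PATTERNS.items():
        match = pattern.search(text)
        if match:
            found[name] = match.group(1)
    return found


if __name__ == "__main__":
    layout_a = "Invoice number: INV-1001\nThank you.\nTotal due: $149.50"
    layout_b = "Amount summary\nTOTAL 149.50\nReference: Invoice # INV-1001"
    print(extract(layout_a))
    print(extract(layout_b))
```

Both layouts yield the same fields even though the values appear in a different order and with different labels, which a fixed-position template could not handle.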
Searchable PDF combines the submitted image and the captured text, creating a fully searchable document that looks identical to the original. This format is ideal for preserving the original appearance of documents while making them searchable and, more importantly, accessible to full-text indexing services. Searchable PDF also enables users to select and copy text directly from the document.
Cloud Capture’s Plain Text output provides a small, lightweight text file, free from formatting and layout data. This is often the preferred format for moving unstructured content from one application to another.
Cloud Capture’s XML output simplifies the process of moving highly structured content from system to system. Concord provides XML output with a schema customized for your target application.
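As a rough picture of structured XML output, the sketch below serializes a set of captured fields with Python's standard xml.etree.ElementTree module. The element and attribute names (document, field, type, name) are invented for this example and do not represent the schema Concord provides, which is customized per target application.

```python
import xml.etree.ElementTree as ET


def fields_to_xml(document_type: str, fields: dict[str, str]) -> str:
    """Serialize captured fields into a small, hypothetical XML layout."""
    root = ET.Element("document", attrib={"type": document_type})
    for name, value in fields.items():
        field = ET.SubElement(root, "field", attrib={"name": name})
        # ElementTree escapes special characters such as & and < on output.
        field.text = value
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    captured = {"invoice_number": "INV-1001", "total": "149.50"}
    print(fields_to_xml("invoice", captured))
```

Because each value is carried in a named element, the receiving system can read fields by name rather than parsing free text.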
Email / SMTP
Deliver converted documents to any email address.
Converted documents can be stored in any folder synchronized with Cloud Capture.
Web Services offer the most flexible approach to integrating converted documents and extracted data into your own applications.
HTTP / HTTPS
Ideal for moving finished documents and data to web destinations.
FTP / FTPS
Transfer individual finished documents, or batches of them, by FTP or FTPS.