Cloud Capture Advanced
Cloud Capture Advanced simplifies the process of extracting targeted data from images.
Unlike Full Page Capture which extracts all the text on the page, Advanced Capture targets only the values you need.
Cloud Capture Advanced is ideal for extracting document data for integration into your applications and business processes.
Extract data from almost any document from free-flowing correspondence to highly structured forms.
Eliminate the need to configure batches or zone templates.
Attain exceptional accuracy with Cloud Capture’s industry, process and application-specific Advanced Capture Dictionaries.
Avoid costly implementations, set-up costs and long term licenses – pay only for what you capture.
How it works
This is your document before it has been “imaged”. In it’s native format, the document’s text is completely accessible.
Once a document has been imaged, the text is no longer directly accessible by computer applications. Imaged documents cannot be searched for, indexed or managed based on their content.
The captured data is packaged into an XML file, ready for integrating into your target application.
This is where your document is imaged. Imaging is the process of creating an electronic image from a paper document (by scanning or photographing it) or converting a text document to an image.
Concord Cloud Capture analyzes the imaged document and intelligently extracts the information you need. Unlike zone-based capture and OCR products, Cloud Capture does not require the target values to be located in specific locations on the document. Instead, Cloud Capture finds target values using pattern recognition techniques to identify relationships in the text.
Convert .TIF .PDF .JPEG and .PNG Files
Set up an email inbox and Cloud Capture will scan inbound messages for images to process. You can create inboxes for individual recipients, groups or business processes. Once the images have been converted to searchable PDF, plain text or XML, they are delivered to the destinations you set up in Cloud Capture (keep reading for more information on destinations).
Folders can be easily linked to Cloud Capture. Once linked, any images placed in the folder will be sent to the Cloud Capture service for processing. Once processing is finished, the converted document is delivered to the destination you set up.
Cloud Capture offers developers a flexible set of web services for submitting images to the service. Cloud Capture’s web services are available to anyone seeking to integrate document capture and conversion into other applications or business processes.
Capturing Text and Data
Cloud Capture automates document identification by analyzing relationships between words. This pattern-based approach provides significantly more flexibility than “zone” or location based techniques which are limited to looking at static locations on a page. Cloud Capture’s Document Identification method is not impacted by changes in formatting or layout.
Cloud Capture’s barcode recognition system provides a simple approach to document identification and indexing values. Cloud capture supports multiple 1D and 2D barcodes on a single page.
Index Field Capture
Index Fields are used for capturing specific text on known document types. A template is created for each document type. This template tells Cloud Capture where to find the target values. Should Cloud Capture come across a document it does not possess a template for, it will simply revert to Full Page capture and extract all the text on the document.
Advanced Data Extraction
Advanced Data Extraction greatly expands on Index Fields by using Natural Language Processing (NLP) techniques within the capture process. NLP is capable of identifying the data you want to extract by finding patterns and word relationships in documents. This method enables target data points to be found regardless of where they appear in a document.
Searchable PDF combines the submitted image and the captured text, creating a fully searchable document which looks identical to the original. This format is ideal for preserving the original appearance of documents while making them searchable and more importantly, accessible to full text indexing services. Searchable PDF also enables users select and copy text directly from the document.
Cloud Capture’s Plain Text output provides a small, lightweight text file, free from formatting and layout data. This is often the preferred format for moving unstructured content from one application to another.
Cloud Capture’s XML output simplifies the process of moving highly structured content from system to system. Concord provides XML output with a schema customized for your target application.
Email / SMTP
Deliver converted documents to any email address.
Converted documents can be stored in any folder synchronized with Cloud Capture.
Web Services offer the most flexible approach to integrating converted documents and extracted data into your own applications.
HTTP / HTTPS
Ideal for moving finished documents and data to web destinations.
FTP / FTPS
FTP individual or batches of finished documents.