Leveraging the benefits of intelligent data capture (IDC) technology, we have helped both local SMEs and MNCs automate their document and data processing. From consulting to implementation and customization, we deliver solutions that fit into users’ IT environments and bring immediate value.
Why Intelligent Data Capture
Streamlined Process: Faster workflows mean greater efficiency. IDC integrates with existing systems and eliminates the need for manual data entry
Cost Savings: Automated processing rates average 90%, with cost savings of 80%
Fast ROI: Enjoy the immediate benefits of accelerated business transactions and processes
More about IDC technology
For less-structured document types, the subfield of intelligent data capture (IDC) was developed. It takes a content-based, rather than layout-based, approach to documents. Most modern capture solutions that use IDC depend on a pre-production learning phase, during which human operators provide example documents. The software then scans and analyzes all the words on every page to build a statistical model of word relationships and probabilities. For example, an operator may provide one mortgage document and one land usage document; the system will build a model that notes the presence of terms like borrower, SSN, interest, and principal in the former, while prioritizing words such as title, bounds, survey, and easement for the latter. This example is simplified; the extensive matrices that today’s systems generate are far more nuanced and sophisticated.
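The learning phase described above can be sketched in a few lines of Python. This is a minimal illustration only, assuming a naive Bayes word-count model with add-one smoothing and equal weighting of document types; it is not the method of any particular IDC product. The names `tokenize`, `train`, `classify`, and `EXAMPLES`, as well as the sample texts, are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def train(examples):
    """Build per-type word counts from operator-provided example documents.

    `examples` maps a document type to a list of sample texts (hypothetical data).
    """
    counts = {label: Counter() for label in examples}
    for label, texts in examples.items():
        for text in texts:
            counts[label].update(tokenize(text))
    vocabulary = set()
    for counter in counts.values():
        vocabulary.update(counter)
    return counts, vocabulary


def classify(text, counts, vocabulary):
    """Return the document type whose word model gives the highest log-probability."""
    scores = {}
    for label, counter in counts.items():
        total = sum(counter.values())
        score = 0.0
        for word in tokenize(text):
            # Add-one smoothing: an unseen word lowers the score but never zeroes it.
            score += math.log((counter[word] + 1) / (total + len(vocabulary)))
        scores[label] = score
    return max(scores, key=scores.get)


# Hypothetical training examples, one per document type.
EXAMPLES = {
    "mortgage": ["The borrower agrees to repay principal and interest. Borrower SSN required."],
    "land_usage": ["Title survey showing bounds and easement of the parcel."],
}

counts, vocabulary = train(EXAMPLES)
print(classify("Interest and principal owed by the borrower", counts, vocabulary))
print(classify("Survey of the easement and bounds", counts, vocabulary))
```

Because each word contributes a probability rather than a hard rule, a document is assigned to the type whose vocabulary it most resembles overall, even when some of its words were never seen during training.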
Having created predictive models for these document types, a modern capture system can then correctly recognize other instances of the same document, e.g. two title surveys from the same company. Much more usefully, it can also recognize and classify completely novel documents of the same type, such as a title survey from a different surveyor, which might have an entirely different layout and a handful of different terms. How is this possible? Because IDC relies on probabilities rather than absolute relationships, it is flexible enough to tolerate slight differences in data. That novel set of title surveys might use somewhat different wording, but it will likely retain more than 90% of the same overall vocabulary because it is still a survey. This is the paradigm at the heart of IDC: document recognition in today’s solutions is no longer a rote, mechanical process but is semantically based, adaptable, and truly intelligent.
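The idea of shared vocabulary can be made concrete with a small Python sketch. It measures what fraction of a new document’s distinct words also appear in a known document of the same type. The helper names `word_set` and `overlap` and the sample texts are hypothetical, and real systems use far richer statistics than a simple set comparison.

```python
import re


def word_set(text):
    """Return the set of distinct lowercase words in the text."""
    return set(re.findall(r"[a-z]+", text.lower()))


def overlap(novel_text, known_text):
    """Fraction of the novel document's distinct words also found in the known document."""
    novel = word_set(novel_text)
    if not novel:
        return 0.0
    return len(novel & word_set(known_text)) / len(novel)


# Hypothetical texts: a known title survey, a survey from a different
# surveyor, and an unrelated mortgage document.
known_survey = "Title survey of the parcel showing bounds and easement"
novel_survey = "Survey of parcel bounds and easement title prepared"
mortgage = "Borrower agrees to repay principal and interest"

print(overlap(novel_survey, known_survey))
print(overlap(mortgage, known_survey))
```

In this toy data the novel survey shares most of its words with the known survey, while the mortgage text shares very few, which is why a probability-based model can tolerate a new layout and a few unfamiliar terms.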