Automated Data Classification

Automatically Classify Structured and Unstructured Data

SoftWorks AI’s Trapeze Classification Module is a flexible, server-based software solution designed to automatically classify documents and route them to the proper folder or recipient. Manually filing documents can be an expensive and time-consuming task. In order to maximize employees’ time, organizations can leverage this technology designed to organize and classify scanned image documents and electronic PDF files.

The Trapeze Classification Module allows organizations to import their taxonomy, train the system to accurately identify specific document types, and automatically organize and classify scanned and electronic documents. The system can process both structure and unstructured documents with a high degree of accuracy. Based upon specified routing rules, the system can file documents to an organization’s content management repository, email copies to specified user(s), and facilitate downstream document-centric workflows.

automated data classification types

Machine Learning for Improved File Classification

SoftWorks AI’s file classification software leverages advanced machine learning, enabling organizations to easily train the software on an unlimited number of document types and document taxonomies. As the solution processes more documents, it gets smarter by understanding the organization’s classification and routing rules. As more documents are classified, organizations can realize more automation and greater cost reductions over time.


Reduce labor costs through automation

Automatically classify documents to minimize human intervention and time spent sorting through numerous files.

Easily classify unstructured documents and complex files

Trapeze processes structured, semi-structured, and unstructured documents. It also supports scanned image documents, digitally born files, and complex hybrids.

Achieve greater accuracy over time with machine learning

Machine learning enables organizations to train the software on an unlimited number of document types, improving touchless automation rates over time as new document types are added to the workflow.

Maintain compliance

Preserve compliance with record retention requirements with stable processing even at extreme volumes and support for PDF/A to ensure integrity and long term access to vital documents.

Exception Validation for Additional Control

Documents satisfying the specified confidence criteria can be automatically routed with no further intervention. Any other documents requiring user review are presented within our CVIEW Validation interface, which enables designated users to either approve or modify the suggested classification. Modifications made during validation sessions are leveraged as part of Trapeze Classification’s feedback loop and machine learning process, which continually enhances the software’s accuracy. Following validation, approved documents are automatically routed to the appropriate locations according to the routing rules.

Classify Electronic PDF Files Faster

The module gives users the unique ability to process both scanned documents and electronic or “native” PDF files with maximal efficiency. Whenever an electronic document is detected, Trapeze will bypass the resource-intensive and time-consuming OCR process for pre-existing text layers, a powerful function that typically improves system performance speed by as much as 80%. In terms of accuracy, organizations can expect to achieve a 90-100% text confidence factor on average for electronic PDFs, allowing for greater auto-validation rates and a faster, more efficient workflow.

Optimize Turnaround Time with Fast, High-Volume Processing

The automated document classification module is a server-based solution designed for high-volume processing, making it a highly reliable addition to a demanding business workflow environment. Files can be processed in batch mode or through watch folders to enable unattended, automated workflows. Trapeze Classification supports parallel processing by maximizing available CPUs, enabling faster turnaround times.