Learn about docWorks products, view online manuals, get the latest downloads, and more.
Enter the next level: Machine learning based layout analysis
Dear clients,Dear partners,
Heading towards 2020 we are happy to uplift docWorks to the version 7.1.
On top of improved performance, up to date OCR engines, and an easy way to handle multiple OCR engines, docWorks 7.1 offers the option to adapt your projects to our first generation of machine learning based layout analysis.
T A K E Y O U R P R O J E C T T O T H E N E X T L E V E LW I T H M A C H I N E L E A R N I N G B A S E D L A Y O U T A N A L Y S I S
Contact us now to order your update, scheduled from 15th December 2019.
We look forward to hearing from you!
Your CCS docWorks Team
T H E N E X T L E V E L
Machine learning based layout analysisdocWorks 7.1 offers the option to improve layout analysis based on machine learning. The machine learning support in docWorks 7.1 increases automation through better zoning quality. Over time the proportion of manual post processing will be minimized, accuracy and precision in the layout analyzing process will be increased.
Including pre-trained modeldocWorks 7.1 includes a pre-trained model, ready to be used for newspapers with conservative layout. Large scale digitization projects can greatly benefit from custom trained models.
Get your custom modelA custom machine learning model, based on your particular training data set, may increase the zoning quality significantly. CCS docWorks offers the creation of such models as a service.
Contact our sales team for your personal, non-binding offer:firstname.lastname@example.org.
S M A R T B A S I C S
Updated OCR enginesdocWorks 7.1 supports the latest OCR engines:- Tesseract 4.1,- ABBYY SDK Runtime FineReader 12- OmniPage SDK v.20.
OCR engine handlingdocWorks 7.1 allows you to manage OCR engines via user interface. Assign suitable OCR engines to specifc projects in an easy way. You may also select older OCR engine versions (if available), e.g. to maintain consistency throughout your projects.
ALTO 4.1 outputdocWorks 7.1 supports ALTO version 4.1. The inclusion of glyphs export is optional in ALTO 4.1, based on configuration.
PerformancedocWorks 7.1 is built on 64 bits, improving the performance and resource usage noticeably.
A detailed overview of all new features can be found in theCCS Extranet Information Center.
Be nice to the world. Please dont't print this e-mail unless you really need to. The information contained in this e-mail message is intended only for the personal and confidential use of the recipient(s) named above. If the reader of this message is not the intended recipient or an agent responsible for delivering it to the intended recipient, you are hereby notified that you have received this document in error and that any review, dissemination, distribution, or copying of this message is strictly prohibited. If you have received this communication in error, please notify us immediately by e-mail, and delete the original message. Thank you.
The conversion of print material into searchable and metadata-rich digital collections is one of the most important tasks for many libraries. However, investments in the necessary conversion technology and labor are expensive and often oversized. This is particularly true for small to medium-sized libraries that only want to convert occasionally and do not have the budget for a constant setup.
That's why we developed docWorks SX.
docWorks SX is a cloud-based version of our proven docWorks Service (Digitization Services), which is used by libraries that prefer to outsource their digitization and conversion work to a professional service provider. docWorks SX is specifically designed to meet the needs of small to medium-sized libraries or libraries that want to perform smaller test runs before investing in large digitization projects.
Features of docWorks SX
Please contact me to learn more about how docWorks SX can help you create metadata-rich digital collections.
George Schlukbier Director North America
Phone: +1 919 889 9050 Email: email@example.com
OCR verification and correction is one of the most challenging tasks of the conversion workflow, especially when digitizing publications in foreign languages or writing systems. However, in-house staff do not always have the specific language skills needed, which means that this data cannot be checked sufficiently or only at high cost. That's why we developed ORCA.
“ORCA is a great product. It simplifies the OCR correction process and thereby solves a central bottleneck problem. I am pleased that CCS has made the additional effort to develop it further, with my team giving first-hand feedback.”
—Vincent Tan, CD Imaging Singapore
ORCA enables a completely new, decentralized and flexible distribution of OCR post-editing, so that projects can be completed faster, better and more cost-effectively.
Please call us at +49 40 227130-0 to learn more about how ORCA can help your project.
Your docWorks team