
ABBYY OCR SDK integrates advanced OCR capabilities into applications, converting images and PDFs into searchable and editable formats. It enhances data accuracy, supports multiple languages, and improves document processing efficiency.
Vendor
ABBYY

The most comprehensive AI OCR SDK for software developers
Integrate AI-powered OCR features into your text and data processing applications.
What is ABBYY FineReader Engine?
The ABBYY FineReader Engine software development kit uses AI OCR to allow software developers to create applications that extract textual information from paper documents, images or displays. This best-in-class AI OCR SDK provides your application with excellent text recognition, PDF conversion, and data capture functionalities, enabling it to convert scans into searchable PDF, Word or Excel documents, and access data on photos or screenshots.
Features & Benefits
Whether you are a software vendor, system integrator or an enterprise company developing your own IT systems, ABBYY OCR SDK will help you create highly accurate text and data processing applications. Create desktop or server applications for Windows, Linux or macOS and deploy them in the cloud or on virtual machines. The diverse OCR features can add value to applications in many areas, such as DMS, ERP, RPA, insurance, banking, healthcare, legal and machine vision.
Advanced features for developers
ABBYY FineReader Engine offers advanced features for software developers creating applications for Windows, Linux, and macOS platforms.
Comprehensive set of recognition technologies
The SDK offers award-winning, highly accurate recognition of machine-printed text (OCR) and hand-printed text (ICR) that can be performed as full-page or field-level recognition. In addition, it provides recognition of barcodes (OBR).
Powerful PDF processing tools
The SDK offers document conversion into searchable PDF and PDF/A formats. When importing PDFs, a missing text layer can be added while the PDF properties are preserved. XML data can be extracted from imported PDF/A-3 files and embedded when saving to PDF/A-3.
Artificial intelligence and machine learning
When converting many pages such as complete document archives or books, developers can leverage the SDK's flexible and scalable AI OCR architecture. By using multi-core CPUs and processing images in parallel on multiple threads, the OCR steps can be performed significantly faster.
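The fan-out pattern described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the FineReader Engine API: `recognize_page` is a hypothetical stand-in for whatever per-page recognition call the application makes, and the file names are invented. It assumes the real call is a native routine that can run concurrently across threads.

```python
from concurrent.futures import ThreadPoolExecutor
import os


def recognize_page(image_path: str) -> str:
    """Hypothetical stand-in for a per-page OCR call (not an ABBYY API)."""
    return f"text of {image_path}"


def recognize_pages(image_paths, max_workers=None):
    """Recognize pages concurrently and return the results in input order."""
    # Default to one worker per CPU core, with a floor of one.
    workers = max_workers or os.cpu_count() or 1
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # map() yields results in input order even if pages finish out of order.
        return list(pool.map(recognize_page, image_paths))


# Illustrative file names only.
pages = [f"scan_{i:03d}.png" for i in range(1, 5)]
results = recognize_pages(pages)
```

Keeping results in input order means a multi-page document can be reassembled without extra bookkeeping, whichever page finishes first.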
Support for Cloud and virtual environments
In addition to on-premise deployment, software developers have the option to deploy their applications in virtual environments or host their offerings on Cloud platforms like Microsoft Azure or Amazon Web Services.
Advanced image pre-processing functions
Image pre-processing increases recognition accuracy by optimizing the image for OCR. Even low-quality images can yield good OCR results after de-skewing, rotation, distortion correction, text line straightening, page splitting, adaptive binarization, ISO noise reduction and other image correction steps.
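To make one of these steps concrete, here is a small pure-Python sketch of adaptive binarization using a local-mean threshold. It illustrates the general technique only and is not ABBYY's implementation. The function name, the `radius` and `offset` parameters, and the list-of-lists image representation are all assumptions chosen for the example.

```python
def adaptive_binarize(gray, radius=1, offset=10):
    """Binarize a grayscale image (rows of 0-255 ints) with a local-mean threshold.

    A pixel becomes black (0) when it is darker than the mean of its
    neighbourhood by more than `offset`; otherwise it becomes white (255).
    """
    height = len(gray)
    width = len(gray[0]) if height else 0
    out = []
    for y in range(height):
        row = []
        for x in range(width):
            # Clip the window at the image borders.
            y0, y1 = max(0, y - radius), min(height, y + radius + 1)
            x0, x1 = max(0, x - radius), min(width, x + radius + 1)
            window = [gray[j][i] for j in range(y0, y1) for i in range(x0, x1)]
            local_mean = sum(window) / len(window)
            row.append(0 if gray[y][x] < local_mean - offset else 255)
        out.append(row)
    return out


# A single dark "ink" pixel on a light background.
sample = [
    [200, 200, 200],
    [200, 50, 200],
    [200, 200, 200],
]
binary = adaptive_binarize(sample)
```

Because the threshold is computed per neighbourhood rather than once for the whole page, this kind of approach tends to cope better with uneven lighting than a single global cutoff.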
Recognition profiles for fast implementation
Predefined processing profiles contain pre-set parameters for the best OCR results within frequent recognition tasks. They can be used in place of manually fine-tuning the recognition system and provide faster SDK integration.
Sample code library
A comprehensive set of code samples shows how to use the OCR SDK, illustrates the different FineReader Engine technologies, and can be reused in development projects. Each library section contains code snippets accompanied by step-by-step instructions on how to perform the relevant task.
Detailed SDK documentation and support
Detailed documentation helps to streamline integration. Developers have access to additional information through ABBYY’s Knowledge Base and Developers’ Forum. In addition to these resources, ABBYY technical support is available to you during your free trial.
Compare Documents function
The new Compare Documents function compares two versions of the same document and detects differences in their content. The two versions can be in different formats, e.g. Microsoft Word, PDF, TIFF, JPEG, or any other supported input format.
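The idea of detecting content differences between two versions can be sketched with Python's standard `difflib` module, applied to text that has already been extracted from each document. This is an assumption-laden illustration, not the SDK's Compare Documents API. The `compare_texts` helper and the sample invoice text are hypothetical.

```python
import difflib


def compare_texts(old_text: str, new_text: str):
    """Return a (tag, old_lines, new_lines) tuple for each changed region."""
    old_lines = old_text.splitlines()
    new_lines = new_text.splitlines()
    matcher = difflib.SequenceMatcher(a=old_lines, b=new_lines, autojunk=False)
    changes = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        # Tags are "replace", "delete", "insert" or "equal"; skip unchanged runs.
        if tag != "equal":
            changes.append((tag, old_lines[i1:i2], new_lines[j1:j2]))
    return changes


# Illustrative text, as if extracted from two versions of one document.
v1 = "Invoice 1001\nTotal: 250 EUR\nDue: 30 days"
v2 = "Invoice 1001\nTotal: 275 EUR\nDue: 30 days\nPaid by card"
differences = compare_texts(v1, v2)
```

Comparing extracted text rather than file bytes is what allows the two versions to come from different source formats, such as a Word file and a scanned image.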