Call Aspose.OCR Cloud from Java with this open source SDK to build applications that can read text from images and scanned PDFs on any platform with minimal system requirements.
Vendor
Aspose
Company Website
Aspose.OCR Cloud is a fast and reliable REST API for optical character recognition. With it, you can add optical character recognition to your applications without worrying about system requirements – all resource‑intensive tasks are performed by high‑performance servers maintained by Aspose. The API supports European, Cyrillic and Chinese languages and can recognize scanned images, smartphone photos, screenshots, areas of images, and scanned PDFs, returning results in the most popular document and data exchange formats, including JSON. This SDK greatly simplifies the integration of Aspose.OCR Cloud services into your Java applications. It wraps all the routine operations such as establishing connections, sending API requests, and parsing responses into a few readable and maintainable methods, allowing you to focus on the tasks at hand rather than the technical details. The Java SDK is completely open source without any restrictions or limitations. You can use it along with demo applications for any projects, including commercial applications, and modify any part of the code as needed.
Features
- Extracts text from scanned images and PDFs
- Supports raster and vector images
- Reads languages based on Latin, Cyrillic, Hindi, Arabic, and other alphabets
- Recognizes more than 6,000 Chinese characters
- Processes tables and receipts
- Processes entire images or selected areas
- Automatically corrects rotated, skewed, and noisy images
- Automatically finds and corrects misspelled words
- Requires minimal resources on end-user devices 45 Recognition Languages Supported Extended Latin alphabet: Azerbaijani, Albanian, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Indonesian, Italian, Javanese, Latin, Latvian, Lithuanian, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Spanish, Swedish, Turkish, Uzbek, Vietnamese. Cyrillic: Bulgarian, Russian, Serbian, Ukrainian. Middle Eastern: Arabic, Hebrew, Persian (Farsi), Urdu. Indic: Bengali, Hindi. Far East: Chinese, Japanese, Korean, Thai, Tibetan. Other alphabets: Georgian, Greek. Read photos and low-quality scans Built‑in preprocessing filters remove skew, glare, dirt, noise, scratches, gradients, and other defects automatically, allowing reliable recognition of smartphone photos. Recognize and convert Recognizes images from: PDF, JPEG, PNG, TIFF, GIF, BMP, EMF, EPS, SVG. Returns results in: plain text, searchable PDF, Microsoft Excel, CSV, hOCR. Minimal system requirements As a cloud‑based OCR service, Aspose.OCR Cloud does not require special hardware or OS. OCR is powered by GPU‑based Amazon servers. Spell check Automatically replaces misspelled words caused by poor scan quality or print defects. Create Searchable PDFs Convert scanned PDF files to full‑text searchable PDFs with selectable text. Recognize images from the Internet Send an image URL directly to the API for processing—no upload required. Unlimited possibilities with Aspose Cloud solutions One Aspose Cloud account unlocks all APIs, allowing OCR integration with OMR, document conversion, data extraction, and more.
Benefits
- Offloads heavy OCR processing to cloud servers
- Enables high‑accuracy recognition from scans, photos, and low‑quality images
- Provides broad language and script support for global applications
- Easily integrates into any Java application thanks to simplified SDK
- Supports multi‑page PDFs and TIFFs
- Offers flexible output formats for documents and data workflows
- Open source with no usage restrictions
- Scales automatically with cloud infrastructure