Extract text from images and create searchable PDFs directly from the command line or Bash scripts without installing any software.
Vendor
Aspose
Aspose.OCR Cloud for cURL is a cloud-based REST API that extracts text from images, scanned PDFs, screenshots, and smartphone photos directly from the command line. It requires no software installation: any environment that can run cURL commands or Bash scripts can use the service. Developers can also call the API from third-party REST tools such as Postman. The solution supports Latin, Cyrillic, Middle Eastern, Indic, and Asian scripts and can recognize over 6,000 Chinese characters. Recognition results can be returned in widely used data exchange formats, including plain text, JSON, searchable PDF, Excel, CSV, and hOCR.
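As a rough illustration of the workflow described above, the following Bash sketch encodes an image as base64, wraps it in a JSON body, and posts it with cURL. The endpoint URL, the bearer-token variable, and the JSON field names "image" and "language" are hypothetical placeholders, not the documented Aspose.OCR Cloud contract; take the real values from the official API reference.

```shell
#!/usr/bin/env bash
# Hypothetical placeholders: replace with values from the official API reference.
OCR_ENDPOINT="${OCR_ENDPOINT:-https://api.example.com/ocr/recognize}"  # assumed URL
OCR_TOKEN="${OCR_TOKEN:-}"                                             # assumed bearer token

# Build a JSON request body that carries the image as base64.
# The field names "image" and "language" are assumptions for illustration.
build_request_body() {
  local image_path="$1" language="${2:-eng}"
  local encoded
  # Strip line wraps so the base64 payload is a single JSON-safe string.
  encoded="$(base64 < "$image_path" | tr -d '\n')"
  printf '{"image":"%s","language":"%s"}' "$encoded" "$language"
}

# Send the body to the (assumed) endpoint and print the raw response.
recognize_image() {
  build_request_body "$1" "${2:-eng}" |
    curl --silent --show-error --fail \
      --request POST "$OCR_ENDPOINT" \
      --header "Authorization: Bearer $OCR_TOKEN" \
      --header "Content-Type: application/json" \
      --data-binary @-
}
```

With the placeholders filled in, a call such as `recognize_image scan.png eng > result.json` would store the response locally. Nothing is sent until `recognize_image` is invoked.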
Features
- Extracts text from scanned images and PDFs.
- Supports raster and vector image formats: PDF, JPEG, PNG, TIFF, GIF, BMP, EMF, EPS, SVG.
- Recognizes 45 languages, including extended Latin, Cyrillic, Arabic, Hebrew, Persian, Urdu, Bengali, Hindi, Chinese, Japanese, Korean, Thai, Tibetan, Georgian, and Greek.
- Supports text extraction from entire images or selected regions.
- Requires no installation; works directly with cURL or Bash scripts.
- Processes tables and receipts.
- Automatically corrects rotated, skewed, and noisy images with built‑in preprocessing filters.
- Removes dirt, scratches, glare, gradients, and other image artifacts.
- Built-in spell checker corrects misspelled words automatically.
- Supports multi-page PDF and TIFF files.
- Recognition results returned as plain text, searchable PDF, Excel, CSV, hOCR, or JSON.
- Recognizes images from a URL without uploading files to cloud storage.
- Hosted on high-performance GPU-based Amazon servers for maximum speed.
- Requires minimal local resources, enabling use on low-power devices and serverless environments.
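Two of the features listed above, recognition from a URL and a choice of result format, can be sketched in Bash as follows. The endpoint, the query parameter names "url" and "format", the format keywords, and the file extensions chosen for each format are all assumptions made for illustration; the actual names are defined in the official API reference.

```shell
#!/usr/bin/env bash
# Hypothetical placeholders: replace with values from the official API reference.
OCR_ENDPOINT="${OCR_ENDPOINT:-https://api.example.com/ocr/recognize}"  # assumed URL
OCR_TOKEN="${OCR_TOKEN:-}"                                             # assumed bearer token

# Map a result format keyword to a local file extension.
# Both the keywords and the extensions are illustrative assumptions.
result_extension() {
  case "$1" in
    text)  echo txt ;;
    json)  echo json ;;
    pdf)   echo pdf ;;
    excel) echo xlsx ;;
    csv)   echo csv ;;
    hocr)  echo hocr ;;
    *)     echo "unsupported format: $1" >&2; return 1 ;;
  esac
}

# Ask the (assumed) endpoint to recognize an image by URL and save the result.
recognize_url() {
  local image_url="$1" format="${2:-text}" ext
  ext="$(result_extension "$format")" || return 1
  # --get turns the --data-urlencode pairs into a URL-encoded query string.
  curl --silent --show-error --fail --get "$OCR_ENDPOINT" \
    --header "Authorization: Bearer $OCR_TOKEN" \
    --data-urlencode "url=$image_url" \
    --data-urlencode "format=$format" \
    --output "result.$ext"
}
```

Validating the format before the request means an unsupported keyword fails locally instead of costing a round trip to the service.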
Benefits
- Enables OCR entirely through command-line automation, which suits scripting, serverless workflows, and batch operations.
- Eliminates dependency on local OCR libraries or hardware capacity, because processing occurs in the cloud.
- Supports a broad range of languages and scripts, making it suitable for multilingual document processing.
- Handles low-quality images effectively with advanced noise reduction, distortion correction, and preprocessing features.
- Produces machine‑readable and structured outputs for integration into downstream workflows such as document indexing, data extraction, analytics, and archiving.
- Flexible enough for use in automation pipelines, DevOps workflows, CI/CD systems, and cloud functions.
- Provides seamless integration with other Aspose Cloud APIs, enabling complex document workflows, OMR processing, conversion, and multi-format transformations.
- Ideal for businesses of all sizes requiring lightweight OCR integration without full application development.
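The batch and scripting use mentioned above might look like the following Bash sketch, which walks a directory of images and writes one text file per image. The function `ocr_one` is a hypothetical stand-in that only echoes the file name so the loop can be dry-run offline; in real use it would be replaced by a cURL call to the OCR endpoint documented in the official API reference.

```shell
#!/usr/bin/env bash
# Hypothetical stand-in for the recognizer: swap in a real cURL call.
# Here it only echoes the file name, so no network access is needed.
ocr_one() {
  printf 'recognized: %s\n' "$(basename "$1")"
}

# Process every PNG, JPEG, or TIFF in a directory and write one .txt per image.
# Prints the number of images processed successfully.
batch_ocr() {
  local in_dir="$1" out_dir="$2" file name count=0
  mkdir -p "$out_dir"
  for file in "$in_dir"/*.png "$in_dir"/*.jpg "$in_dir"/*.jpeg \
              "$in_dir"/*.tif "$in_dir"/*.tiff; do
    [ -e "$file" ] || continue            # skip glob patterns that matched nothing
    name="$(basename "${file%.*}")"       # file name without directory or extension
    if ocr_one "$file" > "$out_dir/$name.txt"; then
      count=$((count + 1))
    else
      echo "failed: $file" >&2            # report the failure and keep going
    fi
  done
  echo "$count"
}
```

A call such as `batch_ocr ./scans ./text` processes the supported images in `./scans` and reports how many succeeded. A failed image is logged to standard error without stopping the run.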