Aspose.OCR for Node.js via C++

Perform OCR in Node.js with the Aspose.OCR library. Extract text from scanned documents, photos, and screenshots in Node.js services and serverless applications.

Product details

Aspose.OCR for Node.js via C++ is a fast, reliable OCR library that brings native C++ recognition to Node.js applications. Designed for cross-platform use, the API lets developers extract text from scanned documents, photographs, screenshots, and various image formats on any system where Node.js runs, including desktop environments, web services, Docker containers, and serverless platforms such as AWS Lambda and Azure Functions. The library supports over 140 languages, covering Latin and Cyrillic scripts as well as Arabic, Persian, Chinese, Hindi, Japanese, Korean, and other Asian languages, and is built to handle multilingual and mixed-language content accurately. With advanced preprocessing filters, layout detection, and GPU‑accelerated performance, it is suited to demanding OCR workloads and enterprise-grade document processing.

Features

Efficient and Accurate OCR

  • Extract text from images and photos with scan‑level accuracy.
  • High-speed OCR powered by optimized C++ cores running in Node.js.
  • Recognize text from scans, smartphone photos, PDFs, screenshots, and camera images.
  • Automatically adjust recognition for photos using detect‑areas and contrast filters.

140+ Recognition Languages

Supports major global writing systems, including:

  • Extended Latin (English, Spanish, French, German, Italian, Portuguese, and 80+ more)
  • Cyrillic (Russian, Ukrainian, Kazakh, Serbian, Belarusian, Bulgarian)
  • Arabic, Persian, Urdu
  • Chinese and other Asian scripts
  • Indic/Devanagari (Hindi, Marathi, Bhojpuri, etc.)
  • Mixed-language content detection and recognition.

Supported Input Formats

Compatible with optical sources typically captured by scanners or cameras:

  • JPEG, PNG, TIFF, BMP
  • Multi-page PDF (read-only recognition)
  • ZIP archives
  • Folder-based batch input

Supported Output Formats

Save OCR results in:

  • Text
  • JSON
  • XML

Advanced OCR Capabilities

  • Photo OCR for smartphone images
  • Searchable PDF generation
  • URL-based OCR (recognize images from web URLs)
  • Bulk/batch recognition for archives, folders, and multipage files
  • Recognition of text in any font and style
  • Layout detection, including tables and document sections
  • Fine-tuned parameter adjustments for precision
  • Region-based recognition to read selected areas
  • Character‑set limiting to improve speed & accuracy
  • Image defect detection (low contrast, noise areas, etc.)
  • Image‑to‑image text comparison
  • Text search with regex

Image Preprocessing

Enhances recognition accuracy with filters:

  • Auto‑deskewing and rotation correction
  • Noise, dirt, glare, and scratch removal
  • Contrast enhancement
  • Upscaling and resizing
  • Grayscale or black‑and‑white conversion
  • Detection of problematic areas
  • Full preprocessing automation for low-quality images

Cross‑Platform Integration

Runs everywhere Node.js runs, including:

  • Windows
  • Linux
  • macOS
  • AWS
  • Azure
  • Docker
  • GitHub Actions / CI pipelines

Easy Installation

Available as an npm package or as a self-contained download with no external dependencies. Simply install the module and start performing OCR.
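
The folder-based batch input mentioned under Supported Input Formats can be prepared with plain Node.js before any file is handed to the OCR engine. The sketch below only gathers candidate files by extension. The helper name `collectOcrInputs` and the extension list are assumptions made for illustration and are not part of the Aspose.OCR API; the recognition call itself is deliberately left out.

```javascript
const fs = require("node:fs");
const path = require("node:path");

// Extensions derived from the input formats listed on this page.
// Assumption: the exact set accepted by the library may differ.
const SUPPORTED_EXTENSIONS = new Set([
  ".jpg", ".jpeg", ".png", ".tif", ".tiff", ".bmp", ".pdf", ".zip",
]);

// Hypothetical helper: walks `dir` recursively and returns a sorted list of
// file paths whose extension (case-insensitive) is in SUPPORTED_EXTENSIONS.
function collectOcrInputs(dir) {
  const results = [];
  for (const entry of fs.readdirSync(dir, { withFileTypes: true })) {
    const fullPath = path.join(dir, entry.name);
    if (entry.isDirectory()) {
      // Descend into subfolders so nested scans are included.
      results.push(...collectOcrInputs(fullPath));
    } else if (
      entry.isFile() &&
      SUPPORTED_EXTENSIONS.has(path.extname(entry.name).toLowerCase())
    ) {
      results.push(fullPath);
    }
  }
  return results.sort();
}
```

Each returned path would then be passed to the library's recognition call as described in the official API reference.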

Benefits

  • Add powerful OCR to Node.js applications in just a few lines of code.
  • Recognize text from low-quality, skewed, or noisy images using robust preprocessing.
  • Handle multilingual documents with high accuracy and mixed-language detection.
  • Process large file batches efficiently for enterprise-scale workloads.
  • Create searchable text outputs for indexing, automation, and analytics.
  • Ideal for serverless platforms (AWS Lambda, Azure Functions).
  • Suitable for web services, document workflows, data extraction, ID processing, logistics, financial document scanning, and more.
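
To illustrate how searchable text output can feed indexing or automation, the sketch below scans already-recognized text for a pattern. It uses only standard JavaScript. The function name `findMatches`, the sample text, and the invoice-number pattern are assumptions made for illustration; the library's own built-in regex search is not shown here.

```javascript
// Hypothetical helper: returns every regex match found in recognized text,
// with the 1-based line and column where each match starts.
function findMatches(recognizedText, pattern) {
  // String.prototype.matchAll requires the global flag, so add it if missing.
  const flags = pattern.flags.includes("g") ? pattern.flags : pattern.flags + "g";
  const regex = new RegExp(pattern.source, flags);
  const matches = [];
  recognizedText.split(/\r?\n/).forEach((line, index) => {
    for (const m of line.matchAll(regex)) {
      matches.push({ line: index + 1, column: m.index + 1, text: m[0] });
    }
  });
  return matches;
}

// Sample text standing in for OCR output (made up for this example).
const sample = "Invoice INV-1042\nDue: 2024-05-01\nRef INV-1043";
console.log(findMatches(sample, /INV-\d+/));
```

Searching line by line keeps the reported positions easy to map back to the recognized page, which helps when matches need to be reviewed by a person.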