GroupDocs.Redaction supports both types of image documents for Optical Character Recognition (OCR):
image files, such as printed document scans (PNG, JPG, etc.) embedded images within office documents (PDF, DOCX, etc.) You have to implement IOcrConnector interface and pass the instance to RedactorSettings constructor.
For more details, see OCR Usage Basics article.
OCR usage limitations There are the following limitations of the OCR with GroupDocs.Redaction for Java v21.6:
textual replacements are not supported, so you have to use color box replacements to redact text in images.... Spreadsheets, HTML and Markdown document...
Release notes Package repository GroupDocs.Assembly for Python via .NET - A assembly is a document automation and reports generation API designed to create custom documents from templates. This Python reporting API intelligently assembles the given data with the defined template document and generates an output document based on the data source, in the template’s format as well as in the specified output format.
GroupDocs.Assembly supports over 170 popular file formats.... Load text documents, spreadsheets, presentations, PDF files...
A .NET API to manipulate metadata is being consumed as metadata viewer for CRUD operations. It helps developers to read, write, edit and remove meta information from all popular document file formats... Load text documents, spreadsheets, presentations, PDF files...
Release notes Package repository GroupDocs.Assembly for Java - A assembly is a document automation and reports generation API designed to create custom documents from templates. This Java reporting API intelligently assembles the given data with the defined template document and generates an output document based on the data source, in the template’s format as well as in the specified output format.
GroupDocs.Assembly supports over 170 popular file formats. Load text documents, Spreadsheets, presentations, PDF files, email messages, and images.... Load text documents, spreadsheets, presentations, PDF files...
Release notes Package repository GroupDocs.Assembly for Net - A assembly is a document automation and reports generation API designed to create custom documents from templates. This Net reporting API intelligently assembles the given data with the defined template document and generates an output document based on the data source, in the template’s format as well as in the specified output format.
GroupDocs.Assembly supports over 170 popular file formats. Load text documents, Spreadsheets, presentations, PDF files, email messages, and images.... Load text documents, spreadsheets, presentations, PDF files...
Document comparison is a process of finding differences in two documents of same format. It can be a content difference or can be any contextual differences. GroupDocs.Comparison for Java compares a source document with a target document and produces a result document which highlights the differences in a suitable way i.e. colored highlighting or by differing font information that has either been added to or removed from the reference document to produce the target document....documents (like Microsoft Word), spreadsheets (like Microsoft Excel),...
Optimize PDFs for web viewing or reduce file size with GroupDocs.Viewer. Adjust resources and improve loading times....file resources Optimize spreadsheets in a PDF file Optimize images...
Common Utilities Class Document Directories Applying Product License Metered Licensing Data Layer Class Custom Objects Populate Data (Getting Customers Details) Getting Order Details Getting Product Details Getting Single Customer Details Database Entities Getting Customer Details From Database Getting Order Details From Database Getting Product Details From Database Getting Single Customer Details From Database DataSet Source Getting Customer, Order Details From DataSet Getting Product Details From DataSet Getting Single Customer Details From DataSet XML DataSource Getting Complete Details From XML Files Getting Single Customer Details From XML DataSource JOSN DataSource Getting Customer Data From JSON File Getting Customer And Order Data From JSON File Getting Product Data From JSON File Getting Single Customer Data From JSON File Use Spreadsheets as Data Sources Use Tables of Word Processing Documents as Data Sources Use Tables of Presentation Documents as Data Sources Complete DataLayer Class Code Complete DynamicEntity Class Code Complete CommonUtilities Class Code...Data From JSON File Use Spreadsheets as Data Sources Use Tables...