GroupDocs.Redaction supports both types of image documents for Optical Character Recognition (OCR):
image files, such as printed document scans (PNG, JPG, etc.) embedded images within office documents (PDF, DOCX, etc.) You have to implement IOcrConnector interface and pass the instance to RedactorSettings constructor.
For more details, see OCR Usage Basics article.
OCR usage limitations There are the following limitations of the OCR with GroupDocs.Redaction for Java v21.6:
textual replacements are not supported, so you have to use color box replacements to redact text in images....Viewer Product Solution GroupDocs...text in images. Spreadsheets, HTML and Markdown document types...
Introduction to GroupDocs.Signature for .NET - what is it and why to use...Viewer Product Solution GroupDocs...PPTX/PPT, XLSX/XLS, JPG, PNG, TIFF, HTML and many others. With GroupDocs...
Working with search results consists in obtaining information from objects of search results and highlighting occurrences in the text of documents.
Obtain search result information When a search is complete, the search method returns an object of type SearchResult. This page describes the information available in an object of type SearchResult.
From the root object of the search result, information is available on the number of documents found, the number of occurrences of the words and phrases found, as well as detailed information on each individual document....Viewer Product Solution GroupDocs...'BasicUsage/WorkWithSearchResult/Highlighted.html' ; const outputAdapter = new...
API to annotate text or images in your documents using Java. It supports PDF, Microsoft Word DOCX, Excel XLSX and PowerPoint. PPTX...Viewer Product Solution GroupDocs...Microsoft Visio, as well as HTML pages, email messages, and even...
Introduction to GroupDocs.Signature for Java - what is it and why to use...Viewer Product Solution GroupDocs...PPTX/PPT, XLSX/XLS, JPG, PNG, TIFF, HTML and many others. Sign, search...
This page describes how to detect document file type, size and calculate pages count when annotate documents or images with GroupDocs.Annotation....Viewer Product Solution GroupDocs...all formats except Email and Html. Width and height are the same...
This guide demonstrates how to edit plain text files with encoding, lists recognition, pagination and other powerful features of GroupDocs.Editor for Java...Viewer Product Solution GroupDocs...below demonstrates, how to emit HTML markup from it, edit it and...
Let's look at how to convert Excel to PDF in C# and how to use the C# Excel to PDF sample code to convert a workbook, selected sheets, or any cell range to PDF....Viewer Product Family GroupDocs.Comparison...workbooks can be simply converted to HTML, Microsoft PowerPoint, and Word...
GroupDocs.Conversion for .NET is an advanced document conversion API developed to convert files of different formats from within C# applications....Viewer Product Solution GroupDocs...PowerPoint Convert images Convert HTML Convert audio HOW-TO GUIDES...