GroupDocs.Redaction supports Optical Character Recognition (OCR) for both types of image content:
image files, such as scans of printed documents (PNG, JPG, etc.), and images embedded in office documents (PDF, DOCX, etc.). To use OCR, you have to implement the IOcrConnector interface and pass the instance to the RedactorSettings constructor.
For more details, see the OCR Usage Basics article.
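The connector pattern described above (an OCR connector object handed to a settings constructor) can be sketched in plain Java. This is only an illustration under stated assumptions: the types OcrConnector, Settings, TextFragment, and FakeOcrConnector and the helper findBoxes are hypothetical stand-ins defined inside the example. They are not the GroupDocs.Redaction classes, and the real IOcrConnector and RedactorSettings signatures should be taken from the OCR Usage Basics article.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

public class OcrConnectorSketch {

    /** Hypothetical stand-in: a recognized piece of text and its bounding box. */
    static final class TextFragment {
        final String text;
        final int x, y, width, height;

        TextFragment(String text, int x, int y, int width, int height) {
            this.text = text;
            this.x = x;
            this.y = y;
            this.width = width;
            this.height = height;
        }
    }

    /** Hypothetical stand-in for a connector contract; NOT the real IOcrConnector signature. */
    interface OcrConnector {
        List<TextFragment> recognize(InputStream image) throws IOException;
    }

    /** Hypothetical stand-in for settings that receive the connector via the constructor. */
    static final class Settings {
        private final OcrConnector connector;

        Settings(OcrConnector connector) {
            if (connector == null) {
                throw new IllegalArgumentException("connector must not be null");
            }
            this.connector = connector;
        }

        OcrConnector getConnector() {
            return connector;
        }
    }

    /** Fake engine for the sketch: reads the "image" bytes as UTF-8 text, one made-up box per word. */
    static final class FakeOcrConnector implements OcrConnector {
        @Override
        public List<TextFragment> recognize(InputStream image) throws IOException {
            String content = new String(image.readAllBytes(), StandardCharsets.UTF_8);
            List<TextFragment> fragments = new ArrayList<>();
            int x = 0;
            for (String word : content.trim().split("\\s+")) {
                if (word.isEmpty()) {
                    continue;
                }
                int width = word.length() * 10; // assumed 10 units per character
                fragments.add(new TextFragment(word, x, 0, width, 12));
                x += width + 10; // assumed 10-unit gap between words
            }
            return fragments;
        }
    }

    /** Returns the fragments equal to the given word; a caller would cover these boxes. */
    static List<TextFragment> findBoxes(Settings settings, byte[] image, String word) {
        try (InputStream in = new ByteArrayInputStream(image)) {
            List<TextFragment> hits = new ArrayList<>();
            for (TextFragment fragment : settings.getConnector().recognize(in)) {
                if (fragment.text.equals(word)) {
                    hits.add(fragment);
                }
            }
            return hits;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        Settings settings = new Settings(new FakeOcrConnector());
        byte[] image = "John Smith John".getBytes(StandardCharsets.UTF_8);
        for (TextFragment hit : findBoxes(settings, image, "John")) {
            System.out.println(hit.text + " at x=" + hit.x + ", width=" + hit.width);
        }
    }
}
```

The point of the constructor injection is that the OCR engine stays swappable: the code that locates text to redact depends only on the connector contract, so a different engine can be plugged in without touching that code.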
OCR usage limitations
OCR in GroupDocs.Redaction for Java v21.6 has the following limitations:
Textual replacements are not supported, so you have to use color box replacements to redact text in images.... Spreadsheets, HTML and Markdown document...with GroupDocs.Redaction in one of these guides: OCR Usage Basics...