OCR support is the ability to connect an external module (library) that recognizes printed text (optical character recognition, OCR) in images, whether they are separate files or embedded in documents.
To connect OCR, you need to implement the IOcrConnector interface in the client code.
The following example demonstrates how to implement the OCR connector using the com.aspose.ocr library for text recognition in images.
String indexFolder = "c:\\MyIndex";
String documentFolder = "c:\\MyDocuments";
// Creating an index
Index index = new Index(indexFolder);
// Setting the OCR indexing options
IndexingOptions options = new IndexingOptions();
options....
...over your PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX and more with our...
Here is the conversion code:
try (Converter converter = new Converter(fileName)) {
final ImageConvertOptions options = new ImageConvertOptions();
options.setFormat(FileType.fromEx…
...mean the output of conversion pptx slides to images have good quality...
An interface is used to receive information about errors, warnings, and events that occur during data extraction....data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
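The callback idea in the snippet above (an interface that receives errors, warnings, and events during extraction) can be illustrated with a minimal, library-independent sketch. Every name here (`ExtractionListener`, `Severity`, `CollectingListener`, `StubExtractor`) is hypothetical and is not the GroupDocs API; the stub extractor only trims a string so that the notification flow can be seen without any external dependency.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical severity levels; assumed for illustration only.
enum Severity { ERROR, WARNING, EVENT }

// Hypothetical callback interface: the caller implements it,
// and the extractor reports messages through it.
interface ExtractionListener {
    void onMessage(Severity severity, String description);
}

// Simple listener that stores every message it receives.
final class CollectingListener implements ExtractionListener {
    private final List<String> messages = new ArrayList<>();

    @Override
    public void onMessage(Severity severity, String description) {
        messages.add(severity + ": " + description);
    }

    List<String> getMessages() {
        return messages;
    }
}

// Stand-in for a real extractor; it only trims the input text.
final class StubExtractor {
    static String extract(String input, ExtractionListener listener) {
        if (input == null) {
            listener.onMessage(Severity.ERROR, "no input supplied");
            return "";
        }
        if (input.trim().isEmpty()) {
            listener.onMessage(Severity.WARNING, "input contains no text");
        }
        listener.onMessage(Severity.EVENT, "extraction finished");
        return input.trim();
    }
}

final class NotificationDemo {
    public static void main(String[] args) {
        CollectingListener listener = new CollectingListener();
        StubExtractor.extract("   ", listener);
        // Print everything the extractor reported.
        listener.getMessages().forEach(System.out::println);
    }
}
```

Passing the listener as an argument keeps the extractor free of logging decisions: the caller chooses whether to collect, print, or ignore messages.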
This article demonstrates how to create and edit WordProcessing documents using GroupDocs.Editor for .NET. It also covers supported formats like spreadsheets and presentations....workbooks (XLSX), presentations (PPTX), e-books (EPUB) and emails...
Learn about redaction API methods to reject or approve specific changes during the redaction process...formats like PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more...
This article explains how to detect the document file type and calculate the number of pages when converting a file with GroupDocs.Conversion for Java....Presentation Documents (PPT, PPTX) For presentation files, you...Converter ( "presentation-template.pptx" )) { PresentationDocument doc_info...
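As a rough illustration of the file-type detection step mentioned above, the following sketch classifies a file by its extension using only the Java standard library. It is not the GroupDocs.Conversion API and does not compute page counts; `DocumentKind` and `ExtensionDetector` are hypothetical names, and the extension-to-family mapping is an assumption limited to formats named in the snippets.

```java
import java.util.Locale;

// Hypothetical document families; not the library's own type list.
enum DocumentKind { PRESENTATION, WORD_PROCESSING, SPREADSHEET, PDF, UNKNOWN }

final class ExtensionDetector {
    // Classifies a file name by the text after its last dot.
    // Assumes a plain file name; directory components containing dots are not handled.
    static DocumentKind detect(String fileName) {
        int dot = fileName.lastIndexOf('.');
        if (dot < 0 || dot == fileName.length() - 1) {
            return DocumentKind.UNKNOWN; // no extension present
        }
        String ext = fileName.substring(dot + 1).toLowerCase(Locale.ROOT);
        switch (ext) {
            case "ppt":
            case "pptx":
                return DocumentKind.PRESENTATION;
            case "doc":
            case "docx":
                return DocumentKind.WORD_PROCESSING;
            case "xls":
            case "xlsx":
                return DocumentKind.SPREADSHEET;
            case "pdf":
                return DocumentKind.PDF;
            default:
                return DocumentKind.UNKNOWN;
        }
    }
}

final class DetectionDemo {
    public static void main(String[] args) {
        System.out.println(ExtensionDetector.detect("presentation-template.pptx"));
    }
}
```

Extension matching is only a heuristic; a converter library that opens the file can inspect its actual contents, which is more reliable than relying on the name alone.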