This article shows how to to extract data from Microsoft Office Excel spreadsheets....Annotation Product Solution GroupDocs...Excel spreadsheets: Extract text from Microsoft Office Excel...
In this article, you will learn how to apply watermark to document pages when converting document with GroupDocs.Conversion for Node.js via Java API....Annotation Product Solution GroupDocs...WatermarkOptions - watermark text - watermark font name - watermark...
Learn this article and check how to convert Microsoft Word DOC/DOCX and Open Document ODT/OTT files hiding comments and tracked changes panel, setting default font and applying font substitution using features of GroupDocs.Conversion for .NET API....Annotation Product Solution GroupDocs...fields in PDF or convert them to text. Default is false. If enabled...
OCR support means the ability to connect an external module (library) for the recognition of printed Text (optical character recognition, OCR) on images, either separate or embedded in documents.
To connect OCR, you need to implement the IOcrConnector interface in the client code.
The following example demonstrates how to implement the OCR connector using com.aspose.ocr library for Text recognition in images.
const indexFolder = 'c:/MyIndex/'; const documentsFolder = 'c:/MyDocuments/'; const query = 'Einstein'; // Creating an index const index = new groupdocs....Annotation Product Solution GroupDocs...for the recognition of printed text (optical character recognition...
GroupDocs.Metadata for .NET provides functionality that allows working with different kinds of WordProcessing documents such as DOC, DOCX, ODT, etc. For the full list of supported document formats please refer to Supported Document Formats.
Detecting the exact type of a document The following sample of code will help you to detect the exact type of a loaded document and extract some additional file format information.
Load a WordProcessing document Extract the root metadata package Use the FileType property to obtain file format information AdvancedUsage....Annotation Product Solution GroupDocs...content, obtain pieces of hidden text, work with document revisions...
Discover how to export indexed documents to HTML using Java with simple steps. Use Java export indexed documents to HTML to enhance document organization efficiently....Annotation Product Family GroupDocs...indexed, including properties like text storage options Create the Index...
Learn to perform phrase search in documents using Java. Explore methods to search documents by phrases in Java and simplify your document search tasks....Annotation Product Family GroupDocs...folder in the index Perform a text-based phrase search by providing...
Efficiently perform regular expression search using Java for advanced pattern matching in documents. Learn how to implement regular expression search in Java....Annotation Product Family GroupDocs...patterns for finding specific text within documents. This feature...