To extract tables from Microsoft Office Word document getStructure method is used. This method returns XML representation of the document. Tables are represented by “table” tag. For more details, see Extract text structure.
Warning getStructure method returns null value if text structure extraction isn’t supported for the document. For example, text structure extraction isn’t supported for TXT files. Therefore, for TXT file getStructure method returns null. If Microsoft Office Word document has no text, getStructure method returns an empty org....Navigation Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
Learn how to search for keywords and use regular expressions to find text in documents using GroupDocs.Parser for Python via .NET. Search text with case sensitivity and whole word options....Navigation Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
With GroupDocs.Viewer for Java you can render files to HTML format. As a simple usecase please see how easily you can convert Microsoft Word to clean html....Navigation Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
This article explains how to use OCR....Navigation Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
Save assembled documents in the same format as the template (e.g., DOCX to DOCX, XLSX to XLSX)....Navigation Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
This article explains that how to extract tables from document page....Navigation Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
This article explains how to sign a document electronically with generated Stamp signatures by GroupDocs.Signature Api....Navigation Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
To extract a text from PDF documents getText and getText(int) methods are used. These methods allow to extract a text from the entire document or a text from the selected page.
Here are the steps to extract a text from PDF document:
Instantiate Parser object for the initial document; Call getText method and obtain TextReader object; Read a text from reader. Warning getText method returns null value if text extraction isn’t supported for the document....Navigation Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...
This article explains how to subscribe for search electronic signatures events like start, progress and completion with GroupDocs.Signature Api....Navigation Products GroupDocs.Total Product Family GroupDocs.Viewer Product...Solution GroupDocs.Annotation Product Solution GroupDocs.Conversion...