To extract a text from PDF documents getText and getText(int) methods are used. These methods allow to extract a text from the entire document or a text from the selected page.
Here are the steps to extract a text from PDF document:
Instantiate Parser object for the initial document; Call getText method and obtain TextReader object; Read a text from reader. Warning getText method returns null value if text extraction isn’t supported for the document....Assembly Product Solution GroupDocs...
GroupDocs.Viewer for Java provides different plans for purchase or offers a Free Trial and a 30-day Temporary License for evaluation....Assembly Product Solution GroupDocs...
This article explains how to manage loading of external resources contained by a document with GroupDocs.Viewer within your .NET applications....Assembly Product Solution GroupDocs...
This article describes how to insert an edited worksheet into an existing spreadsheet using GroupDocs.Editor for Node.js....Assembly Product Solution GroupDocs...
This guide demonstrates how to edit CSV, TSV, comma-separated value and other text files with different settings and many other powerful features of GroupDocs.Editor for .NET....Assembly Product Solution GroupDocs...
Learn this article and check how to load and convert Email documents with advanced options using GroupDocs.Conversion for Java API....Assembly Product Solution GroupDocs...
This article shows that how Java redaction API allows you to replace or remove metadata using filters or search by regular expression....Assembly Product Solution GroupDocs...
This article shows how to implement one or several interfaces for the document's features and required redaction....Assembly Product Solution GroupDocs...