To extract a text from PDF documents getText and getText(int) methods are used. These methods allow to extract a text from the entire document or a text from the selected page.
Here are the steps to extract a text from PDF document:
Instantiate Parser object for the initial document; Call getText method and obtain TextReader object; Read a text from reader. Warning getText method returns null value if text extraction isn’t supported for the document....Editor Product Solution GroupDocs...
GroupDocs.Viewer for Java provides different plans for purchase or offers a Free Trial and a 30-day Temporary License for evaluation....Editor Product Solution GroupDocs...
This article explains how to manage loading of external resources contained by a document with GroupDocs.Viewer within your .NET applications....Editor Product Solution GroupDocs...
Learn this article and check how to load and convert Email documents with advanced options using GroupDocs.Conversion for Java API....Editor Product Solution GroupDocs...
This article shows that how Java redaction API allows you to replace or remove metadata using filters or search by regular expression....Editor Product Solution GroupDocs...
This article shows how to implement one or several interfaces for the document's features and required redaction....Editor Product Solution GroupDocs...
Learn how to deal.If you have a corporate sensitive data removal policy as a list of redaction rules, you don't need to specify them in your code. You can specify an XML document with a list of pre-configured redactions....Editor Product Solution GroupDocs...
The page describes how to edit annotation properties - change annotation position, size, appearance etc. when annotate documents using GroupDocs.Annotation for .NET....Editor Product Solution GroupDocs...
This topic explains how to get document pages preview as images with various options by GroupDocs.Signature API....Editor Product Solution GroupDocs...