To extract a text from PDF documents getText and getText(int) methods are used. These methods allow to extract a text from the entire document or a text from the selected page.
Here are the steps to extract a text from PDF document:
Instantiate Parser object for the initial document; Call getText method and obtain TextReader object; Read a text from reader. Warning getText method returns null value if text extraction isn’t supported for the document....Editor Product Solution GroupDocs...
This article describes the main functions of GroupDocs.Parser for Python via .NET. Extracting text, images, metadata, tables, and structured data from documents with template-based parsing support....Editor Product Solution GroupDocs...
This article explains how to manage loading of external resources contained by a document with GroupDocs.Viewer within your .NET applications....Editor Product Solution GroupDocs...
This article explains how to sign a document with Text signature using GroupDocs.Signature for Python via .NET API. Learn how to add a digital signature to a PDF programmatically in Python....Editor Product Solution GroupDocs...
Mari pelajari cara melindungi file spreadsheet Excel dengan kata sandi menggunakan C#. Ubah kata sandi yang ada atau hapus untuk membuka kunci file XLS/XLSX menggunakan .NET API....mencoba membuka file spreadsheet, editor atau penampil akan meminta...
This article shows that how to redact data of sensitive nature from images of various formats like JPG, PNG, TIFF and others....Editor Product Solution GroupDocs...
Learn how to extract metadata from PowerPoint presentations (.ppt, .pptx) using GroupDocs.Parser for .NET. Extract document properties like author, title, creation date, and comments from presentation files....Editor Product Solution GroupDocs...
Follow this guide and learn how to set document metadata when saving output document after files comparison within your Java applications....Editor Product Solution GroupDocs...
Learn how to efficiently compare large sets of Word (DOCX) files using GroupDocs.Comparison for Node.js with parallel processing and performance tuning....Editor Product Solution GroupDocs...