To extract data from PDF documents parseForm and parseByTemplate(Template) methods are used. Both methods return DocumentData object. For details, see Working With Extracted Data.
Here are the steps to extract data from PDF Form:
Instantiate Parser object for the initial document Call parseForm method and obtain the DocumentData object; Check if data isn’t null (parse form is supported for the document); Iterate over field data to obtain form data. The following example shows the use case when a user fills in PDF form and send it by email (for example)....Comparison Product Solution GroupDocs.Signature Product Solution GroupDocs...
This article explains how to merge arbitrary pages from different documents into a single PDF, DOCX, Excel or PowerPoint document using GroupDocs.Merger for Java....Comparison Product Solution GroupDocs.Signature Product Solution GroupDocs...
This article shows how to access IPTC metadata in a file of any supported format....Comparison Product Solution GroupDocs.Signature Product Solution GroupDocs...
This topic describes how to use the GroupDocs.Viewer Java API to convert PowerPoint presentations to HTML, PDF, PNG, and JPEG formats....Comparison Product Solution GroupDocs.Signature Product Solution GroupDocs...
This topic describes how to use the GroupDocs.Viewer Java API to convert email messages to HTML, PDF, PNG, and JPEG formats....Comparison Product Solution GroupDocs.Signature Product Solution GroupDocs...
This article explains how to detect the document's file type and calculate the number of pages when converting a file with GroupDocs.Conversion for .NET....Comparison Product Solution GroupDocs.Signature Product Solution GroupDocs...
Following code snippet shows how to add text watermark to a document. If the document consists of multiple parts (pages, worksheets, slides, frames etc), the watermark will be added to all of them.
advanced_usage.adding_text_watermarks.AddTextWatermark
// Specify an absolute or relative path to your image. Ex: "C:\\Docs\\image.png" Watermarker watermarker = new Watermarker("image.png"); // Initialize the font to be used for watermark Font font = new Font("Arial", 19, FontStyle.Bold | FontStyle.Italic); // Create the watermark object TextWatermark watermark = new TextWatermark("Test watermark", font); // Set watermark properties watermark....Comparison Product Solution GroupDocs.Signature Product Solution GroupDocs...
GroupDocs.Metadata for Java provides functionality that allows working with different kinds of WordProcessing documents such as DOC, DOCX, ODT, etc. For the full list of supported document formats please refer to Supported document formats.
Detecting the exact type of a document The following sample of code will help you to detect the exact type of a loaded document and extract some additional file format information.
Load a WordProcessing document Extract the root metadata package Use the getWordProcessingType method to obtain file format information advanced_usage....Comparison Product Solution GroupDocs.Signature Product Solution GroupDocs...
This article demonstrates how to convert PowerPoint presentations of PPT, PPTX, ODP to other formats with couple lines of C# code....Comparison Product Solution GroupDocs.Signature Product Solution GroupDocs...