To extract data from PDF documents parseForm and parseByTemplate(Template) methods are used. Both methods return DocumentData object. For details, see Working With Extracted Data.
Here are the steps to extract data from PDF Form:
Instantiate Parser object for the initial document Call parseForm method and obtain the DocumentData object; Check if data isn’t null (parse form is supported for the document); Iterate over field data to obtain form data. The following example shows the use case when a user fills in PDF form and send it by email (for example)....To extract data from PDF documents parseForm and parseByTemplate(Template) methods are used. Both methods return DocumentData object. For details, see Working With Extracted Data.
Here are the steps to extract data from PDF Form:
Instantiate Parser object for the initial document Call parseForm method and obtain the DocumentData object; Check if data isn’t null (parse form is supported for the document); Iterate over field data to obtain form data. The following example shows the use case when a user fills in PDF form and send it by email (for example).
Note This feature is only compatible with GroupDocs.Assembly for Java 19.10 or later releases. To access XML data while building a report, you can use facilities of DataSet to read XML into it and then pass it to the assembler as a data source. However, if your scenario does not permit to specify XML schema while loading XML into DataSet, all attributes and text values of XML elements are loaded as strings then....Note This feature is only compatible with GroupDocs.Assembly for Java 19.10 or later releases. To access XML data while building a report, you can use facilities of DataSet to read XML into it and then pass it to the assembler as a data source. However, if your scenario does not permit to specify XML schema while loading XML into DataSet, all attributes and text values of XML elements are loaded as strings then.
With GroupDocs.Viewer for Java you can render files to HTML, PNG, JPEG and PDF formats, list and save attachments, embedded files and compressed files, and extract document text....With GroupDocs.Viewer for Java you can render files to HTML, PNG, JPEG and PDF formats, list and save attachments, embedded files and compressed files, and extract document text.
With GroupDocs.Viewer for Node.js you can render files to HTML, PNG, JPEG and PDF formats, list and save attachments, embedded files and compressed files, and extract document text....With GroupDocs.Viewer for Node.js you can render files to HTML, PNG, JPEG and PDF formats, list and save attachments, embedded files and compressed files, and extract document text.
This article demonstrates how to save edited text documents, spreadsheets, and presentations with GroupDocs.Editor for Node.js via Java API....This article demonstrates how to save edited text documents, spreadsheets, and presentations with GroupDocs.Editor for Node.js via Java API.
This article explains how to detect the document's file type and calculate the number of pages when converting a file with GroupDocs.Conversion for .NET....This article explains how to detect the document's file type and calculate the number of pages when converting a file with GroupDocs.Conversion for .NET.
Learn this article to know how to get edited document HTML markup - body without head tag, content in a raw and base64 form and other using GroupDocs.Editor for .NET API....Learn this article to know how to get edited document HTML markup - body without head tag, content in a raw and base64 form and other using GroupDocs.Editor for .NET API.
Learn how to generate a preview for any page of an existing WordProcessing document in SVG format using GroupDocs.Editor for Node.js via Java....Learn how to generate a preview for any page of an existing WordProcessing document in SVG format using GroupDocs.Editor for Node.js via Java.