GroupDocs.Metadata for Java supports a range of spreadsheet formats, including XLS, XLSX, and ODS. For the full list of supported document formats, please refer to Supported document formats.
Detecting the exact type of a document
The following code sample helps you detect the exact type of a loaded spreadsheet and extract some additional file format information.
Load a Spreadsheet document
Extract the root metadata package
Use the getSpreadsheetType method to obtain file format information
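The library sample itself is not reproduced in this listing. As a rough illustration of what format detection involves, the sketch below uses only the JDK and inspects a file's leading bytes. It is not GroupDocs.Metadata code and does not show how getSpreadsheetType works internally; the class name SpreadsheetSniffer and its methods are hypothetical. It can only tell the container family apart (OLE2 for legacy XLS, ZIP for XLSX and ODS), which is much coarser than the information the library reports.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;

// Hypothetical helper, not part of GroupDocs.Metadata.
public class SpreadsheetSniffer {
    // OLE2 compound file signature, used by legacy binary XLS workbooks.
    private static final byte[] OLE2 = {
        (byte) 0xD0, (byte) 0xCF, 0x11, (byte) 0xE0,
        (byte) 0xA1, (byte) 0xB1, 0x1A, (byte) 0xE1
    };
    // ZIP local file header signature, used by XLSX and ODS containers.
    private static final byte[] ZIP = {0x50, 0x4B, 0x03, 0x04};

    // Classifies a file by the first bytes of its content.
    static String sniff(byte[] header) {
        if (startsWith(header, OLE2)) return "OLE2 container (possibly XLS)";
        if (startsWith(header, ZIP)) return "ZIP container (possibly XLSX or ODS)";
        return "unknown";
    }

    private static boolean startsWith(byte[] data, byte[] prefix) {
        return data.length >= prefix.length
            && Arrays.equals(Arrays.copyOf(data, prefix.length), prefix);
    }

    // Reads up to eight bytes from the file and classifies them.
    static String sniff(Path file) throws IOException {
        try (InputStream in = Files.newInputStream(file)) {
            return sniff(in.readNBytes(8));
        }
    }

    public static void main(String[] args) throws IOException {
        if (args.length == 1) {
            System.out.println(sniff(Path.of(args[0])));
        }
    }
}
```

Telling XLSX from ODS would additionally require looking inside the ZIP archive, which this sketch deliberately leaves out.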
This article shows how to access XMP metadata in a file of any supported format.
Note: This feature is only compatible with GroupDocs.Assembly for Java 19.10 or later releases. To access XML data while building a report, you can use the facilities of DataSet to read XML into it and then pass it to the assembler as a data source. However, if your scenario does not allow you to specify an XML schema while loading XML into DataSet, all attributes and text values of XML elements are then loaded as strings.
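To make the "loaded as strings" point concrete, here is a minimal sketch that uses the JDK's built-in DOM parser rather than the GroupDocs.Assembly DataSet class. The class UntypedXmlDemo, its methods, and the sample XML are hypothetical. It shows the general consequence of parsing XML without schema information: every attribute and text value arrives as a String and has to be converted explicitly before it can be used as a number.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.xml.sax.InputSource;

// Hypothetical demo using the JDK DOM parser, not GroupDocs.Assembly.
public class UntypedXmlDemo {
    // Parses an XML string without any schema and returns its root element.
    static Element parseRoot(String xml) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xml)));
            return doc.getDocumentElement();
        } catch (Exception e) {
            throw new IllegalStateException("Could not parse XML", e);
        }
    }

    // The attribute is returned as a String; the caller must convert it.
    static int quantityOf(String xml) {
        String raw = parseRoot(xml).getAttribute("quantity");
        return Integer.parseInt(raw.trim());
    }

    public static void main(String[] args) {
        String xml = "<item quantity=\"3\"><price>9.50</price></item>";
        Element item = parseRoot(xml);
        // Element text is also a plain String until parsed.
        String rawPrice = item.getElementsByTagName("price").item(0).getTextContent();
        double total = quantityOf(xml) * Double.parseDouble(rawPrice);
        System.out.println(total);
    }
}
```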
To extract data from PDF documents, use the parseForm and parseByTemplate(Template) methods. Both methods return a DocumentData object. For details, see Working With Extracted Data.
Here are the steps to extract data from a PDF form:
Instantiate a Parser object for the initial document;
Call the parseForm method and obtain the DocumentData object;
Check that the data isn't null (that is, that form parsing is supported for the document);
Iterate over the field data to obtain the form data.
The following example covers the case where a user fills in a PDF form and sends it, for example, by email.
This article explains how to detect the document's file type and calculate the number of pages when converting a file with GroupDocs.Conversion for .NET.
Render documents to HTML, PNG, JPEG, PDF. Extract text, list attachments, and transform pages with GroupDocs.Viewer for Python.
Introduction to GroupDocs.Assembly for Java
GroupDocs.Assembly for Java is a Java API engine designed to generate documents based on templates. In essence, the engine accepts a template document and some data, assembles the data with the template according to the syntax defined in the template, and then generates the output document in the template's format or in a specified output format. Since GroupDocs.Assembly for Java can generate documents based on a data source, it essentially serves two purposes: Document Automation and Report Generation.
This topic describes how to use the GroupDocs.Viewer Node.js API to convert images to HTML, PDF, PNG, and JPEG formats.