This article shows how to load and convert HTML documents with advanced options using the GroupDocs.Conversion for Java API, with conversion to formats such as PDF, DOCX, or PNG.
To extract text from Microsoft OneNote sections, use the getText and getText(int) methods. They extract text from the entire document or from a selected page. Raw mode is not supported for Microsoft OneNote.
Here are the steps to extract text from a Microsoft OneNote section:
1. Instantiate a Parser object for the initial section
2. Call the getText method and obtain a TextReader object
3. Read the text from the reader
To extract emails from Outlook Storage, use the getContainer method. It returns a collection of ContainerItem objects.
An Outlook Storage item can contain the following metadata:
Name            Description
date            The time and date at which the Outlook Storage item was last modified.
email-sender    The value of the “sender” field.
email-to        The value of the “to” field.
subject         The value of the “subject” field.
An Outlook Storage container consists of email documents (msg files).
Here are the steps to extract email text from Outlook Storage: ...
This article describes how to minimize resource shortages during the indexing process.
This article explains how to update indexed documents and how to update the index version.
This article gives the complete specification of the search query DSL used in text queries with the Java search API.
This article explains how to extract tables from a document.
GroupDocs.Parser allows you to extract emails from remote servers and data from the emails. It supports the POP, IMAP, and EWS protocols.
Get ZIP format metadata
The API can detect ZIP archives and read their format metadata. Follow these steps:
1. Load a ZIP archive
2. Get the root metadata package
3. Extract the native metadata package using the ZipRootPackage.getZipPackage method
4. Read the ZIP archive properties
5. Loop through ZipPackage.getFiles to extract information about the archived files
The following code snippet shows how to get metadata from a ZIP archive.
advanced_usage.managing_metadata_for_specific_formats.archive.ZipReadNativeMetadataProperties
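The GroupDocs sample named above is not reproduced in this listing. As a rough, library-independent illustration of the kind of information the steps describe (archive-level properties plus per-file details), the sketch below uses only the JDK's java.util.zip package. It is not the GroupDocs.Metadata API; the class name ZipMetadataSketch and the method describe are hypothetical names chosen for this example.

```java
import java.io.IOException;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipFile;

// Hypothetical helper, JDK only; it does not use GroupDocs.Metadata.
class ZipMetadataSketch {

    /** Returns one summary line for the archive, then one line per archived file. */
    static List<String> describe(Path archive) throws IOException {
        List<String> lines = new ArrayList<>();
        try (ZipFile zip = new ZipFile(archive.toFile())) {
            // Archive-level properties: the optional comment and the entry count.
            lines.add("comment=" + zip.getComment() + ", entries=" + zip.size());
            // Per-file properties read from the archive's central directory.
            for (ZipEntry entry : Collections.list(zip.entries())) {
                lines.add(entry.getName()
                        + " compressed=" + entry.getCompressedSize()
                        + " size=" + entry.getSize()
                        + " modified=" + entry.getLastModifiedTime());
            }
        }
        return lines;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java ZipMetadataSketch <archive.zip>");
            return;
        }
        describe(Paths.get(args[0])).forEach(System.out::println);
    }
}
```

The loop over the entries plays the same role as step 5 (iterating over the archived files); the properties exposed by ZipPackage itself may differ from the ones printed here.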
Reading AVI header properties
The GroupDocs.Metadata API supports extracting format-specific information from AVI file headers.
The following are the steps to read the header of an AVI file.
1. Load an AVI video
2. Get the root metadata package
3. Extract the native metadata package using the AviRootPackage.getHeader method
4. Read the AVI header properties
advanced_usage.managing_metadata_for_specific_formats.video.avi.AviReadHeaderProperties
try (Metadata metadata = new Metadata(Constants.InputAvi)) {
    // The root package gives access to the AVI-specific header.
    AviRootPackage root = metadata.getRootPackageGeneric();
    System.out.println(root.getHeader().getAviHeaderFlags());
    System.out.println(root.getHeader().getHeight());
    System.out.println(root.getHeader().getWidth());
    System.out.println(root.getHeader().getTotalFrames());
    System.out.println(root.getHeader().getInitialFrames());
    System.out.println(root.getHeader().getMaxBytesPerSec());
    // ... (the rest of the snippet is truncated in the source)
}
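To make the header properties more concrete, here is a minimal sketch that reads the same kind of values straight from the file bytes using only the JDK. It is not how GroupDocs.Metadata works internally; the names AviHeaderSketch, MainHeader, and parse are hypothetical. It assumes the standard RIFF/AVI layout, in which the main header is an "avih" chunk holding little-endian 32-bit fields.

```java
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.ByteOrder;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;

// Hypothetical helper, JDK only; it does not use GroupDocs.Metadata.
class AviHeaderSketch {

    /** Values of the main AVI header ("avih" chunk), listed in file order. */
    static final class MainHeader {
        int microSecPerFrame, maxBytesPerSec, paddingGranularity, flags,
            totalFrames, initialFrames, streams, suggestedBufferSize, width, height;
    }

    private static String fourCc(byte[] data, int offset) {
        return new String(data, offset, 4, StandardCharsets.US_ASCII);
    }

    static MainHeader parse(byte[] file) {
        if (file.length < 12 || !fourCc(file, 0).equals("RIFF") || !fourCc(file, 8).equals("AVI ")) {
            throw new IllegalArgumentException("not a RIFF/AVI file");
        }
        ByteBuffer buf = ByteBuffer.wrap(file).order(ByteOrder.LITTLE_ENDIAN);
        int pos = 12; // first chunk after the "RIFF" <size> "AVI " preamble
        while (pos + 8 <= file.length) {
            String id = fourCc(file, pos);
            int size = buf.getInt(pos + 4);
            if (size < 0) {
                throw new IllegalArgumentException("chunk too large for this sketch");
            }
            if (id.equals("LIST")) {
                pos += 12; // step into the list: skip id, size, and list type
                continue;
            }
            if (id.equals("avih")) {
                if (size < 40 || pos + 8 + 40 > file.length) {
                    throw new IllegalArgumentException("truncated avih chunk");
                }
                int base = pos + 8;
                MainHeader h = new MainHeader();
                h.microSecPerFrame = buf.getInt(base);
                h.maxBytesPerSec = buf.getInt(base + 4);
                h.paddingGranularity = buf.getInt(base + 8);
                h.flags = buf.getInt(base + 12);
                h.totalFrames = buf.getInt(base + 16);
                h.initialFrames = buf.getInt(base + 20);
                h.streams = buf.getInt(base + 24);
                h.suggestedBufferSize = buf.getInt(base + 28);
                h.width = buf.getInt(base + 32);
                h.height = buf.getInt(base + 36);
                return h;
            }
            pos += 8 + size + (size & 1); // chunks are padded to an even length
        }
        throw new IllegalArgumentException("no avih chunk found");
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java AviHeaderSketch <video.avi>");
            return;
        }
        MainHeader h = parse(Files.readAllBytes(Paths.get(args[0])));
        System.out.println(h.width + "x" + h.height + ", frames=" + h.totalFrames);
    }
}
```

The sketch loads the whole file into memory for simplicity; for large videos, reading only the first few kilobytes would normally be enough to reach the header.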