This article explains that how to extract hyperlinks from documents....Parser Product Solution GroupDocs...GroupDocs.Parser Product Family / GroupDocs.Parser for Java /...
This article explains that how to extract formatted text from document....Parser Product Solution GroupDocs...GroupDocs.Parser Product Family / GroupDocs.Parser for .NET /...
There might be cases when the document is presented only as a stream (without a copy on the local disk). To avoid the overhead of saving documents to the disk, GroupDocs.Parser enables to extract data from streams directly.
The following example shows how to load the document from the stream:
// Create the stream try (InputStream stream = new FileInputStream(Constants.SamplePdf)) { // Create an instance of Parser class with the stream try (ParserParser = new Parser(stream)) { // Extract a text into the reader try (TextReader reader = Parser....Parser Product Solution GroupDocs...GroupDocs.Parser Product Family / GroupDocs.Parser for Java /...
To extract emails from Outlook Storage getContainer method is used. This method returns the collection of ContainerItem objects.
Outlook Storage item can contain the following metadata:
Name Description date The time and date at which the Outlook Storage item was last modified. email-sender The value of “sender” field. email-to The value of “to” field. subject The value of “subject” field. Outlook Storage container consists of email documents (msg files).
Here are the steps to extract an email text from outlook storage:...Parser Product Solution GroupDocs...GroupDocs.Parser Product Family / GroupDocs.Parser for Java /...
This article shows how to extract table of contents from Microsoft Word (DOC, DOCX etc), PDF documents and Ebooks (CHM, EPUB)....Parser Product Solution GroupDocs...GroupDocs.Parser Product Family / GroupDocs.Parser for .NET /...
This article shows how to extract table of contents from Microsoft Word (DOC, DOCX etc), PDF documents and Ebooks (CHM, EPUB)....Parser Product Solution GroupDocs...GroupDocs.Parser Product Family / GroupDocs.Parser for Java /...
This article explains that how to extract Markdown formatted text from document page....Parser Product Solution GroupDocs...GroupDocs.Parser Product Family / GroupDocs.Parser for .NET /...
This article shows how to get the basic document info....Parser Product Solution GroupDocs...GroupDocs.Parser Product Family / GroupDocs.Parser for Java /...
This article explains that how to extract HTML formatted text from document page in Java....Parser Product Solution GroupDocs...GroupDocs.Parser Product Family / GroupDocs.Parser for Java /...
To extract a text from HTML documents GetText method is used. This method allows to extract a text from the entire document. Pagination and raw mode is not supported for emails....Parser Product Solution GroupDocs...GroupDocs.Parser Product Family / GroupDocs.Parser for .NET /...