This topic describes how to use the GroupDocs.Viewer Java API to convert PDF files to HTML, PNG, and JPEG formats. ...elements of an HTML page (including text, graphics, and stylesheets)...
GroupDocs.Parser can extract data from Microsoft Office Word documents. Both the classic (doc, dot) and Open XML (docx, dotx) formats are supported. LibreOffice Writer (OpenOffice.org Writer) formats and RTF are also supported.
The following table provides the list of supported formats:
Format  Description
DOC     Microsoft Office Word Document
DOT     Microsoft Office Word Document Template
DOCX    Microsoft Office Open XML Document
DOCM    Microsoft Office Open XML Macro-Enabled Document
DOTX    Microsoft Office Open XML Document Template
DOTM    Microsoft Office Open XML Document Macro-Enabled Template
TXT     Plain Text
ODT     Open Document Text
OTT     Open Document Text Template
RTF     Rich Text Format

More resources
GitHub examples
You may easily run the code above and see the feature in action in our GitHub examples:...
Extract attachments from emails
To extract attachments from emails, use the getContainer method. This method returns a collection of ContainerItem objects.
An email attachment can contain the following metadata:

Name          Description
content-type  The MIME type of the attachment content

This metadata refers to the container element itself, not to a document.
Here are the steps to extract text from email attachments:
1. Instantiate a Parser object for the initial document;
2. Call the getContainer method and obtain a collection of ContainerItem objects;
3. Check that the collection isn't null (a null collection means container extraction is not supported for the document);
4. Iterate through the collection and obtain a Parser object for each item to extract its text.
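The control flow of these steps can be sketched without the library itself. The block below is a minimal, self-contained illustration that uses stand-in classes: `StubParser`, `StubItem`, and the `openParser` helper are assumptions made for this sketch and are not the GroupDocs.Parser types. Only the `getContainer` step and the null check come from the description above.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Hypothetical stand-in for a parser; NOT the GroupDocs.Parser class. */
class StubParser implements AutoCloseable {
    private final String text;
    private final List<StubItem> attachments; // null means container extraction is unsupported

    StubParser(String text, List<StubItem> attachments) {
        this.text = text;
        this.attachments = attachments;
    }

    // Mirrors the getContainer step named in the text.
    List<StubItem> getContainer() {
        return attachments;
    }

    String getText() {
        return text;
    }

    @Override
    public void close() {
        // Nothing to release in the stub.
    }
}

/** Hypothetical stand-in for a container item (one attachment). */
class StubItem {
    private final String name;
    private final String text;

    StubItem(String name, String text) {
        this.name = name;
        this.text = text;
    }

    String getName() {
        return name;
    }

    // Assumed helper: opens a parser over the attachment itself.
    StubParser openParser() {
        return new StubParser(text, null);
    }
}

class AttachmentTextSketch {
    static List<String> extractAttachmentTexts(StubParser emailParser) {
        List<String> texts = new ArrayList<>();
        // Step 2: obtain the collection of container items.
        List<StubItem> attachments = emailParser.getContainer();
        // Step 3: a null collection means container extraction is not supported.
        if (attachments == null) {
            return texts;
        }
        // Step 4: open a parser per attachment and extract its text.
        for (StubItem item : attachments) {
            try (StubParser attachmentParser = item.openParser()) {
                texts.add(item.getName() + ": " + attachmentParser.getText());
            }
        }
        return texts;
    }

    public static void main(String[] args) {
        List<StubItem> items = Arrays.asList(
                new StubItem("notes.txt", "meeting at 10"),
                new StubItem("todo.txt", "send report"));
        // Step 1: instantiate the parser for the initial document.
        try (StubParser email = new StubParser("email body", items)) {
            System.out.println(extractAttachmentTexts(email));
        }
    }
}
```

Each nested parser is closed by try-with-resources as soon as its text has been read, so resources do not accumulate while iterating over many attachments.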
There might be cases when the document is available only as a stream (without a copy on the local disk). To avoid the overhead of saving documents to the disk, GroupDocs.Parser lets you extract data directly from streams.
The following example shows how to load the document from the stream:
// Create the stream
try (InputStream stream = new FileInputStream(Constants.SamplePdf)) {
    // Create an instance of Parser class with the stream
    try (Parser parser = new Parser(stream)) {
        // Extract a text into the reader
        try (TextReader reader = parser....
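When the document never touches the disk at all (for example, bytes received over a network), the stream can be created from memory instead of from a file. The following is a minimal sketch using only the Java standard library. The `readAll` helper is a hypothetical stand-in for the stream consumer. In the example above, that role is played by `new Parser(stream)`.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;

class StreamSourceSketch {
    // Hypothetical stand-in for a stream consumer: drains the stream and decodes it as UTF-8.
    static String readAll(InputStream stream) {
        try {
            ByteArrayOutputStream buffer = new ByteArrayOutputStream();
            byte[] chunk = new byte[4096];
            int read;
            // read() returns -1 once the end of the stream is reached.
            while ((read = stream.read(chunk)) != -1) {
                buffer.write(chunk, 0, read);
            }
            return new String(buffer.toByteArray(), StandardCharsets.UTF_8);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) throws IOException {
        // The document exists only in memory; no temporary file is written.
        byte[] documentBytes = "Hello from memory".getBytes(StandardCharsets.UTF_8);
        try (InputStream stream = new ByteArrayInputStream(documentBytes)) {
            System.out.println(readAll(stream));
        }
    }
}
```

Any `InputStream` can be supplied the same way, so the code that consumes the stream does not need to know whether the bytes came from a file, a socket, or a buffer.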
This article explains regular expression (RegEx) search queries in the Java search API. These queries are universal and very flexible, but their performance becomes extremely low on large indexes. ...separately, and not the entire text of a document field. Queries...
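The truncated excerpt suggests that a regular expression is tested against each term separately rather than against the whole text of a field. The sketch below illustrates that idea with plain `java.util.regex`. It is not the search API itself, and the `matchTerms` helper and the whitespace tokenization are assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

class PerTermRegexSketch {
    // Hypothetical helper: applies the regex to every whitespace-separated term on its own.
    static List<String> matchTerms(String fieldText, String regex) {
        Pattern pattern = Pattern.compile(regex);
        List<String> hits = new ArrayList<>();
        for (String term : fieldText.split("\\s+")) {
            // matches() requires the whole term to match, not a substring of the field.
            if (!term.isEmpty() && pattern.matcher(term).matches()) {
                hits.add(term);
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        String field = "order 2024 shipped 17 items";
        // Each numeric term matches individually.
        System.out.println(matchTerms(field, "\\d+"));
        // A pattern spanning two terms finds nothing, because terms are tested one at a time.
        System.out.println(matchTerms(field, "order\\s2024"));
    }
}
```

Because every term has to be tested against the pattern, the work grows with the number of terms, which is consistent with the performance warning for large indexes.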
In this article, you will learn how to apply a watermark to document pages when converting a document with the GroupDocs.Conversion for Java API. ...of the watermark, such as its text, size, color, position, and...
Note: This feature is only compatible with GroupDocs.Assembly for .NET 3.2.0 or later releases.
Inserting Barcodes Dynamically
You can insert barcode images into your documents dynamically using barcode tags. To declare a dynamically inserted barcode image within your template, perform the following steps:
1. Add a Textbox to your template at the place where you want a barcode image to be inserted.
2. Set common image attributes such as size, rotation angle, and others for the Textbox, making the Textbox look like a barcode image without bars and text.
3. Specify a barcode tag within...
What Is GroupDocs.Watermark?
GroupDocs.Watermark for Java is an API for managing watermarks in documents of different file formats. It provides watermarking methods that let you add watermarks that are hard for third-party tools to remove automatically. It also lets you search for and remove previously added watermarks of popular types (including watermarks added by third-party tools) in a document. The API provides a straightforward, easy-to-use set of methods to add, search, and remove watermarks in supported file formats. ...Eliminate Watermark Containing Text with Particular Formatting Look...
This article explains the document formats and format families supported by GroupDocs.Editor for Java and how to work with them in Java code. ...also known as delimited text, that are text-based form of spreadsheets...