Tag Archives: Java Text Extractor

Support for Text and Presentation Templates in GroupDocs.Parser for Java 18.12

We are delighted to announce the release of GroupDocs.Parser for Java 18.12. The latest version allows you to extract the tables from PDF documents. Furthermore, we have added the support of extracting text and metadata from text and presentation templates. For more details, please have a look at the release notes of version 18.12. Features Introduced Extracting Tables from PDF Documents This feature is very useful when you want to extract only the tables form a PDF document. For extracting …

Continue reading

Posted in GroupDocs.Parser Product Family | Tagged , , , , , ,

Improved Text Area Extraction for PDF Documents in GroupDocs.Parser for Java 18.11

We are delighted to announce the release of GroupDocs.Parser for Java 18.11. The latest version came up with one new feature and three enhancements. It allows you to get information about the supported extractors for a document. Furthermore, we have improved the text area extraction for the PDF documents. For more details, please have a look at the release notes of version 18.11. Features Introduced Getting Information of Supported Extractors for a Document This feature helps to get the information …

Continue reading

Posted in GroupDocs.Parser Product Family | Tagged , , , , , , ,

Introducing Image Extraction in GroupDocs.Parser for Java 18.10

We are delighted to announce the release of GroupDocs.Parser for Java 18.10. The latest release has come with a useful feature of extracting images from the documents. This feature is introduced for PDF, spreadsheet, presentation and text document formats. For more details, please have a look at the release notes of version 18.10. Features Introduced Extracting Images from Documents To extract images from the page of the document, getImageAreas methods are used. getImageAreas has following overloads: public IList getImageAreas(int pageIndex); …

Continue reading

Posted in GroupDocs.Parser Product Family | Tagged , , , , , , ,

Releasing GroupDocs.Parser for Java – A Convenient Document Parser API

We are pleased to announce that the first version of GroupDocs.Parser for Java has been released. GroupDocs.Parser for Java allows the Java developers to extract raw and formatted text from the popular document formats. The API also supports working with containers such as ZIP and email containers. You can also access the metadata attached to the documents using a few lines of code. Please continue to read more about the features and the file formats supported by the API. Supported …

Continue reading

Posted in GroupDocs.Parser Product Family | Tagged , , , , , ,

Upcoming Release of GroupDocs.Parser for Java

We are excited to announce that GroupDocs.Parser is coming soon to Java platform as GroupDocs.Parser for Java. It will be an easy to use back-end API that will permit the users to extract raw and formatted text from the supported document formats. Besides, it will also allow the users to extract the metadata from the popular document formats. GroupDocs.Parser for Java will soon be available for download. Salient Features of GroupDocs.Parser for Java GroupDocs.Parser for Java will come with all the features …

Continue reading

Posted in GroupDocs.Parser Product Family | Tagged , , , , , , , ,