- News Release(3)
Category Archive: GroupDocs.Parser Product Family
Repetition of data can diminish the worth of the content. Working as a writer, you must follow DRY (don’t repeat yourself) principle. The statistics such as word count or the number of occurrences of each word can let you analyze the content but it’s hard to do it manually for multiple documents. So in this article, I’ll demonstrate how to programmatically count words and the number of occurrences of each word in PDF, Word, Excel, PowerPoint, … Continue Reading
Portable Document Format (PDF) is a popular and widely used document format developed by Adobe. The PDF documents can contain a variety of content including formatted text, images, annotations, form fields, etc. Parsing PDF document programmatically is a popular use case and there are multiple ways of extracting the text. However, extracting images from a PDF document is a complex task. This article demonstrates how easily you can extract images from the PDF documents programmatically in C# using GroupDocs.Parser for … Continue Reading
The all-new API v2 of GroupDocs.Parser for .NET has been released! It would be a piece of breaking news for those who are already using our document parsing API as well as who are looking for an easy to use solution for extracting text, images, and metadata from PDF, word processing documents, spreadsheets, presentations, emails, EPUB & ZIP file formats.
What’s new in the API v2?
We have done some major updates at … Continue Reading
Hello everyone! I am back with something new and exciting for the developers who use to deal with the automated data extraction from the documents. A few years back, we released GroupDocs.Parser API which aimed to extract the text from various document formats. We kept on adding the features to it and today, it has become a giant API that provides a wide range of features including formatted text extraction, highlighted and structured text extraction, metadata extraction, extraction of images … Continue Reading
We are delighted to announce the release of GroupDocs.Parser for Java 18.12. The latest version allows you to extract the tables from PDF documents. Furthermore, we have added the support of extracting text and metadata from text and presentation templates. For more details, please have a look at the release notes of version 18.12.
Extracting Tables from PDF DocumentsThis feature is very useful when you want to extract only the tables form a PDF document. For extracting… Continue Reading
We are delighted to announce the release of GroupDocs.Parser for Java 18.11. The latest version came up with one new feature and three enhancements. It allows you to get information about the supported extractors for a document. Furthermore, we have improved the text area extraction for the PDF documents. For more details, please have a look at the release notes of version 18.11.
Getting Information of Supported Extractors for a DocumentThis feature helps to get the information… Continue Reading
We are pleased to announce the release of version 18.11 of GroupDocs.Parser for .NET. The latest version came up with one new feature and three enhancements. It allows you to get information about the supported extractors for a document. Furthermore, we have improved the text area extraction for the PDF documents. For more details, please have a look at the release notes of version 18.11.
Getting Information of Supported Extractors for a DocumentThis feature helps to… Continue Reading
GroupDocs.Parser for .NET 18.9 has been released! The latest version allows you to extract text from the databases. You can also extract data from the form fields in a PDF document. Please continue to read for more details on the features introduced in v18.9.
Extracting Text from DatabasesYou can now extract text from the databases. To extract text from databases DbContainer class is used that implements IContainer interface. Each data table is represented by the entity. The… Continue Reading
Text Analysis API for Spreadsheets, Presentations and Text Documents – GroupDocs.Parser for .NET 18.8
We are pleased to announce the release of version 18.8 of GroupDocs.Parser for .NET. In this version, we have extended the support of text analysis API for spreadsheets, presentations and text documents. Furthermore, the latest version allows providing a password for protected documents on-demand. We’d recommend you to use the latest version of the API and share your feedback.