Text and Metadata Extraction APIs for Java Applications Accurately Parse MS Office, Emails, Zip and Legal Documents
GroupDocs.Parser is a Java document parsing and text extraction API for analyzing documents thus allowing users to extract metadata, raw and formatted text from the supported document formats (Word, Excel, PowerPoint, PDF, OneNote, Visio, Text, HTML, Zip, Markdown and Email messages). The API performs content parsing operations with highest level of accuracy and speed.
Also available for: .NET
From the Library
Product Releases and Updates