GroupDocs.Parser for .NET

We are pleased to announce the release of version 18.12 of GroupDocs.Parser for .NET. This version lets you extract tables from PDF documents. It also adds support for extracting text and metadata from text and presentation templates. For more details, please have a look at the release notes of version 18.12.

# Features Introduced

## Extracting Tables from PDF Documents

This feature is useful when you want to extract only the tables from a PDF document. Tables are extracted with the **TableAreaParser** class, an instance of which is available through the property of the same name in the **PdfTextExtractor** class. You can define the table bounds manually or let the API detect the layout automatically using the **TableAreaDetector** class. For a working example of this feature, please refer to the product documentation.

## Extracting Text and Metadata from Text and Presentation Templates

GroupDocs.Parser now supports extracting text and metadata from the following text and presentation template formats:

*   dotx (Word template)
*   dotm (Word macro-enabled template)
*   ott (OpenDocument Text template)
*   potx (PowerPoint template)
*   potm (PowerPoint macro-enabled template)
*   ppsm (PowerPoint macro-enabled slideshow)
*   pptm (PowerPoint macro-enabled presentation)
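
As a small illustration of the format list above, the sketch below checks whether a file name carries one of these extensions before it is handed to an extractor. The `TemplateFormats` class and its `IsListed` method are hypothetical helpers written for this post. They are not part of the GroupDocs.Parser API, and the library may well accept a file regardless of its extension.

```csharp
using System;
using System.Collections.Generic;
using System.IO;

// Hypothetical helper, NOT part of GroupDocs.Parser: checks a file name
// against the template extensions listed in this announcement.
public static class TemplateFormats
{
    // Case-insensitive set so that "REPORT.DOTX" matches ".dotx".
    private static readonly HashSet<string> Extensions =
        new HashSet<string>(StringComparer.OrdinalIgnoreCase)
        {
            ".dotx", ".dotm", ".ott", ".potx", ".potm", ".ppsm", ".pptm"
        };

    // Returns true when the file extension is one of the listed formats.
    public static bool IsListed(string fileName)
    {
        if (string.IsNullOrEmpty(fileName))
        {
            return false;
        }

        return Extensions.Contains(Path.GetExtension(fileName));
    }
}
```

For example, `TemplateFormats.IsListed("brochure.potx")` returns `true`, while `TemplateFormats.IsListed("notes.txt")` returns `false`. A check like this is only a convenience for filtering a folder of mixed files.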

The following code samples show how to extract text and metadata from templates.

```csharp
using System;

// Extracting text
void ExtractText(string fileName)
{
    // Extract the text from the file
    var text = Extractor.Default.ExtractText(fileName);
    // Print the extracted text
    Console.WriteLine(text);
}

// Extracting metadata
void ExtractMetadata(string fileName)
{
    // Extract metadata from the file
    var metadata = Extractor.Default.ExtractMetadata(fileName);
    // Print the extracted metadata
    foreach (var m in metadata)
    {
        // Print the metadata key
        Console.Write(m.Key);
        Console.Write(": ");
        // Print the metadata value
        Console.WriteLine(m.Value);
    }
}
```


# Enhancements

## Detecting Type of Password-protected Office Open XML Documents

This feature detects the media type of password-protected Office Open XML documents. To detect the media type of an encrypted Office Open XML document, use the **Detect(Stream, LoadOptions)** method. You can also use the **IPasswordProvider** interface for batch document processing. For working examples of this feature, please refer to [this](https://docs.groupdocs.com/parser/net) documentation article.

# Available Channels and Resources

Here are a few channels and resources where you can download, learn about, try, and get technical support for GroupDocs.Parser:

*   [Installation](https://www.nuget.org/packages/groupdocs.parser "GroupDocs.Text Nuget Package") - Install GroupDocs.Parser using NuGet
*   [Documentation](https://docs.groupdocs.com/display/parsernet/Home "GroupDocs.Text Documentation") - Product Docs
*   [Examples](https://github.com/groupdocs-parser/GroupDocs.Parser-for-.NET "GroupDocs.Text Github repository") - GitHub Source Code Examples
*   [Video Tutorials](https://www.youtube.com/playlist?list=PL25CTxMCj5vPQyfL8Tkz8XH7yOPhrglb7 "GroupDocs.Text for .NET tutorials") - YouTube Video Tutorials
*   [Product Support Forum](https://forum.groupdocs.com/c/parser "GroupDocs.Text for .NET Support forum") - Technical Support Forum for GroupDocs.Parser

# Have Queries?

If you have any queries or concerns about the API, please feel free to get in touch with us on the [forum](https://forum.groupdocs.com/). We’ll be glad to address your concerns.