In this article, we will learn how to read and parse PDF documents and then programmatically extract PDF form field values in C#. Earlier, we saw how to extract values from PDF forms in Java. With these two articles, if you have collected filled-in feedback forms, you can extract their values within your .NET and Java applications for analysis or save them to a database.
.NET API to Parse and Extract Values from PDF Forms
GroupDocs.Parser for .NET is an easy-to-use and powerful parsing and data extraction API for .NET applications. It supports text, metadata, and image extraction from word-processing and PDF documents, spreadsheets, presentations, emails, markup files, ebooks, archives, and more. One of its significant features, covered in this article, is parsing fillable PDF forms to extract the form field values with a small piece of C# code.
To try the examples in this article and others, you can install the API from NuGet or download it directly from the GroupDocs downloads section.
PM> Install-Package GroupDocs.Parser
Extract Data from PDF Form Field using C#
The following steps show how to parse a PDF and then extract its form field values in C#.
- Load the PDF file using the Parser class.
- Parse the PDF form using the ParseForm method.
- Traverse the parsed collection to extract the form field values.
The following C# code example shows the extraction of field values of filled PDF forms within .NET applications.
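A minimal sketch of these steps, assuming the GroupDocs.Parser NuGet package is installed. The file name `filled-form.pdf` is a hypothetical placeholder, and the use of `DocumentData`, `FieldData`, and `PageTextArea` reflects the API as commonly documented, so check the names against the version you install:

```csharp
using System;
using GroupDocs.Parser;
using GroupDocs.Parser.Data;

class ExtractPdfFormFields
{
    static void Main()
    {
        // Hypothetical path to a filled PDF form; replace with your own file.
        string filePath = "filled-form.pdf";

        // Step 1: load the PDF file. Parser is disposable, so wrap it in using.
        using (Parser parser = new Parser(filePath))
        {
            // Step 2: parse the form. A null result is assumed to mean that
            // form parsing is not supported for this document.
            DocumentData data = parser.ParseForm();
            if (data == null)
            {
                Console.WriteLine("Form extraction isn't supported.");
                return;
            }

            // Step 3: traverse the parsed fields and print each name and value.
            for (int i = 0; i < data.Count; i++)
            {
                FieldData field = data[i];

                // Only text areas carry a text value; other area types are flagged.
                PageTextArea area = field.PageArea as PageTextArea;
                string value = area == null ? "Not a text field" : area.Text;

                Console.WriteLine(field.Name + ": " + value);
            }
        }
    }
}
```

Each field is printed as one `NAME: value` line, the same shape as the sample output that follows.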
COMPANY: GroupDocs
EMAIL: everything@groupdocs.com
COUNTRY: Australia
Conclusion
I am confident that you will now feel comfortable developing your own .NET-based application that can parse PDF files and fetch values from fillable PDF form fields quickly and precisely. To add more features, you can learn more about the API from the documentation articles and the C# examples on GitHub.
For questions and a quick response, get in touch on the forum.