Metadata Cleaner for Documents and Images using Java
Metadata is the data that provides information about the actual data. It is commonly described as “data about data”. When sending a file to someone, it is not a good practice to send metadata along. It can reveal your information to the receiver that you may not want to share. Some of the examples include; Name, Company Name, Document Modification Date, Make and Model of Camera, etc. In this article, we will be programmatically removing metadata from the images and documents using Java.
Read PDF Form Fields in Java
In this article, we will discuss how to parse PDF document and extract values from PDF forms programmatically in Java. There are many situations, where we have several filled survey forms or feedbacks in PDF format from a large audience. We can easily extract the filled data values and use them for analysis. Let us now move straight towards reading these PDF forms and extract filled data field values within Java applications.
Get 25% OFF on GroupDocs.Total
GroupDocs offers you 25% OFF on [GroupDocs.Total][1] - all GroupDocs APIs you have ever wanted. Do not forget to use the HOLOFF2020 coupon code while ordering to avail this discount offer.
Find and Remove Watermarks from Documents in Java
This article is useful for Java developers who are looking for a way to find and remove text or image watermarks from PDF, Word, Excel, PowerPoint, and Visio documents. In one of our posts we have learnt [finding and removing watermarks from documents in C#][1]. Let us now jump to quickly see a little about a Java API that allows adding, finding and removing watermarks from various documents in different ways.
Find and Remove Watermarks from Documents in C#
Today, we will have a look at how to find and remove watermarks from documents in C#. There can be text and image-based watermarks in a document. We can easily search and programmatically remove such watermarks from many PDF, Word, Excel, PowerPoint, and Visio supported documents.
Convert CAD Drawings to PDF in C#
Today, we will learn how to programmatically convert the CAD drawings to PDF format in C#. Previously, in an [earlier post][1], we did the same but in Java. We looked to convert the DWG, DGN, and DWF files into PDF document with the code example. Let us do it in C# using the document conversion API for .NET.
Extract Images from Documents using C#
In this article, we will be learning to programmatically extract images from PDF, Excel, PowerPoint, and Word documents in a C# application using document parsing .NET API. [GroupDocs.Parser for .NET][1] is document parsing and data extraction .NET API. It supports document parsing and extraction of images, text, and metadata from word-processing documents, spreadsheets, presentations, archives, and email documents.
Extract Images from Documents using Java
Today, we will learn to programmatically extract images from PDF, Excel, PowerPoint, and Word documents using Java. For the extraction of images, we will use [GroupDocs.Parser for Java][1]. This Java API supports the parsing of documents and extraction of images, text, and metadata from word-processing documents, spreadsheets, presentations, archives, and email documents. Extracted images can be saved in BMP, GIF, JPEG, PNG, and WebP formats.
Insert OLE Objects in Word, Excel, PowerPoint using Java
Today, we will be learning to embed PDF and other different documents as OLE objects in Word, Excel, PowerPoint files using Java. For embedding the documents via Object Linking and Embedding, we will be using the GroupDocs.Merger for Java API that also allows us to efficiently combine/merge and split multiple documents with minimum lines of Java code.
Verify Digital Signature in Documents using Java
In this article, we will be learning to programmatically verify the digitally signed documents using Java. The example uses PDF document for the verification, however, you may also perform verification of digitally signed word processing documents like MS Word DOC/DOCX, Excel spreadsheets XLS/XLSX, and presentations PPT/PPTX.