Generate Search Results from Filtered Documents

Sometimes, we have a lot of different formatted documents. We create index then perform search and API shows hits from all the indexed documents. Ever thought to get search results from desired documents only?

Let’s dig it further. Suppose, you have multiple TXT, XLSX and DOCX documents. What if you want to search a word or text only in TXT and DOCX files with specific word occurrence in file names? We’ve now improved ISearchDocumentFilter interface for such a purpose in GroupDocs.Search. This interface represents search document filter and it uses SearchDocumentFilter class for creation of a filter instances.

Let’s now understand this with a use-case.
C#

Java

What we did in this example? We initiated a request to search occurrence of a word hobbit in all the TXT and DOCX files with word task in their file names. In this snippet, you can see a method CreateConjuction. It creates logical conjunction (logical and) of the specified filters.

Settings for Log Functionality

This improvement allows to set up log file name and maximum log file size. Below are the public API changes.
Following class has been added to GroupDocs.Search namespace and com.groupdocs.search package:

  • LogSettings
Following properties are added to GroupDocs.Search.LogSettings and com.groupdocs.search.LogSettings class: C#
  • string FileName
  • double MaxSize
Java
  • String getFileName()
  • void setFileName(String value)
  • double getMaxSize()
  • void setMaxSize(double value)
Below properties are added to GroupDocs.Search.Index and com.groupdocs.search.Index classes respectively:
  • LogSettings LogSettings
  • LogSettings getLogSettings()


Let’s go through the implementation:
C#


Java

Add English Synonyms

This improvement adds English synonyms to default synonym dictionary.
C#


Java

Above are the major improvements introduced in version 19.5. Please go through all the other changes in release notes:

Avail these exciting improvements now in your project. This release is available for download. Please share your feedback or concerns here.

Share on FacebookTweet about this on TwitterShare on LinkedIn
Posted in GroupDocs.Search Product Family | Tagged | Leave a comment

Save Assembled Word Processing, Presentation, Spreadsheet and Email Documents as HTML File with Resources

The HTML format is widely used for creating the webpages or, in other words, the HTML files. These days, every operating system, whether it be of desktop or mobile, contains the built-in web browser that supports viewing the HTML files. Other than that, many free web browsers are also available in the market. This means that any information or the content that is available in the form of HTML pages can easily be viewed just having installed a web browser.

Let’s suppose a scenario where my friend shared with me a Word document created in MS Word but the problem is I don’t have MS Office installed on my system. So how would I view the content of the file? A suitable solution is if I could get the HTML form of that Word document then it can be viewed in my web browser and thus, I won’t have to install MS Office.

Now, you could imagine the usefulness of the HTML format apart from its usage in web development. So, let’s now find out how did we make use of HTML format in making GroupDocs.Assembly more powerful and useful for you.

Since version 19.5 of GroupDocs.Assembly, the assembled Word Processing documents, Spreadsheets, Presentations and Email files could be saved as HTML with resources. This means that the generated reports can now be saved as HTML files along with the resources such as images and, as I have mentioned before, you would only need to have a web browser to view the content of the generated reports.

Let’s take the example of generating the report from the Word template and see how we can save the report as an HTML file. For saving the generated report as HTML with resources, we have added a new property ResourceSaveFolder to the LoadSaveOptions class to specify the folder that will be used to store the resource files. If no folder is specified, by default, external resource files will be stored to a folder having the same name as that of the HTML file (without extension) plus the “_files” suffix. The following code snippets demonstrate this feature:

C#


Java


Along with this feature, we have also added the facility of loading HTML template documents referencing external resource files to be assembled and saved to non-HTML formats. For this, the containing folder of the template file is used as a base URI to resolve external resource files’ relative URIs to absolute ones by default, as shown in the following code snippet:

C#


Java


Well, this would definitely be a really attracting release for you guys. So let’s not wait and download the latest version to check how these features can make your report generation applications more powerful.

For any questions or suggestions, have a conversation with us on our forum.

Share on FacebookTweet about this on TwitterShare on LinkedIn
Posted in GroupDocs.Assembly Product Family | Tagged , , , , , , , | Leave a comment

Cache Page Previews in GroupDocs.Annotation for Java

EnableCaching feature stores image representation of a document in a directory at the local drive. In case of repetitive processing/opening of a document, it uses cached data. This thing helps to avoid the processing of the same document again and again. You might be waiting for this feature. The wait is over now. API permits you to enable cache mode and cache page previews. Let’s take a overview of its code.

We also added possibility to display distance annotation caption for following file formats:

  • Slides
  • Cells
  • Diagrams

Following older formats for Slides, Cells and Words are now supported:

  • .ppt
  • .xls
  • .doc

Along with new features, this release also covers some improvements and bug fixes. As an improvement we refactored PdfToPngSaver and AnnotationHandler classes.

Moreover, we added additional parameters to ImageOptions for more flexible retrieval of pages.

  • Parameter for getting pages without annotation
  • Retrieve the range of pages
  • Retrieve only specific count of pages

Given below are some of the major bug fixes.

  • Line width not set in Slides
  • Issue when opening protected with password documents
  • Exception occurred while remove annotations from .xls file

Download our open-source example project here and explore API documentation here.

Share on FacebookTweet about this on TwitterShare on LinkedIn
Posted in GroupDocs.Annotation Product Family | Tagged | Leave a comment

Implement Conversion to WMF Format

WMF is a Windows MetaFile that can contain both vector and bitmap image information. We added support of WMF file format along-with few others in GroupDocs.Conversion for .NET 19.5. You can now convert any supported file format to WMF. Along-with WMF, convert supported documents to EMF.

Convert to EMF

Convert to WMF

Moreover, you can now pay royalties for the use of the API. Yes, metered licensing is now enabled.

New file formats you can convert from:

  • DIB
  • XLT
  • POT

In this release, we also removed obsolete property JpegQuality from ImageSaveOptions.

We’d recommend you to download latest version of the API and share your feedback.

Share on FacebookTweet about this on TwitterShare on LinkedIn
Posted in GroupDocs.Conversion Product Family | Tagged | Leave a comment

Footnote Comparison Issue is Resolved

Footnote appears at bottom of the page and used to cite information. Previously, API was not comparing Footnotes correctly. But with the release of GroupDocs.Comparison for .NET 19.5, we fixed this issue. Please have a glance.

But this is not it we have a lot more to tell you. Along with bug fixes, some features and improvements are also introduced.

Lets talk about new features first. Did you ever think of getting coordinates of document changes or differences? It could be confusing at first but let me elaborate this. In your output or resultant document, you get every detail of inserted, deleted or style changed items. What else you can get now is the coordinate details where changes or differences actually occurred. Currently this feature is supported for only PDF, Slide and Diagram formats.

To use this feature you should specify CalculateComponentCoordinates property in ComparisonSettings.

Let’s see a complete use-case.

Coming to the second feature, its about DocumentInfo. This class contains following properties:

  • NumberOfPages (read only) – count of document pages
  • PagesData (read only) – list of PageInfo classes

PageInfo class contains following properties:

  • Width – the width of page
  • Height – the height of page

Following are the improvements:

  • Change detection in tables for Word documents
  • Error handling improvements for all the formats

Major Bug Fixes

  • Can’t get images for HTML files
  • Incorrect difference info in PDF
  • Font detection exception
  • Word separation exception

We’d recommend you to download latest release of the API and enhance your document comparison experience.

Share on FacebookTweet about this on TwitterShare on LinkedIn
Posted in GroupDocs.Comparison Product Family | Tagged | Leave a comment

Rendering Documents as PDF

The Portable Document Format (PDF) is a file format to present documents, including text formatting and images, in a manner independent of the application software, hardware, and operating systems. Based on the PostScript language, each PDF file encapsulates a complete description of a fixed-layout flat document, including the text, fonts, vector graphics, raster images and other information needed to display it.

This post demonstrates the usage of the GroupDocs.Viewer APIs (GroupDocs.Viewer for .NET and GroupDocs.Viewer for Java) to render the supported file formats as PDF documents.

How to Render Using GroupDocs.Viewer

Both the viewer handlers (ViewerHtmlHandler,ViewerImageHandler) provide an overloaded method GetPdfFile() which accepts the document name as parameter and returns FileContainer. The File stream can be fetched from the Stream property of the FileContainer in order to save the resultant document on the disk.

The following code snippet shows the way to render a word document as PDF:

Java lovers can write the code like following:

Add Print Action while Rendering as PDF

A Print Action is an option to appear the print popup when open the PDF document in Adobe reader. The popup will appear like below screenshot:

The below given steps can be followed, in order to add print action in the resultant PDF document.

  • Create/initialize PdfFileOptions object
  • Set Transformations property of PdfFileOptions to Transformation.AddPrintAction
  • Call htmlHandler.GetPdfFile method

The .NET code snippet will look like:

The Java code can be written as:

The complete ready to run code sample is available on GitHub.

  • The .NET guys can visit here
  • The code sample for Java lovers is available here

Share on FacebookTweet about this on TwitterShare on LinkedIn
Posted in GroupDocs.Viewer Product Family | Leave a comment

GroupDocs Pricing Review 2019

In this post, we share how we conducted our 2019 GroupDocs Pricing Review.

Last month we shared the news about GroupDocs 2019 pricing change. Our aim behind that change was simple: to make sure we are delivering the best value Document Management API suite on the market.

Each year we review our pricing and licensing model to make sure we’re continually giving developers the functionality and flexibility they need. Providing top quality APIs at fair prices along with permissive licensing models.

How we conducted the review

For this review we gathered a list Vendors in the “.NET Document Viewer” space, ensured they had approximately comparable functionality to GroupDocs.Viewer for .NET and Java and measured the value that developers can get from each API vendor on our list.

Many vendors offer only a small subset of the vast feature-set offered inside GroupDocs.Total. So when performing our review we couldn’t measure a true “like-for-like” comparison. As such, for this review we focused on a single product and looked at 9 other vendors that offered a .NET document viewer.

Pricing Comparison

Purely from a pricing perspective, comparing GroupDocs’ Developer OEM license type at $2997 USD, we found that GroupDocs was the 3rd lowest priced solution on the market.


Vendor Cost for Equivalent OEM style license ($ USD)
Vendor A $35940
Vendor B $7400
Vendor C$5499
Vendor D$5000
Vendor E$4995
Vendor F$4719
Vendor G$2999
GroupDocs$2997
Vendor H *$1995
Vendor I **$1499

* This vendors Viewer only supports viewing PDF files
** This vendors Viewer is only supported on Silverlight and WPF

As well as being the most competitively priced API, our pricing starts at only $999, so the review doesn’t take our introductory, Developer Small Business license into account.

What Does A GroupDocs License Get You?

From a licensing perspective, our Developer OEM license offers:

  • Category leading functionality
  • 12 month subscription included – On average, products get updates each month, delivering bug fixes and new features.
  • Free Support – Dedicated support teams, also available during free trials. Paid Support options are available.
  • Deployment to unlimited servers – No per server licensing.
  • No extra cost for deployment to staging, build or backup servers.
  • Simple to understand, per developer licensing
  • Ability to upgrade single product purchases to GroupDocs.Total.

Conclusion

GroupDocs offers incredible value for money, market leading functionality, great licensing terms, free support and future proofing through free monthly updates.

If you’re looking into Document Manipulation API’s and you’re not evaluating GroupDocs, get in touch with our Sales Team or Support Team and we’ll be happy to discuss your needs.

Share on FacebookTweet about this on TwitterShare on LinkedIn
Posted in Business, News Release | Tagged , | Leave a comment

GroupDocs.Viewer for .NET now supports 95+ file formats!

Usually, when we have to shop for groceries, we choose the supermarket where we can get almost every item that we need. This way we don’t have to visit multiple stores for different types of products. Similarly, being specific to the software products, the developers prefer the component that provides a wide range of features regarding their use case so that they don’t have to use multiple solutions.

Let’s be more specific and talk about the online document viewers that are becoming more popular in modern days because of their ability to display the files within the applications without any need of dedicated software programs. Most of the time, the developers want to use an all-in-one document viewer component that can display a wide range of file formats. Thus, they can avoid using multiple components to provide support for multiple file formats.

What if I tell you that it’s now possible to have a document viewer that supports 95+ file formats?

Yes, you read it absolutely correct. In the latest release of GroupDocs.Viewer for .NET, we have added the support of 4 new file formats that are listed below.

  • Device Independent Bitmap (.dib)
  • Microsoft PowerPoint Template (.pot)
  • Electronic Business Card (.vcard)
  • Bzip2 Compressed File (.bz2)

With this addition, the API now supports 95+ file formats and we have the eye on completing our century soon.

So, let’s not wait and use this powerful API for integrating an all-in-one document viewer component in your application. For more news and updates, stay tuned to our blog and keep posting to our forum.

Share on FacebookTweet about this on TwitterShare on LinkedIn
Posted in GroupDocs.Viewer Product Family | Tagged , , , , , , , , | Leave a comment