Tag Archives: Extract Text from PDF

Extract Text from PDF using Java

In this post, you will learn how to extract text from PDF documents using Java. Text extraction is useful in scenarios such as text analysis, information retrieval, and document parsing. Because PDF is one of the most widely used digital document formats, use cases for extracting text from PDF documents are especially common. So let's begin and see how to perform PDF text extraction from within Java applications.

Posted in Aspose.PDF Product Family

Search and Extract Text from PDF Programmatically using C#

Extract text from PDF documents using C# in .NET. You can extract text from a specific page or region, or from the whole PDF file.
Posted in Aspose.PDF Product Family

Extract Text by Paragraphs and Convert Files to PDF with Aspose.PDF

We at Aspose are very excited to announce a new version of the Aspose.PDF API. Version 18.1 of Aspose.PDF has been released for the .NET and Java platforms. In this version, we offer exciting new features, including extracting paragraphs from PDF documents. Furthermore, we have improved the usability and performance of the API. You can download the Aspose.PDF API for the .NET and Java platforms from the links below:

Before downloading the latest version of the … Continue Reading

Posted in Aspose.PDF Product Family

Extract Text from PDF Including Images in C# using Aspose.PDF and Aspose.OCR

Extract text and images from PDF using C# .NET

Extracting text from a PDF file is a common requirement for developers working with PDF files. Aspose.PDF for .NET already allows you to extract text from a PDF file using C# in your .NET applications. The only limitation was that text couldn’t be extracted from images contained in the PDF file.

Well, with the introduction of Aspose.OCR for .NET in our .NET product suite, we have moved towards removing this limitation. In this post, I’m going to … Continue Reading

Posted in Aspose.PDF Product Family, Aspose.OCR Product Family