Extract Text from Word Documents in C#

Text extraction from Word documents is often performed in different scenarios. For example, to analyze the text, to extract particular sections of a document and combine them into a single document, and so on. In this article, you will learn how to extract text from Word documents programmatically in C#. Moreover, we will cover how to extract content between specific elements such as paragraphs, tables, etc. dynamically.

Posted in Aspose.Words Product Family | Tagged , , ,

Convert CFF to PDF in Java

Three-dimensional packaging and die-cutting designs are usually stored in CFF file format. However, only specific applications can process the CFF or CF2 files when you need to open, view, or manipulate these files. In accordance with that, you may need to convert the CFF file to a PDF document. This article covers how to convert a CFF to a PDF file programmatically using Java.

Posted in Aspose.CAD Product Family | Tagged , ,

Convert HTML to PDF in Java on Linux

HTML files are commonly used to display text, images, drawings, etc. over the web. In certain situations, you might need to convert HTML files to PDF files. This article covers how to convert an HTML file to a PDF document on Linux in Java.

Posted in Aspose.HTML Product Family | Tagged , , ,

Convert PPT to JPG in Python

While working with PowerPoint presentations in your Python applications, you may need to generate thumbnails for a PPT. For example, while creating a slideshow or a presentation manipulation tool. In this article, you will learn how to generate thumbnails by converting a PPT to JPG images in Python. We will also cover how to generate JPG images having custom dimensions and render slide notes and comments.

Posted in Aspose.Slides Product Family | Tagged , , , ,

Convert Scanned PDF to Word DOCX or DOC File in C#

Scanned PDF files contain images where text cannot be selected or edited. In certain situations, you may need to convert scanned PDF to Word document. In this article, you will learn how to convert scanned PDF to Word document in DOCX or DOC format programmatically using C#.

Posted in Aspose.OCR Product Family | Tagged , , , ,

Convert a Word Document to EPUB in C#

The EPUB format is used for electronic publications, which are commonly known as ebooks. The EPUB files are supported by a range of smart devices such as smartphones, tablets, laptops, etc. In various cases, the documents are created in MS Word formats that do not often have built-in support on smart devices. Therefore, Word files are converted to EPUB format. In this article, you will learn how to convert Word (DOCX, DOC, etc.) files to EPUB programmatically using C#.

Posted in Aspose.Words Product Family | Tagged , , ,

Convert Scanned PDF File to Text in C#

A scanned PDF file is basically one or more flat images captured by a scanner or a camera. You cannot copy, paste, or process information from such files. This article covers how to convert a scanned PDF to text in C#.

Posted in Aspose.OCR Product Family | Tagged , , ,

Extract Images from Word Documents in C#

Images play an important role to illustrate the key information in Word documents. Moreover, they make the document more attractive and improve its presentation. As a programmer, you may get a job to extract the images embedded within the Word DOCX or DOC documents. To achieve that, this article covers how to extract images from Word documents programmatically using C#. Moreover, you will see how to save the extracted images to the desired location.

Posted in Aspose.Words Product Family | Tagged , , ,

Convert TXT Files to PDF in Python

Various people use Notepad to write down important points or create notes quickly in TXT format. Also, TXT files are used to store plain text in various applications. However, since Notepad does not provide advanced features, TXT files are often converted to PDF. In order to automate TXT to PDF conversion programmatically, this article covers how to convert TXT files to PDF format in Python.

Posted in Aspose.Words Product Family | Tagged , ,

Convert HTML Files to Word Document in Java

In various cases, you may need to convert the HTML content to a Word document. For example, for generating the document from a WYSIWYG HTML editor or converting a web page to DOCX or DOC format. To perform this conversion programmatically, this article covers how to convert HTML files to Word DOCX, DOC, DOCM, or other formats in Java.

Posted in Aspose.Words Product Family | Tagged , , , ,