Extract Text from Word Documents in C#
Text extraction from Word documents is useful in many scenarios, for example, to analyze the text or to extract particular sections of a document and combine them into a single document. In this article, you will learn how to extract text from Word documents programmatically in C#. We will also cover how to dynamically extract the content between specific elements, such as paragraphs and tables.
Convert CFF to PDF in Java
Three-dimensional packaging and die-cutting designs are usually stored in the CFF file format. However, only specific applications can open, view, or manipulate CFF or CF2 files. As a result, you may need to convert a CFF file to a PDF document. This article covers how to convert a CFF file to a PDF file programmatically using Java.
Convert HTML to PDF in Java on Linux
HTML files are commonly used to display text, images, drawings, etc. over the web. In certain situations, you might need to convert HTML files to PDF files. This article covers how to convert an HTML file to a PDF document on Linux in Java.
Convert Scanned PDF to Word DOCX or DOC File in C#
Scanned PDF files contain images in which text cannot be selected or edited. In certain situations, you may need to convert a scanned PDF to a Word document. In this article, you will learn how to convert a scanned PDF to a Word document in DOCX or DOC format programmatically using C#.
Convert a Word Document to EPUB in C#
The EPUB format is used for electronic publications, commonly known as ebooks. EPUB files are supported by a range of devices such as smartphones, tablets, and laptops. Documents are often created in MS Word formats, which frequently lack built-in support on these devices, so Word files are converted to EPUB. In this article, you will learn how to convert Word (DOCX, DOC, etc.) files to EPUB programmatically using C#.
Convert Scanned PDF File to Text in C#
A scanned PDF file is essentially one or more flat images captured by a scanner or a camera. You cannot copy, paste, or otherwise process the text in such files. This article covers how to convert a scanned PDF to text in C#.
Extract Images from Word Documents in C#
Images play an important role in illustrating key information in Word documents. They also make a document more attractive and improve its presentation. As a programmer, you may be asked to extract the images embedded in Word DOCX or DOC documents. This article covers how to extract images from Word documents programmatically using C# and how to save the extracted images to the desired location.
Convert TXT Files to PDF in Python
Many people use Notepad to jot down important points or create quick notes in TXT format. TXT files are also used to store plain text in various applications. However, since Notepad does not provide advanced features, TXT files are often converted to PDF. To automate this conversion, this article covers how to convert TXT files to PDF format in Python.
Convert HTML Files to Word Document in Java
In various cases, you may need to convert HTML content to a Word document, for example, to generate a document from a WYSIWYG HTML editor or to convert a web page to DOCX or DOC format. This article covers how to convert HTML files to Word DOCX, DOC, DOCM, or other formats programmatically in Java.