How to Work with Core Document Management in Python

How to Work with Core Document Management in Python

How to Work with Core Document Management

This guide covers core document management using Aspose.Words FOSS for Python. It demonstrates document format conversion: reading Word files and writing them as PDF, Markdown, or plain text.

Prerequisites

Install the library:

pip install aspose-words-foss>=26.4.0

Requires Python 3.10 or later.

Load a Document

Create a Document object by passing a file path. Supported input formats include DOCX, DOC, RTF, TXT, and Markdown.

import aspose.words_foss as aw

doc = aw.Document("input.docx")

Convert to PDF

Call save() with SaveFormat.PDF:

import aspose.words_foss as aw

doc = aw.Document("input.docx")
doc.save("output.pdf", aw.SaveFormat.PDF)

Convert to Markdown

Supported input formats for Markdown conversion include DOCX, DOC, RTF, TXT, and Markdown files.

Pass SaveFormat.MARKDOWN to save output:

import aspose.words_foss as aw

doc = aw.Document("input.docx")
doc.save("output.md", aw.SaveFormat.MARKDOWN)

Extract Text

Use Document.get_text() to extract all text content:

import aspose.words_foss as aw

doc = aw.Document("input.docx")
text = doc.get_text()

Summary

TaskMethod
Load a documentDocument("path")
Export to PDFd.save("output.pdf", SaveFormat.PDF)
Export to Markdownd.save("output.md", SaveFormat.MARKDOWN)
Export to plain textd.save("output.txt", SaveFormat.TEXT)
Extract textd.get_text()

See Also