How to Work with Core Document Management in Python
This guide covers core document management using Aspose.Words FOSS for Python. It demonstrates document format conversion: reading Word files and writing them as PDF, Markdown, or plain text.
Prerequisites
Install the library:
pip install "aspose-words-foss>=26.4.0"
Requires Python 3.10 or later.
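To confirm that an environment meets these requirements before running the examples, a small check along the following lines may help. This is a sketch using only the standard library; the distribution name `aspose-words-foss` is taken from the install command above, and the helper names `python_is_supported` and `installed_version` are illustrative, not part of the library.

```python
import sys
from importlib import metadata

MIN_PYTHON = (3, 10)
PACKAGE = "aspose-words-foss"  # distribution name from the pip command above


def python_is_supported(version_info=None):
    """Return True if the given (or running) interpreter is Python 3.10 or later."""
    if version_info is None:
        version_info = sys.version_info
    return tuple(version_info[:2]) >= MIN_PYTHON


def installed_version(package=PACKAGE):
    """Return the installed version string, or None if the package is not installed."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None
```

Calling `python_is_supported()` with no arguments checks the running interpreter, and `installed_version()` returns `None` when the package has not been installed yet.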
Load a Document
Create a Document object by passing a file path. Supported input formats include DOCX, DOC, RTF, TXT, and Markdown.
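Before handing a path to the constructor, it can be useful to validate it. The sketch below is one way to do that under stated assumptions: the extension set mirrors the formats listed above, `.md` is assumed to be the Markdown extension, and `check_input` and `load_document` are hypothetical helper names rather than library functions.

```python
from pathlib import Path

# Extensions for the input formats listed above; ".md" for Markdown is an assumption.
SUPPORTED_INPUT_EXTENSIONS = {".docx", ".doc", ".rtf", ".txt", ".md"}


def check_input(path):
    """Return the path as a Path if it exists and has a supported extension, else raise."""
    p = Path(path)
    if p.suffix.lower() not in SUPPORTED_INPUT_EXTENSIONS:
        raise ValueError(f"Unsupported input format: {p.suffix or '(none)'}")
    if not p.is_file():
        raise FileNotFoundError(f"No such file: {p}")
    return p


def load_document(path):
    """Validate the path, then load it with the Document constructor used in this guide."""
    import aspose.words_foss as aw  # lazy import so check_input works without the library

    return aw.Document(str(check_input(path)))
```

The plain constructor call remains the basic way to load a file: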
import aspose.words_foss as aw
doc = aw.Document("input.docx")
Convert to PDF
Call save() with SaveFormat.PDF:
import aspose.words_foss as aw
doc = aw.Document("input.docx")
doc.save("output.pdf", aw.SaveFormat.PDF)
Convert to Markdown
Supported input formats for Markdown conversion include DOCX, DOC, RTF, TXT, and Markdown files.
Pass SaveFormat.MARKDOWN to save output:
import aspose.words_foss as aw
doc = aw.Document("input.docx")
doc.save("output.md", aw.SaveFormat.MARKDOWN)
Extract Text
Use Document.get_text() to extract all text content:
import aspose.words_foss as aw
doc = aw.Document("input.docx")
text = doc.get_text()
Summary
| Task | Method |
|---|---|
| Load a document | aw.Document("path") |
| Export to PDF | doc.save("output.pdf", aw.SaveFormat.PDF) |
| Export to Markdown | doc.save("output.md", aw.SaveFormat.MARKDOWN) |
| Export to plain text | doc.save("output.txt", aw.SaveFormat.TEXT) |
| Extract text | doc.get_text() |
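
The export rows in the table differ only in the format constant, so the choice can be derived from the output file's extension. The following is a minimal sketch of that idea, not a library feature: `save_format_name` and `convert` are hypothetical helpers, and the extension mapping (including `.md` for Markdown) is an assumption. It relies only on `Document`, `save`, and the `SaveFormat.PDF`, `SaveFormat.MARKDOWN`, and `SaveFormat.TEXT` members that appear in this guide.

```python
from pathlib import Path

# Output extension -> SaveFormat member name used in this guide (mapping is an assumption).
FORMAT_BY_EXTENSION = {".pdf": "PDF", ".md": "MARKDOWN", ".txt": "TEXT"}


def save_format_name(output_path):
    """Return the SaveFormat member name for an output path, or raise ValueError."""
    suffix = Path(output_path).suffix.lower()
    try:
        return FORMAT_BY_EXTENSION[suffix]
    except KeyError:
        raise ValueError(f"No save format mapped for {suffix or '(no extension)'}") from None


def convert(input_path, output_path):
    """Load input_path and save it in the format implied by output_path's extension."""
    import aspose.words_foss as aw  # lazy import keeps save_format_name usable on its own

    doc = aw.Document(str(input_path))
    doc.save(str(output_path), getattr(aw.SaveFormat, save_format_name(output_path)))
```

With this helper, `convert("input.docx", "output.md")` would make the same calls as the Markdown row in the table, and an unmapped extension raises `ValueError` before any file is loaded.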