aboutsummaryrefslogtreecommitdiffstats
path: root/textproc/py-pdf2docx/pkg-descr
blob: b0d2d9e319a850d486127c7aca00292150a1106b (plain) (blame)
1
2
3
4
5
6
pdf2docx is a Python program and library to convert pdf to docx.

pdf2docx allows to:
* Extract data from PDF with PyMuPDF, e.g. text, images and drawings
* Parse layout with rule, e.g. sections, paragraphs, images and tables
* Generate docx with python-docx