“Pandoc is a Haskell library for converting from one markup format to another, and a command-line tool that uses this library.” Man, this thing gives me a tingly feeling. Not sure I want to suffer the 70-MB Haskell installation, though,
“Pandoc is a Haskell library for converting from one markup format to another, and a command-line tool that uses this library.” Man, this thing gives me a tingly feeling. Not sure I want to suffer the 70-MB Haskell installation, though,
PDFMiner is a suite of programs that aims to help analyzing text data from PDF documents. It includes a PDF parser, a PDF renderer (though only rendering text is supported for now), and a couple of nice tools to extract texts.
The Apache XML Graphics Project is responsible for software licensed to... the creation & maintenance of: * the conversion of XML formats to graphical output * related software components...
Recently I was implementing PDF generation for a project utilizing the fantastic library Prince XML. I came across a blog article with a basic library and helper set for Prince