To answer my own question, the solution we came up with is to use OpenOffice + PyODConverter to convert the documents. Then used ImageMagick (already using it in OpenACS so it was a perfect fit for us) to convert images and also to do the concatenation of PDFs.