I wrote some code (it's in CVS at /packages/xcms-ui/tcl/mime-procs.tcl) to convert Word to HTML. It runs the document through wvWare and then Tidy.
It seems to be pretty effective. It was used to convert a few thousand Word documents for insertion into the content repository.
I want to finish this feature so that filters can be defined for more types of conversion.
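
To give an idea of what definable filters could look like, here is a rough sketch. It is written in Python rather than the Tcl of mime-procs.tcl, and it is not the actual code: the registry, the function names (register_filter, build_commands, convert) and the "{input}" placeholder are all made up for illustration. The exact wvWare and Tidy arguments are also assumptions, so check them against the versions you have installed.

```python
import subprocess

# Hypothetical registry: (source MIME type, target MIME type) -> list of steps.
# Each step is (argv, accepted exit codes). "{input}" in argv is replaced by
# the path of the source file; each step also receives the previous step's
# output on stdin (the first step gets empty input).
FILTERS = {}


def register_filter(source_type, target_type, steps):
    """Store a conversion chain for one (source, target) pair."""
    FILTERS[(source_type, target_type)] = list(steps)


def build_commands(path, source_type, target_type):
    """Return the concrete (argv, accepted codes) steps without running them."""
    try:
        steps = FILTERS[(source_type, target_type)]
    except KeyError:
        raise LookupError(
            f"no filter from {source_type} to {target_type}") from None
    return [([arg.replace("{input}", path) for arg in argv], ok)
            for argv, ok in steps]


def convert(path, source_type, target_type):
    """Run the chain and return the final output as bytes."""
    data = b""
    for argv, ok in build_commands(path, source_type, target_type):
        proc = subprocess.run(argv, input=data, stdout=subprocess.PIPE)
        if proc.returncode not in ok:
            raise RuntimeError(f"{argv[0]} exited with {proc.returncode}")
        data = proc.stdout
    return data


# Word to HTML: wvWare writes HTML to stdout, then Tidy cleans it up.
# Tidy exits with 1 when it only has warnings, so 1 is accepted here.
# (Assumed arguments; adjust for your installation.)
register_filter("application/msword", "text/html", [
    (["wvWare", "{input}"], {0}),
    (["tidy", "-q", "-asxhtml"], {0, 1}),
])
```

With wvWare and Tidy on the PATH, convert("report.doc", "application/msword", "text/html") would return the cleaned-up HTML as bytes. Adding another conversion is then just another register_filter call, with no change to the code that runs the chain.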