Forum OpenACS Development: Converting Powerpoint slides to text

Collapse
Posted by Dirk Gomez on
For search I need to convert powerpoint slides to text. Firstly I tried catppt but it infinitely looped and eventually made .LRN come to a halt.

Right now i use strings but it doesn't do a particularly good job.

What's a good *free* tool?

Collapse
Posted by Nima Mazloumi on
Hi Dirk,

I highly recommend xlhtml project. It contains

- Excel 2 HTML converter xlhtml
- Powerpoint to HTML converter ppthtml

The result is returned to standard out. If you pipe that through a proc that strips out the HTML tags you have a clean and fast solution:

Sources:
http://chicago.sourceforge.net/xlhtml/

Reference:
http://www.linuxdevcenter.com/pub/a/linux/2005/05/26/textonly.html

Collapse
Posted by Gary Roesler on
Nima is right xlhtml is what i used for my search procs... i just did a little tweaking to the source before compiling it to improve performance and error problems
Collapse
Posted by Dirk Gomez on
Gary, on which platform did you deploy? I am having problems compiling on Solaris and the xlhtml mailing list suggests I am not the first one - did you happen to deploy on Solaris?
Collapse
Posted by Gary Roesler on
Sorry Dirk,
My system runs on Mandrake
Collapse
Posted by Tournier Cyril on
Hi!

I met some problems to install XHTML..
I tried the "./configure" and then the "make && make install" but unforunately the command is not recognize..
Do you know what I can do???

Many Thanks
Cyril