Oracle Text (AKA Intermedia) can index documents stored in tables, in the file system, and on the web. Our current "search" package only indexes content held in tables. I want to index files directly, specifically PDF documents (Oracle understands over 100 formats).
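To make the file-based approach concrete, here is a minimal sketch (not taken from any existing package) of the DDL it would involve. It only builds SQL strings, so it runs without a database. The table `pdf_docs`, the column `file_path`, and the index name are hypothetical. The sketch assumes the predefined preferences CTXSYS.FILE_DATASTORE and CTXSYS.AUTO_FILTER; older releases shipped CTXSYS.INSO_FILTER instead, so check what your version provides. With a file datastore the indexed column holds path names, and the files must be readable from the database server.

```python
# Sketch only: builds Oracle Text statements as strings; nothing is executed.
# All table, column and index names below are hypothetical placeholders.

def file_index_ddl(table, path_column, index_name,
                   filter_pref="CTXSYS.AUTO_FILTER"):
    """Return DDL for a CONTEXT index over a column of file paths.

    FILE_DATASTORE tells Oracle Text to read each document from the
    file named in the column instead of indexing the column text itself.
    On older releases, pass filter_pref="CTXSYS.INSO_FILTER" (assumption:
    verify which filter preference your installation has).
    """
    params = f"DATASTORE CTXSYS.FILE_DATASTORE FILTER {filter_pref}"
    return (
        f"CREATE INDEX {index_name} ON {table}({path_column}) "
        f"INDEXTYPE IS CTXSYS.CONTEXT PARAMETERS ('{params}')"
    )


def contains_query(table, path_column):
    """Return a ranked full-text query using a bind variable :q."""
    return (
        f"SELECT {path_column}, SCORE(1) AS relevance FROM {table} "
        f"WHERE CONTAINS({path_column}, :q, 1) > 0 "
        f"ORDER BY SCORE(1) DESC"
    )


if __name__ == "__main__":
    print(file_index_ddl("pdf_docs", "file_path", "pdf_docs_ctx_idx"))
    print(contains_query("pdf_docs", "file_path"))
```

The point of the sketch is that no conversion script or duplicated content is needed: the table stores only paths, and Oracle does the filtering at index time.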
The static-pages package goes part of the way. It makes static HTML pages searchable by duplicating their content in the database. It also allows adding gc comments, which seems very specific to HTML. (I noticed that search takes the approach of conversion scripts, and some are provided for the essential formats, including PDF.) It is not quite what I want, because I'd like to make more use of Oracle's own capabilities rather than converting or copying the content first.
So I'm considering creating a new package that works alongside intermedia-driver and search. Any ideas, caveats, or suggestions?