Hi Rafael,
<blockquote> we have developed an extension to Postgres that does
*automatic* classification
</blockquote>
We are working on a similar module, but trying to classify documents according to the projects (or: work groups) in which they are created in a company. This way we already get training sets for free (the documents in a projects filestorage).
However, we are going for multiple classifications (a vector of probablilities for each project) instead of a single "best fit" category.
We have looked around a bit, but we haven't found your code anywhere. Is it still available and/or GPL?
I think we should really try to kick the asses of these Autonomy guys ... 😊
Frank