Forum OpenACS Development: new: automatic document classification on openACS

1: new: automatic document classification on openACS

Posted by Rafael Calvo on 01/06/03 05:59 AM

I have been meaning to open a discussion about this for a while, but it has taken me some time to get to it.
We are working on adding automatic document management tools to openACS applications. These tools are based on machine learning algorithms that classify documents automatically.
We plan to add this as a service in openACS so any package can use it.
What is it for?
If you have a lot of "documents" you often need to classify them. You can (and probably should) ask users, but this is sometimes not possible or is inefficient, especially if users do not understand the ontologies used. For example, how many of us effectively use the classification in the bboard postings? How about suggesting the best approximation if the user doesn't enter one? Precision can often exceed 80%.
A first step was taken by David Bell, who added our classification system to PostgreSQL. It will work in a similar way to openFTS, except that openFTS does information retrieval and this does classification.
If you are interested, you can have a look at David's project on last year's projects page or in a first draft paper, or contact me.
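
To make the idea of suggesting a category for an unclassified posting more concrete, here is a minimal sketch in Python. The thread does not say which learning algorithm the tool uses, so the multinomial naive Bayes classifier below is only an assumption chosen for brevity. The class name, the category names and the sample postings are all hypothetical, and none of this reflects the PostgreSQL-level implementation.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


class NaiveBayesClassifier:
    """Multinomial naive Bayes with add-one smoothing (illustrative only)."""

    def __init__(self):
        self.doc_counts = Counter()               # category -> training documents seen
        self.word_counts = defaultdict(Counter)   # category -> word -> occurrences
        self.vocabulary = set()                   # every word seen during training

    def train(self, text, category):
        """Record one labelled document."""
        self.doc_counts[category] += 1
        for word in tokenize(text):
            self.word_counts[category][word] += 1
            self.vocabulary.add(word)

    def scores(self, text):
        """Return category -> unnormalized log-probability for the text."""
        total_docs = sum(self.doc_counts.values())
        vocab_size = len(self.vocabulary)
        words = tokenize(text)
        result = {}
        for category, n_docs in self.doc_counts.items():
            counts = self.word_counts[category]
            total_words = sum(counts.values())
            score = math.log(n_docs / total_docs)  # log prior of the category
            for word in words:
                # Add-one smoothing keeps unseen words from zeroing the score.
                score += math.log((counts[word] + 1) / (total_words + vocab_size))
            result[category] = score
        return result

    def suggest(self, text):
        """Return the best-scoring category, or None if nothing was trained."""
        scored = self.scores(text)
        if not scored:
            return None
        return max(scored, key=scored.get)


# Hypothetical bboard postings with hand-assigned categories.
classifier = NaiveBayesClassifier()
classifier.train("How do I tune PostgreSQL vacuum and indexes for a large table", "database")
classifier.train("Query planner picks a sequential scan instead of the index", "database")
classifier.train("AOLserver crashes on startup after changing the config file", "server")
classifier.train("Tcl error in nsd log when the server starts", "server")

# Suggest a category for a posting whose author did not pick one.
print(classifier.suggest("Slow query on a table with an index"))
```

A package could use the suggestion only as a default that the user is free to override, which matches the "best approximation" use described above. A real deployment would need far more training data than four postings.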

2: Re: new: automatic document classification on openACS (response to 1)

Posted by Dave Bauer on 04/17/03 01:04 PM

Is there any progress on this package? It would be very useful, particularly to openacs.org forums.

3: Re: new: automatic document classification on openACS (response to 1)

Posted by Rafael Calvo on 04/20/03 11:51 PM

Dave,

The tool is working but we haven't packaged it for OACS yet (we haven't had the need yet). It adds the functionality at the PostgreSQL level but it has no interface.