Dan's already working on workflow - let's not duplicate efforts.
Can you package up the query extractor by itself so we don't need to download things like pyXML in order to run the basic extractor? Or am I reading too much into your post?
I'm going to e-mail you some comments on the query extractor. Overall it looks like a great start...