Forum OpenACS Q&A: Re: Invalid Unicode character sequence found in pg index.

Depending on the OS used, you might want to prefilter using either "recode" "iconv" to silently remove those gremlins.
I'm using RHE - so iconv is available.
But I'm not sure what encoding to specify to and from.
For reading into the indexer, should I be converting the output from pdftotext to UNICODE or UTF-8? From LATIN1/ASCII. I've tried these combinations with no luck.

Any help from someone familiar with iconv and pdftotext?

Thanks

hello