
Forum OpenACS Q&A: Re: Invalid Unicode character sequence found in pg index.


3: Re: Invalid Unicode character sequence found in pg index. (response to 1)

Posted by Torben Brosten on 05/23/06 02:47 PM

Depending on the OS used, you might want to prefilter using either "recode" or "iconv" to silently remove those gremlins.
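As a rough illustration of that kind of prefilter, here is a minimal Python sketch that drops any bytes that do not form valid UTF-8 and passes everything else through. It assumes the text is meant to be UTF-8 in the first place; the function name `strip_invalid_utf8` is hypothetical and is not part of OpenACS, recode, or iconv. With GNU iconv, the `-c` option (omit invalid characters from output) does a similar job.

```python
def strip_invalid_utf8(raw: bytes) -> bytes:
    """Return raw with every byte sequence that is not valid UTF-8 removed.

    Assumption: the input is supposed to be UTF-8 and only contains a few
    stray bytes. Text in some other encoding would lose its non-ASCII
    characters instead of being converted.
    """
    # errors="ignore" silently discards undecodable bytes while decoding.
    text = raw.decode("utf-8", errors="ignore")
    # Re-encoding a valid str always produces valid UTF-8.
    return text.encode("utf-8")
```

Note that this removes the offending bytes rather than repairing them, so a character that was really Latin-1 disappears from the indexed text.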

4: Re: Re: Invalid Unicode character sequence found in pg index. (response to 3)

Posted by Ryan Gallimore on 05/24/06 02:18 AM

I'm using RHE - so iconv is available.
But I'm not sure which encodings to convert from and to.
For reading into the indexer, should I be converting the output of pdftotext from LATIN1/ASCII to UNICODE or UTF-8? I've tried these combinations with no luck.

Any help from someone familiar with iconv and pdftotext?

Thanks
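One possible way to approach the "from" and "to" question, sketched in Python under stated assumptions: the database expects UTF-8, and any pdftotext output that is not already valid UTF-8 is Latin-1. Neither assumption is confirmed in this thread, and the function name `to_utf8` is hypothetical. Plain ASCII is already valid UTF-8, so it passes through unchanged.

```python
def to_utf8(raw: bytes, fallback: str = "latin-1") -> bytes:
    """Return raw as UTF-8 bytes.

    Assumptions (not confirmed by the thread): the target encoding is
    UTF-8, and input that fails to decode as UTF-8 is in `fallback`.
    """
    try:
        # Already valid UTF-8 (this includes pure ASCII): leave untouched.
        raw.decode("utf-8")
        return raw
    except UnicodeDecodeError:
        # Latin-1 maps every byte to a character, so this decode cannot fail.
        return raw.decode(fallback).encode("utf-8")
```

The try-first order matters: converting text that is already UTF-8 as if it were Latin-1 would double-encode every accented character. The rough iconv counterpart of the fallback branch would be a conversion from LATIN1 to UTF-8.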

6: Re: Re: Re: Invalid Unicode character sequence found in pg index. (response to 4)

Posted by koffi akimbo on 06/04/06 05:30 PM

hello
