Tuesday 07 December 2010 1:31:21 am
Hello everybody, I use the following code for indexing my pdf : http://share.ez.no/learn/ez-publish/indexing-multiple-binary-file-types/%28page%29/3 This script use the xpdf library http://www.foolabs.com/xpdf/download.html The problem is when I use the following command line : php updatesearchindexsolr.php -s <admin siteacces> the pdf are indexed but the special chars disappear and are replaced by a white space. But if I do the same thing with the command line interface : pdftotext example.pdf example.txt It works. I do not manage to identify why it doesn't work... Thanks in advance.
Romain Bremaud
Les clefs du net
|