Forums / Developer / binary search

binary search

Author Message

Tony Wood

Thursday 18 September 2003 6:30:31 am

Hi,

I am using eZintranet from Demo and have read http://ez.no/developer/ez_publish_3/documentation/incoming/configuring_binary_file_indexing and the optimise routines and have my ezpdftotext setup and it works from the command line.

My problem is I that the search text is not being made available to the search routine.
Where is the debug information from this routine?

tia

tony

Tony Wood : twitter.com/tonywood
Vision with Technology
Experts in eZ Publish consulting & development

Power to the Editor!

Free eZ Training : http://www.VisionWT.com/training
eZ Future Podcast : http://www.VisionWT.com/eZ-Future

Paul Borgermans

Thursday 18 September 2003 9:33:12 am

Are you sure the text is not made available? Is ezpdftotext available to the webserver?

Some other things to metion along the obvious:

- There is a search bug: you need a rule for your role which says content|read|*

- Is your binary file in the class made searchable?

- run the updatesearchindex.php script from the commandline, it gives you more clues of what's happening

- check the search index tables in the database, is there anything there from your pdf files?

- is your pdf file protected in some way?

Here its working fine, so it should work for you too. Even with large documents (one OOo generated pdf with 167 pages took roughly 4 seconds to insert which includes indexing).

-paul

eZ Publish, eZ Find, Solr expert consulting and training
http://twitter.com/paulborgermans

Tony Wood

Thursday 18 September 2003 10:35:40 am

Thanks Paul..

- Is your binary file in the class made searchable?

- run the updatesearchindex.php script from the commandline, it gives you more clues of what's happening

did it for me.. the File class in the demo is not searchable so it did not pick it up.. I ran updatesearch and it worked fine :)

Thanks

Tony Wood : twitter.com/tonywood
Vision with Technology
Experts in eZ Publish consulting & development

Power to the Editor!

Free eZ Training : http://www.VisionWT.com/training
eZ Future Podcast : http://www.VisionWT.com/eZ-Future

Paul Borgermans

Thursday 18 September 2003 11:21:06 am

Tony,

Glad it works for you too now.

I'm still looking for something that can output text from powerpoint files (under linux).

On the other hand there is more pressure in the OOo forums to have command-line conversions .... wanna do a disappear trick for .doc files into something more persistent (but the wv filter does a good job already).

-paul

eZ Publish, eZ Find, Solr expert consulting and training
http://twitter.com/paulborgermans