Forums / Extensions / eZ Find / Problems with indexing binary/pdf data in eZ Find

Problems with indexing binary/pdf data in eZ Find

Author Message

Jens Görisch

Friday 14 December 2007 10:02:50 am

Hi there,

i was grinding the whole day, trying to get some pdf data into the solr index of eZ Find ... inconclusively. The pdf document is transfered correcty to plaintext and finds its way into the eZPDFParser class. But if i want to search for content or even just some defined tags, i get no results.
Any hint somebody can give me? What did i forgotten and is there a way to get the whole indexdata in plaintext to check, what's in there, that should be?

best regards and thanks in advance

Jens Görisch

Kåre Køhler Høvik

Friday 21 December 2007 1:17:40 am

Hi

Can you provide an example PDF which you where not able to retrieve any search results from ? What PDF to text tool did you use ?

Kåre Høvik

Jordan Hirsch

Wednesday 26 March 2008 10:13:20 am

How are you handling binary file parsing with eZ Find? I've implemented various binary file parsers with the regular eZ publish search, but I haven't used eZ Find before. Is there a documented process for indexing binary files with it?

Me: http://jordan.teamhirsch.com
My blog: http://wiredformusic.blogspot.com
My other company: http://thinkimprov.com
eZ Certification: http://auth.ez.no/certification/verify/402488
eZ Award: http://ez.no/company/news/ez_awards_2007_prize_winners

Kåre Køhler Høvik

Wednesday 26 March 2008 1:45:21 pm

Hi

eZ Find is using the binary file handlers in eZ Publish. Set it up like you normally do, and it should work. Please report any misbehaviour in the issue tracker.

Best regards
Kåre

Kåre Høvik

Jordan Hirsch

Wednesday 26 March 2008 2:02:08 pm

Kåre,

Thank you for your response. I'm used to using the methodology from this article: http://ez.no/developer/articles/indexing_multiple_binary_file_types which involves creating a custom indexing script.

If I don't want to use that custom script and just want to use eZ Find, I just edit the binaryfile.ini override file and tell it the custom parsers I want to use, right?

Thanks again for the help.

Me: http://jordan.teamhirsch.com
My blog: http://wiredformusic.blogspot.com
My other company: http://thinkimprov.com
eZ Certification: http://auth.ez.no/certification/verify/402488
eZ Award: http://ez.no/company/news/ez_awards_2007_prize_winners

Kåre Køhler Høvik

Thursday 27 March 2008 2:55:55 am

Hi

Correct, overriding the binaryfile.ini file and run the update search index script provided by eZ Find.

Kåre Høvik

Jordan Hirsch

Thursday 27 March 2008 7:10:52 am

Great, thank you very much for your replies!

Me: http://jordan.teamhirsch.com
My blog: http://wiredformusic.blogspot.com
My other company: http://thinkimprov.com
eZ Certification: http://auth.ez.no/certification/verify/402488
eZ Award: http://ez.no/company/news/ez_awards_2007_prize_winners