Tags:
create new tag
view all tags

Question

Plucene was successfully indexing attachments -- .txt, .pdf, .doc, .ppt, .xls, .rtf -- and returning results. Then, one morning, with no changes to the installation that we could identify, we stopped getting search results. We have tried plucupdate and plucindex. We have deleted the entire index and reindexed. The logs look good. It appears that indexing is proceeding normally. We simply never find any of the indexed files.

Environment

TWiki version: Dakar
TWiki plugins: SpreadSheetPlugin, CalendarPlugin, CommentPlugin, EditTablePlugin, ImageGalleryPlugin, InterwikiPlugin, PreferencesPlugin, RedDotPlugin, RenderListPlugin, SlideShowPlugin, SmiliesPlugin, TablePlugin, WysiwygPlugin
Server OS: Red Hat Enterprise Linux ES release 4 (Nahant Update 2), kernel 2.6.9-22.0.2.ELsmp
Web server: Apache 2.0.52
Perl version: 5.8.5
Client OS: Ubuntu Linux
Web Browser: Firefox
Categories: Search

-- DavidHoughton - 01 May 2006

Answer

ALERT! If you answer a question - or someone answered one of your questions - please remember to edit the page and set the status to answered. The status selector is below the edit box.

I had once the issue that no files were found after plucupdate. After running plucindex it was OK. This was fixed after installing the latest SearchEnginePluceneAddOn version v2.100.

-- PeterThoeny - 07 May 2006

Is there any apache configuration needed to enable plucsearch? I'm getting no search results eventhough my index is generated perfectly. However the Apache error log says this:

Don't know how to turn into an index reader at /home/twiki/bin/plucsearch line 209, referer: http://localhost.localdomain/twiki/bin/view/TWiki/PluceneSearch

-- AlokNarula - 12 May 2006

David, is your environment running the latest SearchEnginePluceneAddOn?

-- JoanMVigo - 07 Jun 2006

Extracted from Plugins/SearchEnginePluceneAddOnDev :

  • line 209 of plucsearch tries to open the index folder. If an error occurs, it's very likely your setup is not correct. Verify the PLUCENEINDEXPATH variable in TWikiPreferences ( it should point to /your_twiki_path/plucene/index )

....

I have just deployed the Plugin in a new fresh installation - latest TWiki available build 9626. Indexing and searching both work fine.

Does your environment involve user authentication? If you have topic authentication enabled, then the script plucsearch by default may be executed as user nobody/TWikiGuest. Please, check this! Also, consider that only allowed topics for the authenticated user may be displayed as results.

If you have user authentication enabled, you should add the following lines to /twiki/bin/.htaccess if using Apache login module

<Files "plucsearch">
       require valid-user
</Files>
Otherwise, if using Template login module, launch /twiki/bin/configure script in your web browser and append plucsearch to {AuthScripts}

-- JoanMVigo - 08 Jun 2006

Extracted from Plugins/SearchEnginePluceneAddOnDev

I have checked it and finally the problem is that:

  1. plucsearch script gets the user from the SESSION object exposed by the TWiki fuync. module: my $remoteUser = $TWiki::Plugins::SESSION->{remoteUser}; and ...
  2. when using /twiki/bin/.htaccess configured to authenticate the plucsearch as described above (see my comments 08 Jun 2006), remoteUser is the one you typed, so the results are displayed ok, even with restricted webs, however ...
  3. when not using /twiki/bin/.htaccess, remoteUser is always the user guest even if you are authenticated using TemplateLogin and plucsearch appears in {AuthScripts}, so any restricted web's results are never listed.

I have tested some setup possibilities, and it seems that just editing the plucsearch script and changing line 58, replacing the old one my $remoteUser = $TWiki::Plugins::SESSION->{remoteUser}; with this new one my $remoteUser = $TWiki::Plugins::SESSION->{user}->{login}; will solve this problem, and the plucsearch script will always work, regardless which auth setup you have chosen.

Once again, I am sorry for the delayed reply.

-- JoanMVigo - 21 Jun 2006

Thank you! That got it for me.

-- GordonTerrell - 21 Jun 2006

In my case, the index runs succesfully (topics & attachments), however plucene doesn't seem to find ANY results within attachments...only in topics. Is there any way in which I can debug the application? Thanks a lot.

-- MiloValenzuela - 17 Nov 2006

For my search , the indexing works fine (both topics and attachments). But I only get topics when I run a search. Did anyone find a solution. I think there could be an error in plucsearch. The code doesnt seem to get -> name of attachments

-- RyanLi - 08 Dec 2006

Ryan, did you hit any issue when you installed the extension? Did it work fine since the beginning? I'm out of ideas to troubleshoot it.

-- MiloValenzuela - 08 Dec 2006

Are you using the latest twiki installation? Im going to try reinstalling twiki. Its a mess anyway. But i have a feeling there is something wrong either with plucindex code or the plucsearch code. It indexes the attachments but the attachments arent shown. I havent even gone into trying to search within attachments yet

-- RyanLi - 09 Dec 2006

No I'm not using the latest because it is a VM Debian Stabe install and it currently supports only v 4.4 In my case it indexes perfectly (no error in log) but it doesnt search within the docs. I think theres another simpler extension from CPAN that searches docs...I may try that one (I think the name is SWISH or something like that)

-- MiloValenzuela - 09 Dec 2006

Edit | Attach | Watch | Print version | History: r13 < r12 < r11 < r10 < r9 | Backlinks | Raw View | Raw edit | More topic actions
Topic revision: r13 - 2006-12-09 - MiloValenzuela
 
  • Learn about TWiki  
  • Download TWiki
This site is powered by the TWiki collaboration platform Powered by Perl Hosted by OICcam.com Ideas, requests, problems regarding TWiki? Send feedback. Ask community in the support forum.
Copyright © 1999-2026 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.