Tags:
create new tag
view all tags

Question

On Twiki sites indexed by search engines, debug.txt and log files grow huge .

I installed twiki on a site indexed locally by altavista... every hour! consequently, the logs files are huge, and the debug.txt filed filled the disk (it weighted 256M).

The main culprit is Diffs that outputs all the diffs in the debug file!

-- ColasNahaboo - 04 Dec 2001

The real problem is that the line in the template

    <meta name="robots" content="noindex" />
Is incorrect. if you look at the standard: http://www.w3.org/Search/9605-Indexing-Workshop/ReportOutcomes/Spidering.txt it says:

So a value of "NOINDEX" allows the subsidiary links to be explored, even though the page is not indexed.

So the real solution would be to change most lines to NONE execpt the pages we want to be indexed (view.tmpl only I think)

Answer

My solution was to run each night a small shell script via cron truncaturating debug.txt and keeping only the last 2 log files. I append it here, and I suggest this problem to be at least documented in the install process

#!/bin/ksh
# local wiki data dir
cd /net/servers/www-sop/files/wiki/sophia/data
# truncate debug.txt
>debug.txt; chmod a+rw debug.txt
# dumb but portable way to keep only last 2 log files
ls -1r log* | (read;read; while read file; do rm -f $file; done)
# keep browsers happy
echo 'Contents-type: text/us-ascii';echo; exit 0

.

Edit | Attach | Watch | Print version | History: r2 < r1 | Backlinks | Raw View | Raw edit | More topic actions
Topic revision: r2 - 2001-12-05 - ColasNahaboo
 
  • Learn about TWiki  
  • Download TWiki
This site is powered by the TWiki collaboration platform Powered by Perl Hosted by OICcam.com Ideas, requests, problems regarding TWiki? Send feedback. Ask community in the support forum.
Copyright © 1999-2026 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.