translate.pl and add sleep statements to simulate user inactivity
server-key.pem if you don't need SSL and don't want to install these modules)
mkdir wsp ; cd wsp wget -O - http://www.research.att.com/~hpk/wsp/wspv2.tgz | tar xz # (edit wsp.pl to have proper path to perl binary) wget -O - http://twiki.org/p/pub/Codev/WebScrapingProxy/translate.pl.txt >translate.pl chmod +x translate.pl
./wsp.pl -v | ./translate.pl >drive-lwp.pl
tee:
./wsp.pl -v | ./translate.pl | tee >drive-lwp.pl
|
|
translate.pl
Set-Cookie headers and reuse them for the remainder of the session
wsp.pl could do with a few improvements; patches (diff -u) could be attached to this topic #!) doesn't have the "standard" /usr/bin/perl path
-i option ignores .jpg, .gif, and .css; could/should add others (especially .png, but perhaps other media types, too)
use the SSL modules and simply disable (or fail) if trying to use the proxy with an SSL connection
translate.pl emulates a browser more fully re:HTTP headers
| WebForm | |
|---|---|
| TopicClassification | FeatureBrainstorming |
| TopicSummary | |
| InterestedParties | |
| AssignedTo | |
| AssignedToCore | |
| ScheduledFor | |
| ImplementationDate | N/A |
| RelatedTopics | |
| SpecProgress | |
| ImplProgress | |
| DocProgress | |
| I | Attachment | Action | Size | Date | Who | Comment |
|---|---|---|---|---|---|---|
| | translate.pl.txt | manage | 1.2 K | 27 Sep 2004 - 22:15 | WillNorris | converts wsp logs into LWP Perl scripts |
| | web-scraping-proxy-settings.png | manage | 28.4 K | 27 Sep 2004 - 22:14 | WillNorris | browser proxy settings dialog box |