Tags:
create new tag
, view all tags

Improving the plugins start-up times

I am timing the init times of a lot of plugins to speed-up TWiki.

On my machine it takes 7 seconds to do a click! (it's a dual AMD 2000+ with RedHAt 8.0, Apache 2, ModPerl).

Installed Plugins:

I have installed all the plugin that I have found:

NOTE: I have not checked the set-up of all plugins ... someone cold be slow just because I didn't do it properly.

Instrumenting Plugins.pm

I have instrumented (just a little) Plugins.pm to collect times with CPAN:Time::HiRes (see the attached Plugins.pm file)

Initial timings

The init time can be splitted into (ms):

  • discoverPlugins 2.6
  • eval "use $plugin" 999.7
  • readPrefsFromTopic 270

The cumulative handlers' execution on a simple page without plugin tags is (ms):

  • startRenderingHandler 92.2
  • commonTagsHandler 1638.6
  • getSessionValueHandler 13.1
  • endRenderingHandler 7.5
  • outsidePREHandler 4617.3 <<== HUGE! frown
  • insidePREHandler 0.2

I have modified Prefs.pm to cache the preferences read (see Prefs.pm attached)

By caching the Prefs we go:

  • from 270 ms to 130 ms

Then I have tweaked a couple of plugins that uses outsidePREhandler

Now the time is lower:

  • outsidePREHandler 36.3 ms <<== SMALL!! smile

The next biggest time is:

  • use "$plugin" 1057.5 ms <<== HUGE!! frown

Proposal

In the current implementation the Preferences are always loaded for each initted plugin. This time is low (a total of 130ms if cached).

Thus I propose to use the plugin topic to store a preference that is used to decide if the plugin should be initialized.

I have added a SYNTAX variable to all the plugins to initialize a Plugin only:

  • if the SYNTAX declaration is NOT present (old plugins and special ones)
  • or if the SYNTAX (a regular expression) matches the topic text
Problems:
  1. patterns produced by other plugins are not processed
  2. included topics are not processed
  3. templates are not processed
Solutions:
  1. we could:
    • define a second preference PLUGINDEPS that lists the plugins that a plugin uses:
      • this means that the plugin developer must check for new plugins often
    • else define a SAMPLE preference that show a complete sample of the plugin output:
      • this is useful also to automatically build documentation
      • if a new plugin matches the SAMPLE output it will be properly initialized
    • NOTE: in both cases the ordering of the initializations implicitly defines an efficient RenderingPipeline (the handlers are applied in the proper order)
  2. we could move the initialization to after the include processing
  3. we could pass to the initialization both the topic and template text

Resulting times

The resulting times are (in ms, on a page without particular patterns):

  • init 378.2
    • discoverPlugins 2.6
    • getPrefsFromTopic 143.1
    • use 113.4
    • initPlugin 39.0
    • registerHandler 0.7
  • getSessionValueHandler 2.4
  • startRenderingHandler 13.0
  • commonTagsHandler 20.0
  • outsidePREHandler 81.0
  • endRenderingHandler 4.8
  • writeHeaderHandler 0

They are encouraging, no? (very encouraging smile )

Comparison between plugin starting times

I table here the starting times that I have collected.

selective plugin activation OFF selective plugin activation ON

Phase Total time (ms)
discoverPlugins 2.63
getPrefsFromTopic 176.24
usePlugin 1443.78
initPlugin 203.63
registerHandler 3.66
initTotal 1986.57
Handler Total time (ms)
startRenderingHandler 62.77
outsidePREHandler 29.77
getSessionValueHandler 5.76
commonTagsHandler 788.93
endRenderingHandler 5.51
writeHeaderHandler 0.04

Plugin Init time (ms)
ActionTrackerPlugin 124.73
AgentPlugin 7.93
BeautifierPlugin 36.03
BugzillaLinkPlugin 8.22
CalendarPlugin 55.71
ChartPlugin 12.91
CommentPlugin 8.01
ConditionalPlugin 5.48
DatabasePlugin 72.72
DefaultPlugin 8.58
DoxygenPlugin 6.26
EditTablePlugin 17.76
EmbedPDFPlugin 9.24
EmbedQTPlugin 6.37
EmptyPlugin 6.27
EncryptedPagesPlugin 11.73
ExplicitNumberingPlugin 7.26
FeedbackPlugin 19.17
FindElsewherePlugin 7.54
FormFieldsPlugin 7.76
FormPivotPlugin 7.08
FormQueryPlugin 112.81
GaugePlugin 15.40
GnuSkinPlugin 15.12
HeadlinesPlugin 11.49
ImageGalleryPlugin 11.27
ImmediateNotifyPlugin 9.88
IncludeIndexPlugin 6.78
IncludeRevisionPlugin 4.96
InterwikiPlugin 12.45
JavaDocPlugin 6.80
LaTeXToMathMLPlugin 49.37
LdapPlugin 411.94
ListOfSkinsPlugin 7.75
LocalCityTimePlugin 7.43
LocalTimePlugin 88.95
MathModePlugin 80.09
NavPlugin 10.22
NavbarPlugin 8.61
NewsPlugin 5.92
NotifyOnChildModificationPlugin 7.24
PdfPlugin 7.93
PerlDocPlugin 11.45
PerlSamplePlugin 21.00
PhantomPlugin 24.18
PollPlugin 10.74
PowerEditPlugin 5.65
ProgramsPlugin 15.85
PrologSamplePlugin 5.41
QuickCalendarPlugin 16.54
QuickSearchPlugin 9.06
RandomQuotePlugin 9.67
RandomTopicPlugin 25.07
RecursiveRenderPlugin 9.51
RedirectPlugin 6.90
RevRecoverPlugin 6.31
RevisionLinkPlugin 9.61
RicherSyntaxPlugin 11.52
RollupPlugin 7.40
SearchToTablePlugin 9.62
SectionalEditPlugin 9.90
SessionManagerPlugin smile 9.27
SessionPlugin 5.18
SingletonWikiWordPlugin 6.25
SlideShowPlugin 11.77
SmiliesPlugin 13.29
SourceHighlightPlugin 32.05
SpacedWikiWordPlugin 5.07
SpreadSheetPlugin 19.56
StylePlugin 10.83
SyntaxHighlightingPlugin 8.02
TWikiDrawPlugin 9.47
TablePlugin 24.91
TigerSkinPlugin 13.72
TocPlugin 28.20
TodaysVisitorsPlugin 10.04
TopicVarsPlugin 10.07
TranslateTagPlugin 69.33
TreePlugin 31.60
TypographyPlugin 9.05
UpdateInfoPlugin 5.71
UserCookiePlugin 6.28
VersionLinkPlugin 8.92
XmlXslPlugin 5.90
XpTrackerPlugin 54.70
Total init time (ms) 1973.75

Phase Total time (ms)
discoverPlugins 2.68
getPrefsFromTopic 147.95
usePlugin 123.55
initPlugin 41.18
registerHandler 0.76
initTotal 418.62
Handler Total time (ms)
getSessionValueHandler 2.83
startRenderingHandler 13.14
commonTagsHandler 17.93
outsidePREHandler 11.92
endRenderingHandler 4.46
writeHeaderHandler 0.03

Plugin Init time (ms)
ActionTrackerPlugin 3.21
AgentPlugin 2.89
BeautifierPlugin 1.50
BugzillaLinkPlugin 2.29
CalendarPlugin 1.30
ChartPlugin 2.71
CommentPlugin 2.88
ConditionalPlugin 1.54
DatabasePlugin 2.07
DefaultPlugin 2.04
DoxygenPlugin 2.14
EditTablePlugin 1.59
EmbedPDFPlugin 2.18
EmbedQTPlugin 1.68
EmptyPlugin 5.42
EncryptedPagesPlugin 1.67
ExplicitNumberingPlugin 1.97
FeedbackPlugin 1.82
FindElsewherePlugin 7.38
FormFieldsPlugin 7.69
FormPivotPlugin 2.24
FormQueryPlugin 2.55
GaugePlugin 3.65
GnuSkinPlugin 2.81
HeadlinesPlugin 2.95
ImageGalleryPlugin 4.02
ImmediateNotifyPlugin 3.06
IncludeIndexPlugin 2.04
IncludeRevisionPlugin 2.29
InterwikiPlugin 10.70
JavaDocPlugin 3.40
LaTeXToMathMLPlugin 2.65
LdapPlugin 3.54
ListOfSkinsPlugin 2.14
LocalCityTimePlugin 20.09
LocalTimePlugin 2.89
MathModePlugin 2.34
NavPlugin 2.74
NavbarPlugin 1.79
NewsPlugin 2.31
NotifyOnChildModificationPlugin 6.83
PdfPlugin 7.47
PerlDocPlugin 2.86
PerlSamplePlugin 1.69
PhantomPlugin 23.29
PollPlugin 5.10
PowerEditPlugin 4.73
ProgramsPlugin 30.41
PrologSamplePlugin 1.94
QuickCalendarPlugin 1.74
QuickSearchPlugin 2.50
RandomQuotePlugin 2.42
RandomTopicPlugin 2.95
RecursiveRenderPlugin 2.63
RedirectPlugin 2.70
RevRecoverPlugin 2.55
RevisionLinkPlugin 2.50
RicherSyntaxPlugin 4.22
RollupPlugin 2.01
SearchToTablePlugin 2.15
SectionalEditPlugin 3.76
SessionManagerPlugin smile 2.86
SessionPlugin 4.30
SingletonWikiWordPlugin 2.84
SlideShowPlugin 3.04
SmiliesPlugin 12.29
SourceHighlightPlugin 2.20
SpacedWikiWordPlugin 4.11
SpreadSheetPlugin 3.60
StylePlugin 8.78
SyntaxHighlightingPlugin 3.08
TWikiDrawPlugin 3.79
TablePlugin 23.55
TigerSkinPlugin 3.85
TocPlugin 25.71
TodaysVisitorsPlugin 3.00
TopicVarsPlugin 8.21
TranslateTagPlugin 9.95
TreePlugin 3.71
TypographyPlugin 7.82
UpdateInfoPlugin 2.30
UserCookiePlugin 4.69
VersionLinkPlugin 3.33
XmlXslPlugin 4.68
XpTrackerPlugin 3.84
Total init time (ms) 408.15


Comments

Andrea, I really like the fact that you profiled the code base, always a sign of diligent engineer. I had wanted to do it myself, but never got the time. One suggestion I would like to make, is that you (or someone who has time) put code into twiki that uses Timer::HiRes or whatever that will log times (if requested) for all rendered pages and separate out by plugin. If we have more global visibility to what plugins are pigs or what code areas are pigs we will get more people looking into fixing them.

As for other suggestions. If I understand your issue regarding 'patterns produced by other plugins are not processed' to be that a plugin that produces twiki output might not have that output rendered, I would suggest that you push that responsibility of declaring the output of the plugin to be either clean (needing no further rendering by other plugins) or dirty (needing further rendering by other plugins) to the plugin developer.

Not sure if it helps, but you get what you pay for. wink

-- JohnCavanaugh - 16 Sep 2003

Thanks for the "diligent engineer" smile ... I was simply annoyed of waiting 7 seconds for each click.

My version of Plugins.pm is already almost completely "instrumented" with Time::HiRes calls ... I'll immediately complete it as for your suggestion to show in (the logs? in an automatically generated page?) a nice table of the time spent in each init phase and in each handler.

Your idea for the clean/dirty declaration suggests me that if a plugin would declare its pre-requisites I could init all the prerequisites also. Thus a MytagPugin that contains in its topic page the lines

   * Set SYNTAX = %MYTAG{.*?}%
   * Set PREREQUISITES = TablePlugin
would init also the TablePlugin if the %MYTAG{.*?}% is found (and so on and so on recursively)

It remains to handle templates ... this should'nt be too difficult.

-- AndreaSterbini - 17 Sep 2003

Im not sure at run time or even code time pne could know all the prereqs needed.

What I was attempting to get at is a plugin is deemed a "clean" plugin iff when processing tags it never inserts text that has other tags. Ie. If you did something like s/%TAG%/xxxx/, the xxx would never have any %MYTAG{.*?}% in it. [A potential other name instead of clean could be "terminal node"]

A plugin is considered dirty if does or might inject %MYTAG{.*?}% in its processing. [A potential other name instead of dirty could be "recursive" or "branch node"]

Im thinking along the lines of two categories of plugins. For lack of a better term I would group plugins that only have %MYTAG{}% style syntax as SyntaxPlugins. These SyntaxPlugins could be broken down into 2 further categories, namely clean and dirty, following the definition above.

Another category grouping of plugins would be (again for lack of a better term) FreeformPlugins. These would be plugins that have unusual freeform syntax, something like the SmiliesPlugin would be a classic example.

Then a rendering pipeline could be established.

SyntaxPlugins(Dirty) [Loop] -> SyntaxPlugins(Clean) -> FreeformPlugins -> DefaultTwikiRendering

Granted an assumption here is that FreeformPlugins are always clean. I couldnt think of any exceptions to this off the top of my head, but I guess it is possible.

I think the benefit here is the recursive looping only occurs on dirty SyntaxPlugins, which I believe would help the rendering time. But as with most things, having the real data (as part of your profiling) is what will really answer the questions.

Separately as well, I think we might want to strive for NOT processing the plugin prefs for every rendering. Thinking further out, focusing on how mod_perl could optimize things I believe would be prudent. Specifically, I believe a strategy that loads the prefs and keeps them in memory for all plugins and only "reloads" them at specific times (perhaps on demand) would also make things much faster.

I state (if possible) because Im not sure exactly how shared memory would work across multiple httpd instances. We would definitely need a mod_perl expert to comment.

Just another $0.02 that I hope makes sense.

-- JohnCavanaugh - 18 Sep 2003

I see ... a plugin developer shouldn't know about the other plugins ...

The ideal situation would be if a plugin declares both the syntax he recognizes and the syntax of the produced output ... and then deduce if a plugin needs another by looking at the two declaration ... a not simple problem if you want to match a regexp over another regexp. I'll give a thought to this.

By ordering appropriately the initialization we get the proper sequence of application of handlers ... then (implicitly) your pipeline ...

About Prefs ... my new implementation caches each set of variables on disk and loads/join the needed ones when the prefs-init is called ... this halves the time (that's small for almost 100 files).

A better idea could be start using CPAN:Cache::Cache to share Preferences (but also other general data structures) across calls on different apache processes, even without mod_perl.

I will do some experiments on that .... it could make TWiki blazingly fast!

-- AndreaSterbini - 18 Sep 2003

I have found the Grail project to manipulate finite languages, regular expressions and finite machines ... perhaps the problem is solved by:

  • generating the language of the output regexp (with retofl)
  • and filtering the stream with the input regexp (with retofm and flfilter)
  • if anything comes out then we have a match
... but this will take almost forever (I fear) smile

... but it has to be recomputed only when a plugin page changes ....

... else we could use a minimal approach:

  • the plugin declares an example of its output (a text, not a regexp)
  • all the other plugins that match on such example are initialized after it

  • the nice thing about this approach is that we already are showing output examples! (as documentation)

Now I should find a way to declare the OUTPUT_EXAMPLE:

  • as an OUTPUT_EXAMPLE variable (but I have problems with the \n
  • or as the text enclosed between %STARTINCLUDE% and %ENDINCLUDE% (or %OUTPUT_EXAMPLE% and %END_OUTPUT_EXAMPLE%)

-- AndreaSterbini - 18 Sep 2003

I have slightly refactored the topic and added a table of times above.

-- AndreaSterbini - 22 Sep 2003

copied from HighQualityPlugins:

I think that bad performance is inherent to the current Plugin mecanism. In my opinion, this should be changed first. I am not a perl programmer, but points which strikes me as inefficient are:

  • Nothing should be done at runtime to registrer a new plugin. Some script should be called to install/deinstall a plugin, modifying perl code. Now each time a page is viewed, the list of all possible plugins are scanned, initialized, etc..
  • Now the text body is scanned wholly N times, for each possible %-var. An obvious optimization would be to just scan the body ONCE for an %-vars, such as: %[A-Z0-9_]+({.*?})?%, and for each match, call the plugin-provided function/method registred for each var, stocked in an hashtable, that could be statically defined in a perl module, so that even its instanciation could be done at server start when using perl_mod, not on each request. This plugin install script would thus just add/remove lines in this .pm file...
Ideas?

-- ColasNahaboo - 30 Dec 2003

_ Now the text body is scanned wholly N times, for each possible %-var.An obvious optimization would be to just scan the body ONCE for an %-vars_

This can be done only if no plugin produces an output with an %-var in it, and there is at least one plugin that I know (XpTrackerPlugin) that put in it's output.

Extending the Idea, perhaps several scanning can be done until there's nothing to transform. I bet that at most it'll be 2 or 3 passes.

-- RafaelAlvarez - 05 Jan 2004

Bumping Feature topic marked as Scheduled for CairoRelease that hasn't been modified recently.

-- SamHasler - 20 Apr 2004

Another idea (borrowed from the way Java handle the JDBC Driver selection based on the provided URL):

The rendering process could scan for tags in the form %.*?% and ask each plugin if it recognize it. The content of the tag is passed to the first plugin that recognized it.

This way the plugin don't need to parse the page to find it's tags (XpTrackerPlugin parses a page a lot of times because it can recognize several tags). This process could be repeated until there are not more tags present in the page (to allow plugins to emit text that uses other plugins) or if all the tags in the pass failed to be served by a plugin.

If you go further and move all the TWiki vars processing into a Plugin then you can skip a whole step in the rendering process.

hmmm. I'll give it a little more thought.

-- RafaelAlvarez - 20 Apr 2004

I think that's a very good idea. One concern you'll need to deal with is loops in the generation, though. Have a limit on the number of passes through the text before you give up on trying to get rid of all 's even if some are still being recognized.

-- WalterMundt - 20 Apr 2004

What if you recursivly parse the replacement text to see if there are new tags in it, and pass the list of plugins that have already processed it and don't allow a plugin to process the text more than once.

-- SamHasler - 21 Apr 2004

Finally, I'm uploading the proof-of-concept implementation for the alternate plugin tag handling in AlternatePluginTagHandling

-- RafaelAlvarez - 28 Apr 2004

Feature bumped to Dakar due to lack of progress. I already did a lot in Cairo to improve this; but there is more that could be done.

-- CrawfordCurrie - 30 Jun 2004

This is a fantastic analysis. I hope that addressing this can be a principle contribution towards DakarRelease's performance goals.

-- MartinCleaver - 21 Oct 2004

I've looked hard at accelerating plugins, and it basically requires some heavy refactoring of the core rendering loop to achieve performance improvmenets. Because the plugins API is so tightly tied to this architecture, that implies a huge number of changes to plugins. In the short term, plugins can take advantage of the registeragHandler method in TWiki.pm for considerable improvements in tag handling performance, but until there is a commitment to upgrade existing plugins there is no point in scheduling anything else. -- CrawfordCurrie - 15 Feb 2005

Topic attachments
I Attachment History Action Size Date Who Comment
Perl source code filepm Plugins.pm r1 manage 15.5 K 2003-09-22 - 22:01 AndreaSterbini  
Perl source code filepm Prefs.pm r1 manage 11.0 K 2003-09-22 - 22:02 AndreaSterbini  
Edit | Attach | Watch | Print version | History: r26 < r25 < r24 < r23 < r22 | Backlinks | Raw View | Raw edit | More topic actions
Topic revision: r26 - 2005-02-15 - CrawfordCurrie
 
  • Learn about TWiki  
  • Download TWiki
This site is powered by the TWiki collaboration platform Powered by Perl Hosted by OICcam.com Ideas, requests, problems regarding TWiki? Send feedback. Ask community in the support forum.
Copyright © 1999-2017 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.