Tags:
create new tag
view all tags

LISA Conference 2002 Notes

http://www.usenix.org/lisa02/

Tutorial Programs

  • Sunday November 3, 2002
  • Blueprints for High Availability: Designing Resilient Distributed Systems
    • Evan Marcus, Veritas Software Corp. evan@veritasPLEASENOSPAM.com
    • Drop Evan an email to get updated slides
    • "High Availability cannot be achieved by merely installing Failover Software and walking away."
    • Keep Availability Statistics
      • Include root cause of problems if possible
    • System naming conventions
      • Use mnemoic names - violate rules of simplicity
      • Add personification
    • Security Book
      • Secret's & Lies - Bruce Schnier
    • Crack to check passwords

  • Tuesday November 5, 2002
    • Intro to Massive Upgrades and Changes
      • Christine Hogan and Tom Limoncelli
      • "The Practice of System and Network Administration" by Limoncelli and Hogan
      • http://www.EverythingSysadmin.com
      • Create a list of services, software, users on a checklist
      • netstat -a, rc files, ps
      • Backout plan
      • Notification communicated
      • Test verifications
      • Upgrade
      • Test verifications, debug
      • Notify upgrade completed to same community

Technical Sessions

  • Wednesday November 6, 2002
    • AFS Guru session
      • Not very good, doesn't really seam like something that would work easily for us.
    • The Constitutional and Financial Arguement Against SPAM.
  • Thursday November 7, 2002
    • Monitoring and Logging
      • Addamark search
        • Used for large search engines, yahoo, Atom, etc.
      • Mielog - Visual Log Browser
      • Detecting Events that Didn't happen
        • Something that should usually have happened that didn't
      • Defining and Monitoring Service Level Agreements for dynamic e-Business
        • Why do sysadmins care about SLAs?
        • How much does it cost to guarantee a Response time less than 1 sec?
        • 3rd party for web service monitoring
      • Hotswap - Transparent Server Failover for Linux
        • Maintain client connections by using own IP layer to make connections to two machines
      • Defending Against Internet Attacks - SOrting Through the Value Propositions of Different Security Technologies

  • Friday November 8, 2002
      • Performance Tuning Guru
        • Jeff Allen, Tellme Networks
        • VXFS, fsadm checking for fragmenting, how much is too much? Increased kernel versus user time.
        • ncsize, kernel parameter in Solaris, name cache size avoid the work of looking up inodes.
          • explicity set ncsize in /etc/system dynamically kernel size adjustments. High kernel time due to large "vmstat -s" dnlc hit rate should be over 90%. Think about application why would it be low.
        • general approach for finding problems
          • Scientific point of view...measure first, put in new setting, measure again.
        • Where is my bottleneck? I/O bound?(iostat -x) whatching on a per controller basis, then disk basis. Watch right hand column. %busy 40-55% is OK. Memory bound, SR Scan Rate in vmstat.
        • Logging file systems could help mail system by allowing the whole process happen in the log. File creating and deleting could happen all in the log.
    • System Monitoring Guru Session
      • Doug Hughes, monitoring guru
        • Netcool - http://www.micromuse.com/products/netcool_suite_overview.html - Premiere alert, event coralation tool. GUI for prioritizing.
          • Object Server, database underneath, deduplication, correlation, automation, resolution
          • Probes, active or passive
          • 30,000 events/minute
          • Down and up event coralation
          • Syslog or snmp probes are most popular
          • VERY $$$$$$$$$$$$$$ $4million to $100,000
          • very good customer support
          • Dynamic suppression
        • Nagios http://www.nagios.com
          • is a host and service monitor designed to inform you of network problems before your clients, end-users or managers do.
          • Static suppression
    • Infrastructures Guru Session
      • Steve Traugott, http://www.infrastructures.org
        • Pull works best because some machine will be down.
        • Pull helps divergence. Host brings itself up to the latest rev.
        • Push is controlled, infrastructure is not overwhelmed.
        • Do both pushes and pull ( Pull am I up to date ) ( Trigger pulls )

To Do

-- MattMillard - 09 Nov 2002

Edit | Attach | Watch | Print version | History: r18 < r17 < r16 < r15 < r14 | Backlinks | Raw View | Raw edit | More topic actions
Topic revision: r18 - 2002-11-11 - MattMillard
 
  • Learn about TWiki  
  • Download TWiki
This site is powered by the TWiki collaboration platform Powered by Perl Hosted by OICcam.com Ideas, requests, problems regarding TWiki? Send feedback. Ask community in the support forum.
Copyright © 1999-2026 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.