0% found this document useful (0 votes)

59 views35 pages

Four Star Network Management: Jeff Allen Webtv Networks David Williamson Global Networking and Computing

1) The document discusses an approach to network management called the "Four Star" system which uses a collection of small, specialized tools rather than monolithic applications. 2) It provides examples of tools they use for different functions like trending and alert management and discusses integrating these tools by connecting them. 3) The document encourages the audience to think about their own needs and build out a customized network management system in an incremental way using tools from their online menu.

Uploaded by

Ayush Handa

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

59 views35 pages

Four Star Network Management: Jeff Allen Webtv Networks David Williamson Global Networking and Computing

Uploaded by

Ayush Handa

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 35

Four Star Network Management

Jeff Allen ([email protected]) WebTV Networks David Williamson ([email protected]) Global Networking and Computing

Where this is going

Who we think you are Who we know we are Tools as a philosophy A menu of tools to choose from Choosing from the menu A sampling of tools we like Connecting your tools Marching Orders

Our Audience
System/network administrators People who dont think they need Network Management Managers you know who you are!

Who we are
Corporate and Service background Were sick of monolithic tools that dont do what we want them to do! A toolsmith and a network admin David: MRTG has paid me money

A tale of two philosophies

The Vendor Approach:

Deploy a monolithic application/framework. Solve all problems directly, or with add-ons. Lots of risk that some part wont address your needs. Select small tools the do precisely what you need from a menu of choices. Work to interconnect into a web of tools or not. Incremental improvement reduces risk of failure.

Our approach:

The Menu, Part I

Alert management Change management Trending and thresholding Intrusion Detection Project Management Workflow automation Document control

The Menu, Part II

Time Management Inventory control Software distribution A la carte: Miscellaneous Tools

Console, dashboard, third-party diagnosis tools

Public relations Monolithic Systems

Choosing from the Menu

Scale:
Big and medium shops Small shops too!

Priority Think BIG!

This is not a closed list

Network Management isnt just for networks anymore!

WebTVs 4-Star System

Trending and Thresholding Cricket
Alert Management Netcool Workflow Management Remedy

Dashboard Approach to problem solving To be solved, if ever

Why Watch Trends?

Short-term issues make us act reactively Need data that we often dont have to make good long-term decisions
Common Questions:
Is the link to Europe up? Do we need more bandwidth to Europe?

Better Questions:
What is the current state of the link? What has it been recently? Is it what we expect it to be? Is it different from other links that should be the same? What long-term trends can we discern? Answering questions like these requires a good data collection and graphing system.

Examples

The System:

Cricket!

Cricket is a tool for storing and viewing time-series data

Very flexible Extremely Legible Graphs Space and Time efficient Platform Independent

How it works
Crickets collector runs from cron every 5 minutes and stores the data. Crickets grapher CGI script is used interactively to browse the data. The system uses a hierarchical configuration system called a Config Tree.

Too many graphs

The capacity to draw 5000 graphs hardly qualifies as a proactive monitoring tool. Humans must check the graphs now. Wouldnt be nice if Cricket could check the graphs itself? How would a computer know if a graph looks right? Cricket could send traps to an Alert Manager

Too Many Pages?

Ever had this happen to you?

Step 1: Fetch nifty monitoring package off the net. Step 2: Compile, install, point it at your pager. Step 3: Fall asleep. Step 4: Wake up to a pager with a useless message. Step 5: Go to Step 3.

Congratulations! You have just discovered the need for Alert Management!

Alert Management
Alerts are:
Any message about the state of the system Can be good, bad, or neither

Management is:
Prioritizing Filtering Escalation and de-escalation Destruction

Where do Alerts come from?

Network Devices (syslog, SNMP traps) Operating Systems (syslog, SNMP traps) Applications Cricket (threshold violations and recoveries) Miscellaneous monitoring scripts Intrusion Detection system

Netcool
A picture is worth 1000 words:
Probes Syslog Database Interfaces

P
P

R
R

Triggers
Actions

GUI: Motif, NT, Java

Traps

P = Protocol Specific R = Rules Engine

External Databases, Ticketing Systems, Perl Scripts

What it looks like:

Implementing policy
Rules engine:
Selects alerts Sets initial priority

Triggers and actions:

Calculate rates Adjust priority Automatic resolution Trim and maintain database

How we implemented policy

Configure the system to send everything in as uncategorized. See what you get. Codify policies for what gets attention:

Edit rules files to prioritize alerts

Triggers and actions for escalation, resolution, and destruction.

Implement other policies:

Workflow Systems
A system to help operations folks accomplish their mission by:
Keeping things from falling through the cracks Maintaining an audit trail Making it possible to measure things:

quality of service where all the time goes which systems (or users) are unreliable

A Good Workflow System

Helps move tasks through the organization smoothly

Handoffs happen reliably Helps operators implement established processes

Lets management understand the value of the operations staff, and where to make improvements. Is Really Hard To Make!

Why is it so difficult?
Its a software solution to an essentially social problem.

Requires commitment at a management level Requires buy-in at an operator level Lightweight, unobtrusive, accurate, quickly extensible, and completely reliable. Ha! This is software we are talking about!

To facilitate this buy-in, the software needs to be:

What WebTV uses

We have created several schemas in Remedys Action Request system. Three departments use a common Remedy server:

Development (bug tracking, configuration tracking) Operations (trouble tracking) Customer Care (call/e-mail tracking)

Operations tickets can be linked to customer tickets.

Remedy Pros and Cons

Pros

Very customizable: can solve any problem Scalable and reliable Very customizable: need consulting help to set it up, and internal expertise to manage it going forward No referential integrity Clunky UI

Cons

The good news is that its not too hard to replace its UI for simple tasks, using ARSPerl and web interface.

Where we are going

We are implementing a change management system, using Remedy.
Codifies existing best practices. Will add new procedures to avoid known mistakes. A fundamental design consideration: must be easy to use, or it will be abused or ignored. It will be advisory, not supervisory.

The Dashboard
The genesis of the idea was Spectrums device view. The vision: A dynamic web page you can go to and see everything there is to know about a host:

Embedded graphs of recent network and OS trends. Output from top, vmstat, iostat, etc. Application status (via app-specific test scripts) A button that pops up an ssh session Links to recent tickets related to this kind of machine Links to troubleshooting tips for this kind of machine

Why isnt it done?

Is it a bad idea? No, it just always falls off the bottom of the priority list. This is OK! It means you know the limits of your appetite for tools. It also leaves an interesting project for junior toolsmiths to cut their teeth on.

The Rest of the Constellation

Change Management

Multiple version control systems in use.

Project Management Software Distribution Monoliths

Spectrum: OK at mapping and displaying network topology. For us, this is a solved problem: its nice to work in a group with good executive support!

Public Relations

Connections
Once you have small tools doing useful work for you, start making connections between them. Monolithic systems fail in part because they have too many connections. Add connections only where they add value to your system or simplify it.

Examples of Connections
We have a system that puts POP Health data into Remedy tickets.

One less tool for operations folks to monitor.

Wed like to have Cricket generate alerts in Netcool.

The ability to make 5000 graphs is not a proactive tool!

The mythical Dashboard is one too!

Go forth and Think!

Take control of your environment by rolling out small tools that do what you need, a little at a time. As you add new tools, work to integrate them with what you already have. Use our website to find the tools you need and the tools weve demonstrated.

About that web site

GNAC hosts a site with material related to this presentation: http://www.gnac.com/four-star This is a work in progress! Were depending on you to help us fill out a larger menu.

Week 09 Linux
No ratings yet
Week 09 Linux
33 pages
Network Management
No ratings yet
Network Management
36 pages
The Role of Chain Management in Information Systems Operations
No ratings yet
The Role of Chain Management in Information Systems Operations
39 pages
001 Network-Management
No ratings yet
001 Network-Management
45 pages
Chapter 4
No ratings yet
Chapter 4
18 pages
Managing E-Business and Network Systems
No ratings yet
Managing E-Business and Network Systems
30 pages
Network Management Essentials
100% (1)
Network Management Essentials
34 pages
ISO Network Management Model
100% (1)
ISO Network Management Model
22 pages
CHAPTER 3 Network Maintenance
No ratings yet
CHAPTER 3 Network Maintenance
9 pages
Chapter 12
No ratings yet
Chapter 12
17 pages
Introduction To Networking Monitoring and Management
No ratings yet
Introduction To Networking Monitoring and Management
48 pages
Network Management
No ratings yet
Network Management
34 pages
Chapter Two-Nd
No ratings yet
Chapter Two-Nd
10 pages
Network Management
No ratings yet
Network Management
7 pages
Unit I Network Troubleshooting Components and Os: Problem Solving
No ratings yet
Unit I Network Troubleshooting Components and Os: Problem Solving
8 pages
Network Management Essentials
No ratings yet
Network Management Essentials
25 pages
IT Service Management Essentials
100% (1)
IT Service Management Essentials
5 pages
Chapter2 Network Admin
No ratings yet
Chapter2 Network Admin
22 pages
Drivers, Enablers and Standards2
No ratings yet
Drivers, Enablers and Standards2
69 pages
III
No ratings yet
III
27 pages
Network Configuration Management Introduction
No ratings yet
Network Configuration Management Introduction
3 pages
Network Management Essentials
No ratings yet
Network Management Essentials
23 pages
Network Management Updated
No ratings yet
Network Management Updated
8 pages
Notes: Introduction To Networking Monitoring and Management
No ratings yet
Notes: Introduction To Networking Monitoring and Management
12 pages
NM 1
No ratings yet
NM 1
24 pages
Change Management
75% (4)
Change Management
39 pages
Bit 308
No ratings yet
Bit 308
22 pages
NMT All
No ratings yet
NMT All
305 pages
Key Network Management Decisions
No ratings yet
Key Network Management Decisions
4 pages
Network Analysis and Design - CIS 460: Based On: Top Down Network Design Second Edition By: Oppenheimer, Priscilla
No ratings yet
Network Analysis and Design - CIS 460: Based On: Top Down Network Design Second Edition By: Oppenheimer, Priscilla
209 pages
Network Management Basics: Background
No ratings yet
Network Management Basics: Background
4 pages
Unit 4-Configuration Management
No ratings yet
Unit 4-Configuration Management
30 pages
Designing New IT Infrastructure - Where Do You Even Start - Networking - Spiceworks
No ratings yet
Designing New IT Infrastructure - Where Do You Even Start - Networking - Spiceworks
1 page
Network Management Functions Guide
No ratings yet
Network Management Functions Guide
37 pages
Software Configuration Basics
No ratings yet
Software Configuration Basics
65 pages
Network Planning for Organizations
No ratings yet
Network Planning for Organizations
5 pages
Public Web Status Monitoring System
No ratings yet
Public Web Status Monitoring System
60 pages
Tracking Workflow and Illustrating Value: Dr. Toby Pearlstein New York Chapter SLA March 11, 2008
No ratings yet
Tracking Workflow and Illustrating Value: Dr. Toby Pearlstein New York Chapter SLA March 11, 2008
33 pages
Driving Forces Behind Client/server: Business Perspective: Need For
0% (1)
Driving Forces Behind Client/server: Business Perspective: Need For
27 pages
Gibbard Operation Practices N45
No ratings yet
Gibbard Operation Practices N45
57 pages
Resumen MIS
No ratings yet
Resumen MIS
79 pages
Introduction To Network Management
No ratings yet
Introduction To Network Management
17 pages
Chapter Three New F
No ratings yet
Chapter Three New F
71 pages
Progress Report
No ratings yet
Progress Report
20 pages
The Complete Servicenow System Administrator Course: Section 7 - Core Applications
No ratings yet
The Complete Servicenow System Administrator Course: Section 7 - Core Applications
36 pages
Project Synopsis NMS
100% (1)
Project Synopsis NMS
8 pages
NM 9
No ratings yet
NM 9
41 pages
How To Document IT Infrastructure V1
No ratings yet
How To Document IT Infrastructure V1
28 pages
Lec 9 Solving The Problems
No ratings yet
Lec 9 Solving The Problems
23 pages
Network Management Insights
No ratings yet
Network Management Insights
10 pages
DISC 112 Computer and Problem Solving: Sessions 7-8
No ratings yet
DISC 112 Computer and Problem Solving: Sessions 7-8
29 pages
Network Management - (LO1) Part 2
No ratings yet
Network Management - (LO1) Part 2
26 pages
Auditoría de Tecnología de Información: Espinosa Ángeles Juan Abdel Hernández Medina Adrián Ramírez Antonio Alejandra
No ratings yet
Auditoría de Tecnología de Información: Espinosa Ángeles Juan Abdel Hernández Medina Adrián Ramírez Antonio Alejandra
53 pages
Handout1 Introduction SFW Configuration Management
No ratings yet
Handout1 Introduction SFW Configuration Management
24 pages
The Art of Network Architecture
No ratings yet
The Art of Network Architecture
3 pages
Moxa White Paper - Overcoming Obstacles of Industrial Network Management
No ratings yet
Moxa White Paper - Overcoming Obstacles of Industrial Network Management
9 pages
2 Project Management and Inception
No ratings yet
2 Project Management and Inception
21 pages
Bibliography
No ratings yet
Bibliography
2 pages
Anupam Singh Extended Internship Report
No ratings yet
Anupam Singh Extended Internship Report
9 pages
Web Scraping Functions Guide
No ratings yet
Web Scraping Functions Guide
5 pages
(Ebook) Beginning Lua Programming (Programmer To Programmer) by Kurt Jung, Aaron Brown, ISBN 9780470069172, 9780470139523, 0470069171, 0470139528 PDF Download
100% (1)
(Ebook) Beginning Lua Programming (Programmer To Programmer) by Kurt Jung, Aaron Brown, ISBN 9780470069172, 9780470139523, 0470069171, 0470139528 PDF Download
56 pages
LTE RF Optimization Guide
No ratings yet
LTE RF Optimization Guide
53 pages
SQL Python PowerBI Questions and Answers
No ratings yet
SQL Python PowerBI Questions and Answers
4 pages
ZXSDR R8129 Product Description - UniRAN16
100% (1)
ZXSDR R8129 Product Description - UniRAN16
21 pages
K Chief 700
100% (1)
K Chief 700
122 pages
Public Domain Book Digitization Guide
100% (2)
Public Domain Book Digitization Guide
342 pages
Database Practices Syllabus
No ratings yet
Database Practices Syllabus
4 pages
Computer Security & Network Protocols
No ratings yet
Computer Security & Network Protocols
60 pages
Massimiliano (Max) Loi: General Profile
No ratings yet
Massimiliano (Max) Loi: General Profile
2 pages
CAA V5 C++ Coding Rules
No ratings yet
CAA V5 C++ Coding Rules
19 pages
An Introduction To Rapid System Prototyping
No ratings yet
An Introduction To Rapid System Prototyping
5 pages
Network Time Protocol Vulnerability Analysis
No ratings yet
Network Time Protocol Vulnerability Analysis
13 pages
Ict Practical 2
No ratings yet
Ict Practical 2
7 pages
Presented By:: Salil K. Udainiya Abhishek Sharma Bhargava K. Mallina Deep N. Gautam
No ratings yet
Presented By:: Salil K. Udainiya Abhishek Sharma Bhargava K. Mallina Deep N. Gautam
47 pages
Disk Management MCQs & Scheduling
No ratings yet
Disk Management MCQs & Scheduling
5 pages
Oracle Row Cache Lock Troubleshooting
No ratings yet
Oracle Row Cache Lock Troubleshooting
7 pages
Core Java Cheat Sheet
No ratings yet
Core Java Cheat Sheet
11 pages
Sample Shooting Code
No ratings yet
Sample Shooting Code
5 pages
EmployeeTrackingSystem PDFCOFFEE - Com 1658006517766
No ratings yet
EmployeeTrackingSystem PDFCOFFEE - Com 1658006517766
13 pages
Business Process Modelling "As Is"
100% (1)
Business Process Modelling "As Is"
25 pages
Linux-Networking Cheat Sheet
No ratings yet
Linux-Networking Cheat Sheet
7 pages
Offensive AWS Security
No ratings yet
Offensive AWS Security
1 page
TIB BW Administration 6.1.0
No ratings yet
TIB BW Administration 6.1.0
46 pages
Data Flow Diagrams & UML Guide
No ratings yet
Data Flow Diagrams & UML Guide
28 pages
IT - OT Conv and Cybersecurity
No ratings yet
IT - OT Conv and Cybersecurity
5 pages
Tripwire Log Center: Product Brief
No ratings yet
Tripwire Log Center: Product Brief
8 pages
2021 Using Scte 224 To Increase Advertising Revenue
No ratings yet
2021 Using Scte 224 To Increase Advertising Revenue
11 pages

Four Star Network Management: Jeff Allen Webtv Networks David Williamson Global Networking and Computing

Uploaded by

Four Star Network Management: Jeff Allen Webtv Networks David Williamson Global Networking and Computing

Uploaded by

Four Star Network Management

Where this is going

A tale of two philosophies

The Menu, Part I

The Menu, Part II

Console, dashboard, third-party diagnosis tools

Public relations Monolithic Systems

Choosing from the Menu

Priority Think BIG!

This is not a closed list

Network Management isnt just for networks anymore!

WebTVs 4-Star System

Dashboard Approach to problem solving To be solved, if ever

Why Watch Trends?

Cricket is a tool for storing and viewing time-series data

Too many graphs

Too Many Pages?

Where do Alerts come from?

GUI: Motif, NT, Java

P = Protocol Specific R = Rules Engine

External Databases, Ticketing Systems, Perl Scripts

What it looks like:

Triggers and actions:

How we implemented policy

Edit rules files to prioritize alerts

Implement other policies:

A Good Workflow System

Handoffs happen reliably Helps operators implement established processes

To facilitate this buy-in, the software needs to be:

What WebTV uses

Operations tickets can be linked to customer tickets.

Remedy Pros and Cons

Where we are going

Why isnt it done?

The Rest of the Constellation

Multiple version control systems in use.

Project Management Software Distribution Monoliths

One less tool for operations folks to monitor.

Wed like to have Cricket generate alerts in Netcool.

The ability to make 5000 graphs is not a proactive tool!

The mythical Dashboard is one too!

Go forth and Think!

About that web site

You might also like