Title: OSG Grid Operations
1OSG Grid Operations
- Leigh Grundhoefer
- Indiana University
2Agenda
- Grid Operations Development
- OSG Functional Service Cycle
- Deployment
- Integration
- Provisioning
- Production
- OSG Operations Activities
3Big Picture Goals
- The OSG Operations activity and Support Centers
Group has been tasked with the role of preparing,
provisioning and running the infrastructure used
for the OSG production environment.
- The operations activity's duties to the OSG are
to ensure that the production environment is
usable for the current application base and to
continue to evolve it as a common service
environment which is able to support multiple
sciences with an application-friendly grid
infrastructure.
4Grid Operations
Other Grids
Security, Policy and Authentication
Registration, Verification and Monitoring
5Operations Environment
- Organization and Definition of core elements
- Grids
- Support Centers
- Virtual Organizations
- Resources
- Registration by Owners and Providers
- Common software services
- Verification and ongoing evaluation
- Resources and services
- Support Center ticketing, ticketing response
- VO services
6Operations Environment (cont.)
- Information from monitoring
- Job slots and file transfers
- Published policies
- Accounting
- Coordination of OSG Help Desk
- Coordinator of support centers
- Definition and execution of Standard Operational
Procedures (SOPs).
7 Grid to Grid relationships
- Understand and create specialized help desk and
trouble reporting schemas
- Understand and create monitoring and accounting
interfaces
- Create and deploy identity and authorization
interfaces
8Agenda
- Grid Operations Development
- OSG Functional Service Cycle
- Deployment
- Integration
- Provisioning
- Production
- OSG Operations Activities
9Where do you get this stuff? (Architecture and
Requirements)
Release Candidate
Blueprint (ARCH)
ITB 0.3
Integration Test Bed
Operations OSG 0.4
Provisioning
VOs
Deployment Activity
Release Description
Service Development (Sponsored Activities)
Technical Groups
10OSG Integration Activity
OSG Integration Activity
Readiness plan Effort Resources
Readiness plan adopted
VO Application Software Installation
Software packaging
OSG Deployment Activity
Service deployment
OSG Operations-Provisioning Activity
Release Candidate
Application validation
Middleware Interoperability
Functionality Scalability Tests
feedback
Metrics Certification
Release Description
11Provisioning
- Finalize all installation software and procedures
- OSG based software packages
- Pre-compiled binaries
- Source code (compiled during installation)
- Post-configuration scripts (configure_osg.sh)
- Create production Grid Support documentation
for all software and defined procedures.
12Provisioning
- Translate OSG operations model into production
operations activities
- Provide timelines for Resources and Support
Centers
- Set up versioning using release procedures
- Install or upgrade grid-wide services
13Production
- Reports
- Usage reports from OSG monitoring
- Operations reports to the OSG community
- Daily Sites Status reports to Support Centers
- Meetings
- Weekly Operations Activity
- Release-based provisioning Activity
- Weekly Support Centers Technical Group
- Weekly Documentation Activity
- Monitors for verification
- ACDC Operations Dashboard
14Verification of Resources
- ACDC Operations Dashboard provides detailed cyclic
testing of all resources
- Tests based upon site-verify tool distributed
with OSG common software, around 30 tests
- Test results output available per test per site
- Five possible results
- No Information
- Pass
- Fail
- Error
- Not Tested
- Resources are grouped into three areas:
Production, Pending or Offline
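The five result states and three resource groupings listed above could be modelled roughly as follows. This is only an illustrative sketch: the names `CheckResult`, `ResourceGroup`, `summarize`, and the sample test names are hypothetical and are not taken from the site-verify tool or the ACDC dashboard.

```python
from collections import Counter
from enum import Enum


class CheckResult(Enum):
    """The five possible outcomes of a single verification test."""
    NO_INFORMATION = "No Information"
    PASS = "Pass"
    FAIL = "Fail"
    ERROR = "Error"
    NOT_TESTED = "Not Tested"


class ResourceGroup(Enum):
    """The three areas a resource can be grouped into."""
    PRODUCTION = "Production"
    PENDING = "Pending"
    OFFLINE = "Offline"


def summarize(results):
    """Count how many tests ended in each result state for one site."""
    counts = Counter(results.values())
    return {state: counts.get(state, 0) for state in CheckResult}


# Hypothetical per-test results for a single site (test names are made up).
site_results = {
    "example-test-a": CheckResult.PASS,
    "example-test-b": CheckResult.FAIL,
    "example-test-c": CheckResult.NOT_TESTED,
}
summary = summarize(site_results)
```

Keeping every state in the summary, including those with a zero count, makes it easy to lay results out per test per site in a fixed set of columns.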
15Agenda
- Grid Operations Development
- OSG Functional Service Cycle
- Deployment
- Integration
- Provisioning
- Production
- OSG Operations Activities
16GOC A Communications Hub
- Grid Operations Center
- Leveraged Coverage from GRNOC
- Abilene, NLR, TransPAC
- OSG Trouble Ticket System
- Trouble Ticket Exchange with Support Centers /
Grids
- Weekly Issue tracking report
- Web Page Development and Documentation
- OSG Production web site www.opensciencegrid.org
- Commonly answered questions Knowledge Base
- Collaborative development information -
osg.ivdgl.org
- OSG Registration Database
17GOC A Service Center
- Grid wide Information Services
- Registration Database Catalog
- Production Software Caches
- OSG Knowledge Base
- OpenScienceGrid web site
- Grid wide Monitoring Services
- GLUE information providers
- ACDC Dashboards
- GridCat
- MonaLisa archive
- Multiple OSG ITB resources
- Small but demonstrative OSG production resource
18GOC Incident Response
- Response has been defined by the Security
Technical Group
- All incidents should be reported to mail aliases
which are monitored by the GOC.
- The GOC maintains a list of local site security
contacts, derived from the registrations.
- The GOC designed and implemented a specialized
mail service for secure correspondence
- A response team leader forms a group to assess,
contain, and report on each incident.
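As a rough illustration of deriving a security contact list from registrations, the sketch below filters registration records down to a site-to-contact mapping. The record fields (`site`, `security_contact`), the function name `security_contacts`, and the sample addresses are assumptions made for this example; they do not reflect the actual schema of the OSG Registration Database.

```python
def security_contacts(registrations):
    """Map each registered site to its security contact, skipping blanks."""
    contacts = {}
    for reg in registrations:
        # Hypothetical field name; a missing or empty contact is skipped.
        contact = reg.get("security_contact", "").strip()
        if contact:
            contacts[reg["site"]] = contact
    return contacts


# Made-up registration records for illustration only.
registrations = [
    {"site": "SITE_A", "security_contact": "security@site-a.example"},
    {"site": "SITE_B", "security_contact": ""},
    {"site": "SITE_C"},
]
contacts = security_contacts(registrations)
```

Sites without a usable contact are left out of the mapping, so a comparison against the full registration list would show which sites still need a security contact on file.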
19GOC Policy Procedures
- Grid Service Change or Upgrade
- Registrations
- Support Centers
- Virtual Organizations
- Resources
- Critical OSG Release Update
20Support Center Registration
21Virtual Organization Registration
22Resource/Service Registration
23Critical Release Update
25OSG Community Support
- How do we support issues that fall outside the
Support Model?
- Open Support Model
- Mailing Lists (OSG-General)
- Knowledge Base and Release Documentation
- Jabber chat room
- Weekly meetings
27GOC Conclusions
- Enables Users and Usage
- Creates a known and usable service environment
- Allows status, monitoring and accounting
- Helps users and applications bridge the gap
between single-use and grid-based resource
utilization
28Thank you!