Title: Open Science Grid: Project Statement
1Open Science Grid Project Statement Vision
- Transform compute and data intensive science
through a cross-domain self-managed national
distributed cyberinfrastructure that brings
together campus and community infrastructure
facilitating the research of Virtual
Organizations at all scales.
2Why the effort is important
Facility (preliminary commitments)
- Sustained growth in the needs of traditional
compute and data intensive science - The steady stream of scientific domains that add
and expand the role of computing and data
processing in their discovery process - Coupled with the administrative and physical
distribution of compute and storage resources and
increase in the size, diversity and scope of
scientific collaborations.
3Goals of the OSG
- Support data storage, distribution computation
for High Energy, Nuclear Astro Physics
collaborations, in particular delivering to the
schedule, capacity and capability needed for LHC
and LIGO science. - Engage and benefit other Research Science of
all scales through progressively supporting their
applications. - Educate train students, administrators
educators. - Operate evolve a petascale Distributed Facility
across the US providing guaranteed
opportunistic access to shared compute storage
resources. - Interface Federate with Campus, Regional, other
national international Grids (including EGEE
TeraGrid). - Provide an Integrated, Robust Software Stack for
Facility Applications, tested on a well
provisioned at-scale validation facility. - Evolve the capabilities offered by the Facility
by deploying externally developed new services
technologies.
4Challenges Sociological and Technical
- Develop the organizational and management
structure of an open consortium that drives such
a CI. - Develop the organizational and management
structure for the project that builds, operates
and evolves such CI. - Maintain and evolve a software stack capable of
offering powerful and dependable capabilities to
the NSF and DOE scientific communities. - Operate and evolve a dependable facility.
Boston University Brookhaven National Laboratory
California Institute of Techology Columbia University
Cornell University Fermi National Accelerator Laboratory
Indiana University Lawrence Berkeley National Laboratory
Rennaisance Computing Institute Stanford Linear Accelerator Center
University of California, San Diego University of Chicago/Argonne National Laboratory
University of Florida
University of Iowa
University of Wisconsin, Madison
http//www.opensciencegrid.org
5Computational Science Here, There and Everywhere
Grid of Grids From Local to Global
Global Research Shared Resources
Software Stack
Software Release Process
6Timeline Milestones (preliminary)
Contribute to Worldwide LHC Computing Grid
LHC Event Data Distribution and Analysis
LHC Simulations
Support 1000 Users 20PB Data Archive
Contribute to LIGO Workflow and Data Analysis
Advanced LIGO
LIGO Data Grid dependent on OSG
LIGO data run SC5
STAR, CDF, D0, Astrophysics
CDF Simulation
CDF Simulation and Analysis
D0 Simulations
D0 Reprocessing
STAR Data Distribution and Jobs
10KJobs per Day
1 Community
1 Community
1 Community
1 Community
1 Community
1 Community
1 Community
1 Community
1 Community
Additional Science Communities
Facility Security Risk Assessment, Audits,
Incident Response, Management, Operations,
Technical Controls
Plan V1
1st Audit
Risk Assessment
Audit
Risk Assessment
Audit
Risk Assessment
Audit
Risk Assessment
Facility Operations and Metrics Increase
robustness and scale Operational Metrics defined
and validated each year.
Interoperate and Federate with Campus and
Regional Grids
VDT and OSG Software Releases Major Release
every 6 months Minor Updates as needed
VDT 1.4.0
VDT 1.4.1
VDT 1.4.2
OSG 0.6.0
OSG 0.8.0
OSG 1.0
OSG 2.0
OSG 3.0
VDT Incremental Updates
dCache with role based authorization
Accounting
Auditing
Federated monitoring and information services
VDS with SRM
Common S/w Distribution with TeraGrid
Transparent data and job movement with TeraGrid
EGEE using VDT 1.4.X
Transparent data management with EGEE
Extended Capabilities Increase Scalability and
Performance for Jobs and Data to meet Stakeholder
needs
Integrated Network Management
SRM/dCache Extensions
Just in Time Workload Management
VO Services Infrastructure
Data Analysis (batch and interactive) Workflow
Improved Workflow and Resource Selection
Work with SciDAC-2 CEDS and Security with Open
Science