Information Systems describing resources

About This Presentation

Title:

Information Systems describing resources

Description:

High when large number of aps blocking on I/O ... test bed size is only 7 machines and 10 clients. local cluster, i.e. latency is well controlled ... – PowerPoint PPT presentation

Number of Views:52

Avg rating:3.0/5.0

Slides: 78

Provided by: Margri4

Category:

more less

Transcript and Presenter's Notes

Title: Information Systems describing resources

1
Information Systemsdescribing resources

Grid Middleware 4
David Groep, lecture series 2005-2006

2
Outline

Taxonomy of information systems
hierarchies and republishers
Grid Monitoring Architecture
push and pull, subscriptions
Performance of an IS
collecting information
sensors
IS content schemas and approaches

3
Grid Information Systems

Concerns data
shared between administrative domains
for use by multiple people or VOs
So it does not include things like
cluster temperature monitoring
debugging streams
accounting history

4
Classification of information systems

Which monitoring systems types are suitable for
grid?
Paper
http//www.cs.man.ac.uk/zanikols/fgcs05.pdf
Different types are
Level 0
self-contained not accessible by programs (but
only e.g. web)
Level 1
events are accessible remotely at the single
producer level
Level 2
includes republishers with fixed functionality
Level 3
supports hierarchies of republishers

5
System taxonomy levels of systems

Components used in information systems
and taxonomy levels

graphics and concept from S. Zanikolas et al.,
FGCS 21 (2005) 163-188
6
Information system classes

Level 2 or 3 system are suitable
Reference architecture GMA
Grid Monitoring Architecture requirements
(performance) information with relatively short
lifetime
frequent updates
(should) carry quality-of-information status as
well
but when you get down to it, almost anything
fits in this architecture
including directories with
relatively static information
suitable mainly for resource state

7
Grid Monitoring Architecture

Definition of terms and roles (GWD-GP-16-2)
Functions
Registry (directory)
Add, Update, Remove, Search
Producer
Maintain Registration, Accept Query, Accept
(Un)subscribe, Locate Consumer, Notify,
Initiate (Un)subscribe
Consumer
Locate Producer, Initiate Query, (Un)subscribe,
Maintain Registration, Accept Notification,
(Un)subscribe, Locate Event Schema

8
GMA Intermediaries

Also referred to as republishers
make it a level-3 system
Examples
Latest Producer
return the last value of an event
Archiver (history producer)
storage of historical monitoring data
e.g. accounting records

9
Directories

Information providers publish information to a
directory
Directories may be linked in networked
hierarchies
Information is usually also in a DIT-like
structure(Directory Information Tree)
Typical implementation LDAP

10
Approaches to sending information

Orthogonal to the topology is the information
flow model
Push model
information gets published regardless of its use
bet its there (in higher-level aggregators) when
its needed
e.g. Condor Hawkeye, LCG BDII
Hybrid
information location gets published
consumers can subscribe to information and from
then on continuously get it
e.g. R-GMA, (MDS4?)
Pull model
information is retrieved on-demand, and you
cannot subscribe
e.g. MDS-2

11
Information Systems

Examples shown in this lecture
Monitoring and Discovery Service (MDS)
Relational Grid Monitoring Arch (R-GMA)
Hawk eye
Berkeley-DataBase Information Index (BDII)

12
1 MDS2

Part of GT2.x
Typical use resource selection by brokers
Architecture
decentralized
hierarchical
soft-state protocols with timeouts
supports caching in index servers
Security GSI (optional)

13
MDS2 Architecture
graphic J. Schopf, GFNL masterclass 2005
Distributed Monitoring and Information Services
for the Grid
14
MDS2 information flow

Soft-state registration of GRISes with GIISes
time out on the registration (TTL and nextUpdate)
Data retrieved on-demand from underlying GRIS
timeout on the answer
resources silently drop out if they fail
GRISes collect information using scripts
GIISes can be collated in arbitrary hierarchies

15
2 R-GMA

straight implementation of the GMA
uses a relational representation of the data
notification/subscription directly from the
source
implementation in Java
developed in EU DataGrid and EGEE JRA1
UK cluster, Steve Fisher (RAL), et al.

16
R-GMA Archirecture
17
MON Box

Every site has a MON box to proxy information
local cache of info in memory
through-channel to systems behind a firewall
producers/consumers connect actively to the MON
box
Multiple producers can publish in the same table
joins can be done, but only via a secondary
producer
Usually deployed with a single registry

18
R-GMA plain SQL interface

bosuidavidg1001 rgma
Welcome to the R-GMA virtual database for Virtual
Organisations.
Your local R-GMA server is
https//eg.nikhef.nl8443/R-GMA
You are connected to the following R-GMA Registry
services
https//lcgic01.gridpp.rl.ac.uk8443/R-GMA/Regis
tryServlet
You are connected to the following R-GMA Schema
service
https//lcgic01.gridpp.rl.ac.uk8443/R-GMA/Schem
aServlet
Type "help" for a list of commands.
rgmagt show tables
------------------------------------------
Table Name
------------------------------------------
ArchiverTestTable
...
GlueCE
...

19
Queries

rgmagt select UniqueID,Name,TotalCPUs from GlueCE
WHERE UniqueID LIKE 'ulakbim'
-------------------------------------------------
---------------------
UniqueID
Name TotalCPUs
-------------------------------------------------
---------------------
ce.ulakbim.gov.tr2119/jobmanager-lcgpbs-seegrid
seegrid 126
ce.ulakbim.gov.tr2119/jobmanager-lcgpbs-trgrida
trgrida 126
ce.ulakbim.gov.tr2119/jobmanager-lcgpbs-lhcb
lhcb 126
...

20
3 Hawkeye

Condor information system
publishes class-ads for
matchmaking
fault detection
periodic updates to the agents by the modules
information kept in the agents

21
Hawkeye architecture
graphic J. Schopf, GFNL masterclass 2005
Distributed Monitoring and Information Services
for the Grid
22
4 BDII GIP

BDII conceptually similar to Hawkeye
but data is pulled rather than pushed
mentioned here because of its wide-spread
deployment in EGEE/LCG, OSG, c
Generic Information Providers (GIP)
scripting framework to produce LDIF
static values overridden by output from scripts
periodically, LDAP queries sent to subordinate
directories
with time-out on the answer
previous answer is persistent for a defined
amount of time
contrary to MDS2, BDII will never forget
Paperhttp//indico.cern.ch/materialDisplay.py?co
ntribId126sessionId23materialIdpaperconfId0

23
BDII organisation
BDII
Site BDII
24
BDII scaling

OpenLDAP update (write) is not optimized
with SleepyCat Berkeley DB, simultaneous
read/write lead to timeouts
So, put in a forwarder service that redirects to
a pool of OpenLDAP/DB backends that swap roles

25
WS style information systems

MDS4
based on WS-RF, WS-Notification mechanisms
provides a common aggregator framework for
index service (republisher)
trigger service (send events, mails, execute
programs)
archive service
NAREGI Distributed Information Service
Aggregator collect information from various
sources
put these as CIM objects in a database
OGSA-DAI front-end to the database with CIM
objects
PS OGSA-DAI (Data Access Integration) is a
system for providing uniform grid access to
database resources

26
MDS4 Aggregator Framework
27
NAREGI Distributed Information Service
graphicSatoshi Matuoka, Tokyo Institute of
Technology NII, NAREGI
28
Status

Both developed and available
neither been tested yet at the very large scale
i.e. O(1000) resources, thousands of simultaneous
queries

29
Hierarchies and Views
30
Views on the information system

For resource information
information view on those resources to which the
viewer potientially has access
a single global root is neither feasible nor
needed
a per-VO or per-infrastructure view is sufficient
For application level monitoring
fine-grained access control needed
at the VO or user level
attributes in the schema may have different
privacy levels
requires view management like in regular databases

31
Typical hierarchical top levels today

per-infrastructure
e.g. EGEE/LCG, OSG, NAREGI
used by many VOs
needs support at the infrastructure level
per-VO view
prevalent in grass-roots deployment
all systems can support both
although not all in the same wayR-GMA works
with per-site mon boxes that (today) use a
central registry -gt one per infrastructure

32
Performance

an example of a grid performance study

33
Performance analysis

Best paper so far X. Zhang, J. Freschl, J.
Schopf, A performance study of monitoring and
information services for distributed systems,
in Proceedings of the 12th IEEE High
Performance Distributed Computing (HPDC-12 2003),
IEEE Computer Society Press, Seattle, WA, USA,
2003, pp. 270282.
Perf results on R-GMA are outdated, but basics
still do hold
MDS2 has since been replaced with MDS4 (in GT4)
The three systems selected are indicative of the
different classes, and thus its a very valuable
comparison!
Data in the next slides by Jennifer Schopf
from the GridForum NL/ISOC NL Masterclass 2005

34
Roles of components in the comparison
ideas, graphics, results J. Schopf, GFNL
masterclass 2005 Distributed Monitoring and
Information Services for the Grid
35
Performance analysis

Three characteristics systems
MDS2 (pull system, with and without caching)
R-GMA (hybrid, straight GMA implementation
w/Relational IF)
Hawkeye (push system, from Condor)
Tests done on a small test bed (7 systems)
scaling has not been tested
but results are at least comparable

ideas, graphics, results J. Schopf, GFNL
masterclass 2005 Distributed Monitoring and
Information Services for the Grid
36
Performance analysis other facts

Keep in mind that MDS2 Hawkeye are programmed
in C
R-GMA is in Java
This R-GMA version relied heavily on threads
i.e. implementation was straight translation of
architecture
JVM and Linux kernel 2.4 dont like too many
O(500) threads

37
Model for evaluation

paper attempts to compare similar properties in
the three systems
deploy in a standard mode (as depicted)

ideas, graphics, results J. Schopf, GFNL
masterclass 2005 Distributed Monitoring and
Information Services for the Grid
38
Experiments in Zhang et al.

How many users can query an information server at
a time?
How many users can query a directory server?
How does an information server scale with the
amount of data in it?
How does an aggregator scale with the number of
information servers registered to it?

ideas, graphics, results J. Schopf, GFNL
masterclass 2005 Distributed Monitoring and
Information Services for the Grid
39
Experiments
ideas, graphics, results J. Schopf, GFNL
masterclass 2005 Distributed Monitoring and
Information Services for the Grid
40
Comparing Information Systems

We also looked at the queries in depth -
NetLogger
3 phases
Connect, Process, Response

Response
Process
Connect
ideas, graphics, results J. Schopf, GFNL
masterclass 2005 Distributed Monitoring and
Information Services for the Grid
41
Testbed

Lucky cluster at Argonne
7 nodes, each has two 1133 MHz Intel PIII CPUs
(with a 512 KB cache) and 512 MB main memory
Users simulated at the UC nodes
20 P3 Linux nodes, mostly 1.1 GHz
R-GMA has an issue with the shared file system,
so we also simulated users on Lucky nodes
All figures are 10 minute averages
Queries happening with a one second wait between
each query (think synchronous send with a 1
second wait)

ideas, graphics, results J. Schopf, GFNL
masterclass 2005 Distributed Monitoring and
Information Services for the Grid
42
Metrics

Throughput
Number of requests processed per second
Response time
Average amount of time (in sec) to handle a
request
Load
percentage of CPU cycles spent in user mode and
system mode, recorded by Ganglia
High when running small number compute intensive
aps
Load1
average number of processes in the ready queue
waiting to run, 1 minute average, from Ganglia
High when large number of aps blocking on I/O

ideas, graphics, results J. Schopf, GFNL
masterclass 2005 Distributed Monitoring and
Information Services for the Grid
43
Information Server Throughputvs. Number of Users
(Larger number is better)
ideas, graphics, results J. Schopf, GFNL
masterclass 2005 Distributed Monitoring and
Information Services for the Grid
44
Query Times
400 users
50 users
(Smaller number is better)
ideas, graphics, results J. Schopf, GFNL
masterclass 2005 Distributed Monitoring and
Information Services for the Grid
45
Experiment 1 Summary

Caching can significantly improve performance of
the information server
Particularly desirable if one wishes the server
to scale well with an increasing number of users
When setting up an information server, care
should be taken to make sure the server is on a
well-connected machine
Network behavior plays a larger role than
expected
If this is not an option, thought should be given
to duplicating the server if more than 200 users
are expected to query it

ideas, graphics, results J. Schopf, GFNL
masterclass 2005 Distributed Monitoring and
Information Services for the Grid
46
Directory Server Throughput
(Larger number is better)
ideas, graphics, results J. Schopf, GFNL
masterclass 2005 Distributed Monitoring and
Information Services for the Grid
47
Directory Server CPU Load
(Smaller number is better)
ideas, graphics, results J. Schopf, GFNL
masterclass 2005 Distributed Monitoring and
Information Services for the Grid
48
Query Times
400 users
50 users
(Smaller number is better)
ideas, graphics, results J. Schopf, GFNL
masterclass 2005 Distributed Monitoring and
Information Services for the Grid
49
Experiment 2 Summary

Because of the network contention issues, the
placement of a directory server on a highly
connected machine will play a large role in the
scalability as the number of users grows
Significant loads are seen even with only a few
users, it will be important that this service be
run on a dedicated machine, or that it be
duplicated as the number of users grows.

ideas, graphics, results J. Schopf, GFNL
masterclass 2005 Distributed Monitoring and
Information Services for the Grid
50
Information Server Scalabilitywith Information
Collectors
(Larger number is better)
ideas, graphics, results J. Schopf, GFNL
masterclass 2005 Distributed Monitoring and
Information Services for the Grid
51
Experiment 3 Load Measurements
(Smaller number is better)
ideas, graphics, results J. Schopf, GFNL
masterclass 2005 Distributed Monitoring and
Information Services for the Grid
52
Experiment 3 Query Times
80 Info Collectors
30 Info Collectors
(Smaller number is better)
ideas, graphics, results J. Schopf, GFNL
masterclass 2005 Distributed Monitoring and
Information Services for the Grid
53
Sample Query
Note log scale
ideas, graphics, results J. Schopf, GFNL
masterclass 2005 Distributed Monitoring and
Information Services for the Grid
54
Experiment 3 Summary

The more the data is cached, the less often it
has to be fetched, thereby increasing throughput
Search time isnt significant at these sizes

ideas, graphics, results J. Schopf, GFNL
masterclass 2005 Distributed Monitoring and
Information Services for the Grid
55
Aggregate Information Server Scalability
(Larger number is better)
ideas, graphics, results J. Schopf, GFNL
masterclass 2005 Distributed Monitoring and
Information Services for the Grid
56
Load
ideas, graphics, results J. Schopf, GFNL
masterclass 2005 Distributed Monitoring and
Information Services for the Grid
57
Query Response Times
400 Info Servers
50 Info Servers
(Smaller number is better)
ideas, graphics, results J. Schopf, GFNL
masterclass 2005 Distributed Monitoring and
Information Services for the Grid
58
Experiment 4 Summary

None of the Aggregate Information Servers scaled
well with the number of Information Servers
registered to them
When building hierarchies of aggregation, they
will need to be rather narrow and deep having
very few Information Servers registered to any
one Aggregate Information Server.

ideas, graphics, results J. Schopf, GFNL
masterclass 2005 Distributed Monitoring and
Information Services for the Grid
59
Overall Results

Performance can be a matter of deployment
Effect of background load
Effect of network bandwidth
Performance can be affected by underlying
infrastructure
LDAP/Java strengths and weaknesses
Performance can be improved using standard
techniques
Caching multi-threading etc.

ideas, graphics, results J. Schopf, GFNL
masterclass 2005 Distributed Monitoring and
Information Services for the Grid
60
Observations on the performance study

Measures performance, not stability
test bed size is only 7 machines and 10 clients
local cluster, i.e. latency is well controlled
In a real-life deployment, complexity is
determining factor in success
simple systems are more likely to survive
systems with soft-state registration timeouts
(like MDS) are more prone to instabilities than
systems based on a persistent elephant-style
memory (like BDII) (c.f. sypical signal
processing issues)

61
Assorted Issues
62
Access Control

AuthN is simple
keeps out 99 of the rogue information
doesnt do a bit for privacy preservation
not every cert owner is a grid user for a
specific infrastructure
course grained ACLs better
grid-mapfile, ACLs on access to service
keeps known bad guys out
still no privacy
fine-grained acls
support within the DB engine is actually
requiredas its too hard to retro-fit otherwise

63
Timeout issues

differences in timeouts in information providers
lead to phase difference effects in the system
temporary amnesia of aggregate information
indices
cumulative delays
Timeouts
registration period for GRIS with a GIIS
time to bind to GRIS (defines if a resource is up
or down)
time to produce information entries
cache TTL in the GIIS
timeout before removing stale information from
GIIS
essentially its feed-back signal theory -)

64
Content of the Information System
65
Approaches to resource information

Resource description
GLUE
CIM
and similar but slightly different schemas for
ARC and GT2
Job description
Unicores AJO

66
Information Schemas GLUE

Describes resource availability information
Common for various middleware suites
Known limitations
not even all specified info is actually used
contains lots of info that are un-used
cannot express information needed for brokering
at the appropriate granularity level (this is
fundamental for all such information schemas)
More specifics discussed with each component
See http//infnforge.cnaf.infn.it/glueinfomodel/

67
Glue Abstractions

Core Entities
Site name, contact info, latitude/longitude,
sponsor
Service type, version, endpoint, status, WDSL
URL, Semantics URN, StartTime
Cluster
ComputingElement Info, State, Policy, ACBRule,
VOView ACBRule, Running, Waiting, Free, ERT,
WRT,
SubCluster HostOperatingSystem, HostAppSWRTEnv,
StorageElement

68
GLUE Core Schema
69
GLUE Cluster
70
GLUE Storage
71
GLUE Linking compute and storage

Useful is storage is accessible via POSIX, or via
faster networks
position of such a binding is difficult
abused for pure-SE info as this is the only place
where the file path to the storage was specified

72
Alternative schemas with the same viewpoint

Original GT2 schema (obsolete)
NorduGrid ARC

73
CIM Common Information Model

object oriented abstraction of information (DMTF)
uses abstractions, dependencies, inheritance
goes beyond a mere information model by
defining methods for standard object behaviour
trying to solve every possible problem (and solve
the perpetuum mobile issue in the process )
information components of CIM can use used to
represent resources

74
Common Information Model (CIM)

Object-oriented schema developed by the DMTF
representation in different formats (such as XML)
See http//www.dmtf.org/standards/cim/
Extended for grid elements by the GCS-WG
BatchService, c
The NAREGI grid is main user of this system

75
Example CIM Job Submission Interface
76
The Unicore information model

Describe the resource requests(so opposite
viewpoint compared to GLUE)
the resources themselves need not be described,
since they will bid on the job requests
we will deal with this one in the Brokering CE
lecture

77
Summary

Information systems used across multiple
organisations and by multiple people or VOs
taxonomy classiciation (republishing data flow)
Any grid information system needs
programmatic access via producer/consumer APIs
compositional IS freedom (VO or infrastructure
hierarchies)
focus has been on resource selection
used for brokering decisions, either by people or
programs
needs a common information schema or translators
for application-level information systems
user-defined schema and a schema registry (like
R-GMA)