Title: A SelfManageable Infrastructure for Supporting Webbased Simulations
1A Self-Manageable Infrastructure for Supporting
Web-based Simulations
- Yingping Huang
- Xiaorong Xiang
- Gregory Madey
- Computer Science Engineering
- University of Notre Dame
- Sponsored by NSF/ITR-DEB
2Outline
- Introduction
- Autonomic Computing
- Web-based Simulations
- Self-manageable infrastructure
- Self-Configuring
- Self-Healing
- Self-Optimizing
- Self-Protecting
- Conclusion and future work
3Autonomic Computing
- Motivation
- Whats next? A dozen information technology
research goals (J. Gray, Microsoft Research) - Goal
- The SysAdmin sets system goals and high level
polocies - System takes care of itself
http//www.ibm.com/autonomic
4Autonomic Computing (cont)
- Self-Configuring
- New simulations, new simulation servers
- Self-Healing
- Completed simulations
- Self-Optimizing
- Efficient usage of system resources
- Self-Protecting
- No unauthorized access
5Web-based Simulations
- Features of Web-based simulations
- Simulations run on the simulation servers
- Simulation data is downloadable for users
- Simulation reports are generated dynamically
- Simulation status is sent to users by email
- Collaboration among users
- Challenges
- Reliability
- Availability
- Efficiency
- Security
6Motivation NOMSIM
- Simulate natural organic matter (NOM) evolution
behavior - Agent-based stochastic simulation method
- Multi-disciplinary project that involves
chemists, biologists, environmental scientists,
geologists and computer scientists - Collaboration is essential
- Funded by NSF-ITR
7Outline
- Introduction
- Autonomic Computing
- Web-based Simulations
- Self-manageable infrastructure
- Self-Configuring
- Self-Healing
- Self-Optimizing
- Self-Protecting
- Conclusion and future work
8The Infrastructure
9Features of the Infrastructure
- Scalability
- Web server tier new application servers can be
added to the balanced application server cluster - Simulation server tier can be scaled almost
linearly by installing new simulation servers
running identical simulations - Database server tier real application cluster
(RAC) enables all active instances executing
transactions against a shared database
10Features of the Infrastructure (cont)
- Availability
- Web server tier eliminates single point of
failure by redundancy and failover, and session
state is maintained in the database server tier - Simulation server tier simulation checkpointing
and resuming - Database server tier eliminates single point of
failure by redundancy and failover
11Simulation Metadata
ltsimulation name"nomsim"gt ltdb_urlgt
lturlgtjdbcoraclethinusername/password_at_hostnamep
ortsidlt/urlgt ltusernamegtdbusernamelt/username
gt ltpasswordgtdbpasswordlt/passwordgt
lt/db_urlgt ltinput_partgt ltinput
name"time" type"number" /gt ltinput
name"temperature" type"number" /gt ltinput
name"granted" type"char(1)" /gt ltinput
name"molecule_name" type"varchar2(50)" /gt
lt/input_partgt lt/simulationgt
12Simulation Manager and Intelligent Agents
- One intelligent agent runs on one simulation
server - Functionalities of intelligent agents
- Register new simulation servers to simulation
manager - Reports metrics of simulation servers to
simulation manager - Deploy new simulation models
- Check for simulation jobs
- Transport data from simulation servers to
database servers - Cancel simulation jobs as directed by the
simulation manager - Functionalities of simulation manager
- Dispatch and manage simulation jobs
- Notify users simulation job status
13Self-Configuring
- ltsimulation name"nomsim"gt
- ltdb_urlgt
- lturlgtjdbcoraclethinusername/password_at_host
nameportsidlt/urlgt - ltusernamegtdbusernamelt/usernamegt
- ltpasswordgtdbpasswordlt/passwordgt
- lt/db_urlgt
- ltinput_partgt
- ltinput name"time" type"number" /gt
- ltinput name"temperature" type"number" /gt
- ltinput name"granted" type"char(1)" /gt
- ltinput name"molecule_name"
type"varchar2(50)" /gt - lt/input_partgt
- lt/simulationgt
HTML form
JSP Code
JavaScript Form Validation
Database Table
14Self-Configuring (cont)
- On simulation servers
- Intelligent agents must run
- Install simulation software
- (To simplify the simulation software is
installed on an NFS server and which is mounted
on the simulation servers) - On simulation manager
- Email masquerading
15Self-Healing
- Self-Healing Web servers
- Clustered application server instances
- Automatic recovery of failed instance
- Self-Healing simulation servers
- Simulation checkpointing
- Simulation resuming
- Self-Healing database servers
- Clustered database instances
- Automatic recovery of failed instance
- Raid 01
16Self-Healing (cont)
RDBMS
Checkpointing
Simulation
Resuming
Simulation Server Tier
Database Server Tier
17Self-Protecting
- Role based access control
- Public
- Owner
- Grant
- Firewall
- Port scan
- IPTABLES
- Log messages scanning
- Network traffic monitoring, Intrusion Detection
(Future work)
18Self-Optimizing
- Self-Optimizing web server tier
- Load balanced application server cluster
- Self-Optimizing database servers
- Database parameter self-tuning
- Online index rebuilding
- Summary and aggregation
19Self-Optimizing (cont)
Time
DBMS
Migrate
Checkpoint
Simulation Server 1
Simulation Server 2
20Implementation of self-
- Tools
- IBMs ABLE (agent building and learning
environment) - Oracle Data Mining
- Unix Crontab
- Languages
- Java
- SQL and PL/SQL
- Bourne shell scripts
21Outline
- Introduction
- Autonomic Computing
- Web-based Simulations
- Self-manageable infrastructure
- Self-Configuring
- Self-Healing
- Self-Optimizing
- Self-Protecting
- Conclusion and future work
22Conclusions and Future Work
- Conclusions
- Self-Manageable infrastructure
- Intelligent agents
- Simulation manager
- Future work
- Applying data mining
- Intelligent agents
- Proactive critical event prediction
- Job completion time prediction
23Questions?
Thank You