Title: The GSI Mass Storage for Experiment Data
1. The GSI Mass Storage for Experiment Data
- DVEE-Palaver GSI Darmstadt
- Feb. 15, 2005
- Horst Göringer, GSI Darmstadt
- H.Goeringer@gsi.de
2. Overview
- different views
- current status
- recent enhancements
- - write cache
- - on-line connection to DAQ
- future plans
- conclusions
3. GSI Mass Storage System
- Gsi mass STORagE system
- gstore
4. gstore storage view
5. gstore hardware view
- 3 automatic tape libraries (ATL)
- (1) IBM 3494 (AIX)
- 8 tape drives IBM 3590 (14 MByte/s)
- ca. 2300 volumes (47 TByte, 13 TByte backup)
- 1 data mover (adsmsv1)
- access via adsmcli, RFIO read
- read cache 1.1 TByte
- StagePool, RetrievePool
6. gstore hardware view
- (2) StorageTek L700 (Windows 2000)
- 8 tape drives LTO2 ULTRIUM (35 MByte/s)
- ca. 170 volumes (32 TByte)
- 8 data movers (gsidmxx), connected via SAN
- access via tsmcli, RFIO
- read cache 2.5 TByte
- StagePool, RetrievePool
- write cache
- ArchivePool 0.28 TByte
- DAQPool 0.28 TByte
7. gstore hardware view
- (3) StorageTek L700 (Windows 2000)
- 4 tape drives LTO1 ULTRIUM (15 MByte/s)
- ca. 80 volumes (10 TByte)
- backup copy of 'irrecoverable' archives ...raw
- mainly for backup of user data (ca. 30 TByte)
8. gstore software view
- 2 major components
- TSM (Tivoli Storage Manager) commercial
- handles tape drives and robots
- data base
- GSI software (ca. 80,000 lines of code)
- C, sockets, threads
- - interface to user (tsmcli / adsmcli, RFIO)
- - interface to TSM (TSM API client)
- - cache administration
9. gstore user view: tsmcli
- tsmcli subcommands
- archive file archive path
- retrieve file archive path
- query file archive path
- stage file archive path
- delete file archive path
- ws_query file archive path
- pool_query pool
- any combination of wildcard characters (*, ?) allowed
- soon: file may contain a list of files (with wildcard chars); see the usage examples below
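Possible invocations, going by the subcommand syntax above; this assumes that 'file' is the (local) file specification, 'archive' the archive name, and 'path' the directory within the archive, and every name below is purely hypothetical:
  tsmcli archive run042*.lmd myexp nov04/raw     (archive all matching local files)
  tsmcli query run042*.lmd myexp nov04/raw       (list the matching entries in the archive)
  tsmcli stage run042*.lmd myexp nov04/raw       (pre-stage to the read cache before analysis)
  tsmcli retrieve run0421.lmd myexp nov04/raw    (copy a single file back from cache or tape)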
10. gstore user view: RFIO
- rfio_fopen
- rfio_fread
- rfio_fwrite
- rfio_fclose
- rfio_fstat
- rfio_lseek
- GSI extensions (for on-line DAQ connection)
- rfio_fendfile
- rfio_fnewfile
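A minimal client sketch for the read path via the calls listed above, assuming they mirror their stdio counterparts (as the naming suggests); the header name, the handle type, and the path syntax are assumptions to be checked against the local GSI RFIO client:

#include <stdio.h>
#include <rfio_api.h>                  /* assumed header name of the RFIO client */

int main(void)
{
    /* hypothetical gstore path; the real URL syntax is site-specific */
    char *path = "rfio://lxgstore.gsi.de/hades/nov04/raw/run0421.lmd";
    char  buf[65536];
    int   n;

    FILE *fp = rfio_fopen(path, "r");  /* handle type may differ in the real client */
    if (fp == NULL) {
        fprintf(stderr, "rfio_fopen failed for %s\n", path);
        return 1;
    }

    /* read in 64 kByte chunks, analogous to fread() */
    while ((n = rfio_fread(buf, 1, sizeof(buf), fp)) > 0) {
        /* ... process n bytes of event data ... */
    }

    rfio_fclose(fp);
    return 0;
}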
11. gstore server view: query
12. gstore server view: archive to cache
13. gstore server view: archive from cache
14. gstore server view: retrieve from tape
15. server view: retrieve from write cache
16. gstore overall server view
17. server view: gstore design concepts
- strict separation of control and data flow
- no bottleneck for data
- scalable in
- capacity (tape and disk)
- I/O bandwidth
- hardware independent
- (as long as TSM supports it)
- platform independent
- unique name space
18. server view: cache administration
- multithreaded servers for read and write cache
- each with own metadata DB
- main tasks
- - lock/unlock files
- - select data movers and file systems
- - collect current info on
- disk space
- soon: data mover and disk load -> load balancing
- - trigger asynchronous archiving
- - disk cleaning
- several disk pools with different attributes
- StagePool, RetrievePool, ArchivePool, DAQPool, ...
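Purely for illustration of the tasks above (not the actual gstore code or metadata layout): a per-file-system record as the cache administration servers might keep it, and the selection of a file system for a new write request; all names are hypothetical:

struct fs_info {
    char data_mover[32];    /* owning data mover, e.g. "gsidm03" (hypothetical)   */
    char mount_point[128];  /* file system belonging to one of the pools          */
    long free_mbyte;        /* free disk space, collected periodically            */
    int  n_writers;         /* current write processes, input for load balancing  */
};

/* pick the file system with the most free space for a new write request */
int select_fs(const struct fs_info *fs, int n)
{
    int i, best = 0;
    for (i = 1; i < n; i++)
        if (fs[i].free_mbyte > fs[best].free_mbyte)
            best = i;
    return best;
}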
19. usage profile: batch farm
- batch farm: 120 dual-processor nodes
- -> highly parallel mass storage access (read and write)
- read requests
- 'good' user: stages all files beforehand
- uses wildcard chars
- 'bad' user: reads lots of single files from tape
- 'bad' system: stage disk/DM crashes during analysis
- write requests
- via write cache
- distribute as uniformly as possible
20. usage profile: experiment DAQ
- several continuous data streams from DAQ
- keep same DM during lifetime of data stream
- only via RFIO
- GSI extensions necessary
- rfio_fendfile, rfio_fnewfile
- disks emptied faster than filled
- network -> disk: 10 MByte/s
- disk -> tape: 30 MByte/s
- -> time to stage for on-line analysis
- enough disk buffer necessary in case of problems
- (robot, TSM, ...)
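A sketch of the write path just described: one long-lived RFIO stream per DAQ data stream, staying on its data mover and switching to the next file in the write cache when the current one is full. rfio_fendfile() / rfio_fnewfile() are the GSI extensions named above, but the signatures, the header name, and the path syntax used here are assumptions and must be checked against the GSI RFIO client:

#include <stdio.h>
#include <rfio_api.h>                  /* assumed header name of the RFIO client */

extern int next_event(char *buf, int maxlen);  /* hypothetical DAQ event source */

void stream_writer(void)
{
    char buf[32768];
    char name[64];
    int  nbytes, nfile = 1;
    long written = 0;
    long limit = 1500L * 1024 * 1024;  /* arbitrary per-file limit (~1.5 GByte) */

    /* hypothetical path; the connection (and thus the DM) is kept for the stream */
    FILE *fp = rfio_fopen("rfio://lxgstore.gsi.de/daq/run042_0001.lmd", "w");
    if (fp == NULL)
        return;

    while ((nbytes = next_event(buf, sizeof(buf))) > 0) {
        rfio_fwrite(buf, 1, nbytes, fp);
        written += nbytes;
        if (written >= limit) {                     /* current file is full       */
            rfio_fendfile(fp);                      /* assumed: close this file   */
            sprintf(name, "run042_%04d.lmd", ++nfile);
            rfio_fnewfile(fp, name);                /* assumed: continue, same DM */
            written = 0;
        }
    }
    rfio_fclose(fp);
}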
21. current plans: new hardware
- more and safer disks
- write cache: all RAID
- 4 TByte (ArchivePool, DAQPool)
- read cache: 7.5 TByte, new RAID
- StagePool, RetrievePool
- new pools, e.g. with longer file lifetime
- 5 new data movers
- new fail-safe entry server
- hosts query server, cache administration servers
- -> query performance!
- take-over in case of host failure
- metadata DBs mirrored on 2nd host
22. current plans: merge tsmcli / adsmcli
- new command gstore
- replaces tsmcli and adsmcli
- unique name space (already available)
- users need not care in which robot data reside
- new archive policy of the computing center
23. brief excursion: future of IBM 3494?
- still heavily used
- rather full
- hardware highly reliable
- should be decided this year!
24. usage IBM 3494 (AIX)
25. brief excursion: future of IBM 3494?
- 2 extreme options (and more in between)
- no further money investment
- use as long as possible
- in a few years move data to other robot
- upgrade tape drives and connect to SAN
- 3590 (30 GB, 14 MB/s) -> 3592 (300 GB, 40 MB/s)
- new media: > 700 TByte capacity
- access with available data movers via SAN
- new fail-safe TSM server (Linux?)
26. current plans: load balancing
- acquire current info on the number of read/write processes
- for each disk, data mover, pool
- new write request
- select resource with lowest load
- new read request
- avoid 'hot spots'
- -> create additional instances of the staged file
- new option '-randomize' for stage/retrieve
- distribute equally over different data movers / disks
- split into n (parallel) jobs (see the sketch below)
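Purely illustrative (not gstore code): the effect of the planned '-randomize' option, spreading the files of one stage/retrieve request evenly over the available data movers / disks instead of hitting a single one:

#include <stdlib.h>

/* submit() stands for handing one file to one data mover; both are hypothetical */
void distribute(char **files, int nfiles, int ndm,
                void (*submit)(int dm, char *file))
{
    int i;
    int start = rand() % ndm;                 /* random start avoids 'hot spots'  */
    for (i = 0; i < nfiles; i++)
        submit((start + i) % ndm, files[i]);  /* round-robin over the data movers */
}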
27. current plans: new org. of DMs
- Linux platform
- more familiar environment (shell scripts, Unix commands, ...)
- case-sensitive file names
- current mainstream OS for experiment data processing
- '2nd level' data movers
- no SAN connection
- disks filled via ('1st level') DMs with SAN connection
- for stage pools with guaranteed lifetime of files
28. current plans: new org. of DMs
- integration of selected group file servers
- as '2nd level' data movers
- disk space (logically) reserved for owners
- pool policy according to owners
- many advantages
- no NFS -> much faster I/O
- files physically distributed over several servers
- load balancing of gstore
- disk cleaning
- disadvantages
- only for exp. data, access via gstore interface
29. current plans: user interface
- a large number of user requests
- - longer file names
- - option to rename files
- - more specific return codes
- - ...
- program code consolidation
- improved error recovery after HW failures
- support for successor of AliEn
- GRID support
- - gstore as Storage Element (SE)
- - Storage Resource Manager (SRM)
- -> new functionalities, e.g. reserve resources
30. Conclusions
- GSI concept for mass storage successfully verified
- hardware and platform independent
- scalable in capacity and bandwidth to keep up with
- - requirements of future batch farm(s)
- - data rates of future experiments
- gstore able to manage very different usage profiles
- but still a lot of work ...
- to fully reach all discussed plans