Best Ever Alarm System Tool - PowerPoint PPT Presentation

About This Presentation
Title:

Best Ever Alarm System Tool

Description:

... contacting one of the 2 experts, edit correct config files, restarts: ... For now without BEAUtY. DESY AMS is similar and has been. operational for longer ... – PowerPoint PPT presentation

Number of Views:74
Avg rating:3.0/5.0
Slides: 38
Provided by: xnv5
Category:
Tags: alarm | best | system | tool

less

Transcript and Presenter's Notes

Title: Best Ever Alarm System Tool


1
Best Ever Alarm System Tool
  • Xihui Chen,
  • Katia Danilova,
  • Kay Kasemir
  • SNS/ORNL
  • kasemirk_at_ornl.gov
  • April 2009

2
Previous Attempts
  • First ALH,then soft-IOCs and EDM generated from
    ALH config. (Pam Gurd)
  • GUI
  • Static Layouts
  • N clicks to see (some of the) active alarms
  • Configuration
  • .. was bad ? Always too many alarms
  • Operator guidance?
  • Related displays?
  • Changes required contacting one of the 2 experts,
    edit correct config files, restarts seldom
    happened
  • Info
  • Most frequent alarm?
  • Timeline of alarm?

3
New End-User View Alarm Table
  • All currentalarms
  • new, acked
  • Sort by PV,Descr., Time, Severity,
  • Optional Annunciate or Enunciate
  • Acknowledge one or multiple alarms
  • Select by PV or description
  • BNL/RHIC type un-ack

4
Another View Alarm Tree
  • All alarms
  • Disabled, inactive, new, acked
  • Hierarchical
  • Optionally only showactive alarms
  • Ack/Un-ack PVs or sub-tree

5
Guidance, Related Displays, Commands
  • Basic Text
  • Start EDM screen
  • Open web page
  • Run ext. command
  • HierarchicalIncluding info of parent entries
  • Merges Guidance etc. from all selected alarms

6
.. Within CSS
  • Alarms
  • History of PV
  • EPICS Config.

7
CSS Context Menus Connect the Tools
Send alarmPV to anyother CSSPV tool
8
Convenient E-Log Entries
  • Logbookfrom context menucreates text w/basic
    info aboutselected alarms.Edit, submit.
  • Pluggable implementation, not limited to
    Oracle-based SNS ELog

9
Online Configuration Changes
  • .. optionally w/ Authentication/Authorization
  • Log in/out while CSS is running

10
Add PV or Subsystem
  • Right-click on parent
  • Add
  • Enter name
  • Online. No search for config files, no restarts.

11
Configure PV
  • Again online
  • Especially usefulfor operators
  • update guidance,related screens.

12
Logging
  • ..into generic CSS log also used for
    error/warn/info/debug messages
  • Alarm Server State transitions, Annunciations
  • Alarm GUI Ack/Un-Ack requests, Config changes
  • Generic Message History Viewer
  • Example w/ Filter on TEXTCONFIG

13
Logging Get timeline
  • Example Filter on TYPE, PV

6. All OK
4. Problem fixed
5. Acked by operator
3. Alarm Server annunciates
1. PV triggers,clears, triggers again
2. Alarm Server latches alarm
14
Various Web Reports
15
Technical View
IOCs
PV Updates (Channel Access, )
Alarm Server Current Alarms Acknowledged?
Transient? Annunciated?
  • Tomcat
  • Reports

Alarm Updates
Log Messages
Ack Config Updates
Annunciations
Alarm Cfg State RDB
JMS

ALARM_CLIENT
ALARM_SERVER
LOG
TALK
JMS2Speech
JMS2RDB
Alarm Client GUI
MessageRDB
CSS Applications
16
Alarm Server Behavior Similar to ALH
  • Latch highest severity, or non-latching
  • like ALH ack. transient
  • Chatter filter ala ALH
  • Alarm only if severity persists some minimum time
  • .. or alarm happens gtN times within period
  • Annunciation (or Enunciation, or both)
  • Optional formula-based alarm enablement
  • Enable if (pv_x gt 5 pv_y lt 7) pv_z1
  • but we prefer to move that logic into IOC
  • When acknowledging MAJOR alarm, subsequent MINOR
    alarms not annunciated
  • ALH would again blink/require ack

17
Best Ever Alarm System Tools, Indeed
  • .. but Tools are only half the issue
  • Good configuration requires plan follow-up.
  • B. Hollifield, E. Habibi,"Alarm Management
    Seven (??) Effective Methods for Optimum
    Performance", ISA, 2007

18
Alarm Philosophy
  • Goal Help operators take correct actions
  • Alarms with guidance, related displays
  • Manageable alarm rate (lt150/day)
  • Operators will respond to every alarm(corollary
    to manageable rate)

19
Whats a valid alarm?
  • DOES IT REQUIRE IMMEDIATE OPERATOR ACTION?
  • What action? Alarm guidance!
  • Not make elog entry, tell next shift,
  • Consider consequence of no action
  • Is it the best alarm?
  • Would other subsystems, with better PVs, alarm at
    the same time?

20
How are alarms added?
  • Alarm triggers PVs on IOCs
  • But more than just setting HIGH, HIHI, HSV, HHSV
  • HYST is good idea
  • Dynamic limits, enable based on machine state,...
  • Requires thought, communication, documentation
  • Added to alarm server with
  • Guidance How to respond
  • Related screen Reason for alarm (limits, ),
    link to screens mentioned in guidance
  • Link to rationalization info (wiki)

21
Impact/Consequence Grid
Category So What Minor Consequence Major Consequence
Personnel Safety PPS independent from EPICS?
Environment, Public Can EPICS cause contained spill of mercury? Uncontained spill??
Cost Beam Production, Downtime,Beam Quality No effectBeam off lt 1 sec? Beam off lt10 min lt10000 Beam off gt10min gt10000
  • Mostly How long will beam be off?

22
.. combined with Response Time
Time to Respond Minor Consequence Major Consequence
gt30 Minutes NO_ALARM MINOR
10..30 minutes MINOR MAJOR
lt10 minutes MAJOR MAJOR Annunciate
  • This part is still evolving

23
Example Elevated Temp/Press/Res.Err./
  • Immediate action required?
  • Do something to prevent interlock trip
  • Impact, Consequence?
  • Beam off Reset OK, 5 minutes?
  • Cryo cold box trip Off for a day?
  • Time to respond?
  • 10 minutes to prevent interlock?
  • ?
  • MINOR? MAJOR?
  • Guidance Open Valve 47 a bit,
  • Related Displays Screen that shows Temp, Valve,

24
Safety System Alarms
  • Protection Systems not per se high priority
  • Action is required, but were safe for now, it
    wont get worse if we wait
  • Pick One
  • Mommy, I need to gooo!
  • Mommy, I went
  • (Does it require operator action?
    How much time is there?)

25
Avoid Multiple Alarm Levels
  • Analog PVs for Temp/Press/Res.Err./
  • Easy to set LOLO, LOW, HIGH, HIHI
  • Consider
  • Do they require significantly different operator
    actions?
  • Will there be a lot of time after the HIGH to
    react before a follow-up HIHI alarm?
  • In most cases, HIGH HIHI only double the alarm
    traffic
  • Set only HSV to generate single, early alarm
  • Adding HHSV alarm assuming that the first one is
    ignored only worsens the problem

26
Bad Example Old SNS MEBT Alarms
  • Each amplifier trip 3 identicalalarms, no
    guidance
  • Rethought w/ subsystemengineer, IOC
    programmerand operators 1 better alarm

27
Alarms for Redundant Pumps
28
Alarm Generation Redundant Pumps
the wrong way
  • Control System
  • Pump1 on/off status
  • Pump2 on/off status
  • Simple Config setting Pump Off gt Alarm
  • Its normal for the backup to be off
  • Both running is usually bad as well
  • Except during tests or switchover
  • During maintenance, both can be off

29
Redundant Pumps
1
Required Pumps
  • Control System
  • Pump1 on/off status
  • Pump2 on/off status
  • Number of running pumps
  • Configurable number of desired pumps
  • Alarm System Running Desired?
  • with delay to handle tests, switchover
  • Same applies to devices that are only needed
    on-demand

30
Weekly Review How Many? Top 10?
31
A lot of information available
  • How often did PV trigger?
  • For how long?
  • When?
  • Temporary issue?Or need HYST,alarm delay,fix
    to hardware?

32
Weekly Check Stale? Forgotten?
33
Similar DESY Alarm System
IOC
Other
CSS
Interconnection Server
No Channel Access Monitor of selected alarm
PVs! IOCs push all alarms via new protocol into
Interconn. Server.
JMS
Filt.Alrm
LOG
ALARM
JMS2RDB
Filters
LDAP
GUI Similar to SNS GUI shown here
RDB
34
Design Choices
  • Similar alarm table and tree GUIs
  • JMS for communication
  • slightly different messages, though
  • DESY IOCs send all alarms, then filtered in AMS
  • DESY All IOC alarms should show up in AMS, zero
    additional configuration
  • At SNS, how many of the 350000 PVs would send
    alarms?We want to make the addition of alarms
    simple, but not automatic, and encourage
    guidance, related displays.
  • DESY/SNS LDAP vs. RDB for configuration/state
  • Choice was based on available infrastructure.
  • JMS Listeners
  • SNS Logger, Annunciator
  • DESY Logger, Send SMS, EMail, Voice Mail

35
AMS Alarm Message SystemConfiguration Views
  • AMS is a JMS (Java Message Service ) based
    Information-System.
  • It offers different options for message
    distribution
  • SMS
  • E-Mail
  • Voices-Mail
  • Another JMS Topic
  • Messages are sent on the basis of filtered PV.
    (Filters can be combined AND/OR Sequence)
  • The recipients are Users or User groups. User
    groups can be used in two ways.
  • Send to all Users
  • Send to one after another until a user confirms
    the message
  • User, User groups as well as Filters and Actions
    are configures in the AMS configuration View

Slide info from Helge Rickens, DESY
36
AMS
Editor to configure a Filter
Different views to select User, User-Group,
Filter condition, Filter and Alarm Topics
Slide info from Helge Rickens, DESY
37
Summary
  • BEAST operational since Feb09
  • Needs a logo
  • For now without BEAUtY
  • DESY AMS is similar and has beenoperational for
    longer
  • Pick either, but good configuration requires work
    in any case
  • Started with previous annunciated alarms
  • 300, no guidance, no related displays
  • Now 330, all with guidance, rel. displays
  • Philosophy helps decide what gets added and how
  • Immediate Operator Action? Consequence?Response
    Time?
  • Weekly review spots troubles and tries to improve
    configuration
Write a Comment
User Comments (0)
About PowerShow.com