RRN 00048 ANOMALY REPORT

LQTDAY09

Message Triggering Condition

This report is triggered in case of the reception of an anomaly detected by the system. Reports of anomalies detected by the system are analysed by Analyse Error Reports (FMM) to determine what action, if any, must be taken to protect the system from the cause of the anomaly and to prevent the distribution of the effect of the anomaly throughout the system. Anomalies include : * failure of hardware to perform as expected by the software, * hardware malfunctions detected by hardware validity checking circuits (registers, rack / master alarms PBAs). Analyse Error Report is also informed for autonomous reactions as restart, reload performed by error handler. The actual report layout is formatted depending on the error originator. Normally (most cases) the first block is displayed, while the second block is displayed in case of the bootstrap is originated by NMI. The third block is used for multipath self routing switch software errors

Report Layout

   SWA-ERROR REPORTING
      ERRTYPE = (A).....
      ---------------------------------------------------------------
        (1):(2):(3)        NA = (4). OBC = (5)..
        ERCLAS SEQ RLVL (6)   PROC  CS : IP  USERDATA
        (7)... (8) (9). (10). (11) (12):(13) (14) (15) (16) (17) (18) (19)
(previous line continue on next line)
        (20) (21) (22) (23)
        (24):(25):(26)        NA = (27) OBC = (28).
        ERCLAS SEQ RLVL  DS   ES   SS   CS   IP   SP   BP   SI   DI   BX
        (29).. (30) (31) (32) (33) (34) (35) (36) (37) (38) (39) (40) (41)
        NA = (42)
        MANAGED OBJECT CLASS = (43).........
        EVENT TYPE           = (44).................. EVENT TAG    = (45)..
        EVENT TIME (Y,M,D,H,M,S,MICROSEC) = (46) (47) (48) (49) (50)(51)(52)..
        EVENT INFORMATION
              PROBABLE CAUSE = (53)......
          PERCEIVED SEVERITY = (54)......
                 ERROR CLASS = (55)......
                    PBA TYPE = (56)..                 PBA NUMBER   = (57)..
        SOFT CONTEXT   SC-ID = (58)    VAR = (59)     THREAD ID    = (60)..
        SOURCE CONTEXT SC-ID = (61)    VAR = (62)
             AUTHORITY TYPE  = (63)......
             RECOVERY  TYPE  = (64)......
             CLEARCASE LABEL = (65)....................
             FILENAME        = (66)..........................
             LINE NUMBER     = (67)......           SEQ NBR      = (68)......
        AWI CONTEXT PROC INSTANCE SEL  = (69)....   OPERATION ID = (70)....
        PERFORMED RECOVERY BOOT LEVEL  = (71)........
        DATA STATUS
                        DIRECT ACCESS  = (72).........
               SIMPLEX MEM WRITE PROT  = (73).........
                DUPLEX MEM WRITE PROT  = (74).........
                     SIMPLEX MEM DISK  = (75).........
                      DUPLEX MEM DISK  = (76).........
                               EEPROM  = (77).........
        MANDATORY USER DATA NBR WORDS  = (78)
        (79).... (80).... (81).... (82).... (83).... (84).... (85).... (86)....
        OPTIONAL  USER DATA
        (87)...........................
  REPORT REFERENCE NUMBER = 0048

Parameter Identification

Par. No. Meaning Remarks
(A) error type . . . . . . . . . . . . refer to SI008 List of Error Types
(1) Error detection time minutes
(2) Error detection time seconds
(3) Error detection time tenths of seconds
(4) Network address in hexadecimal format
(5) On-board Controller (OBC) identity in case of the error is generated
by OBC
(6) Program type
CLK SSM clock process
EVH SSM event handler
FMM FMM process
INT SSM interrupt process
OSN operating system nucleus
(7) Error class
AUDIT errors reported from audit
programs (not including blocked
process audit)
BPA errors reported from "Blocked
Process Audit" when a process is
found blocked
DPM errors in case of Dual Port
Memory failure (PLCE or defence
processor)
EH-ERR error class used by error handler
to report the errors it has
detected itself
F-REL indication of a forced bootstrap
of a processor, message interface
FMM puts this class in the boot
buffer after the reception of a
forced boot request from
maintenance or the operator
F-RES indication of a forced restart of
a processor, message interface
FMM puts this class in the
restart buffer after the
reception of a forced restart
request from maintenance or the
operator
HW hardware errors found by device
handlers
INIT error class to be used when an
FMM is not able to initialise
itself after having received the
FMM_INIT message
MATE in case the message handler
detects an access failure from a
standby CE that tries to
communicate with the mate
NMI non maskable interrupt such as
write protect violation or
invalid addressing reported by
process manager
NOTIFY error class to be used for error
reports with no system reaction
needed but which have always to
be reported to the operator
O-NREC non-recoverable errors reported
by OSN module
O-REC recoverable errors in OSN module
O-USER errors detected by OSN during the
process of primitives due to
incorrect info given by the user
(FMM or SSM) of the primitives
OM errors reported from Local
Overlay Manager when an overlay
FMM is timed out
PWR-ON power-on indication
REL error class to indicate
explicitely that a processor
reload is needed (to be used very
carefully)
RES error class to indicate
explicitely that a processor
restart is needed (to be used by
non-critical FMMs)
RT hardware errors found by routine
tests
SANITY unmaskable interrupt from Sanity
Timer
SUMCHK sumcheck failure
U-NREC errors reported by FMMs or SSMs
in case that no internal recovery
was possible
U-REC errors reported by FMMs or SSMs
in case that internal recovery
was possible
(8) Sequence number an indication where the error
occurs in the originating program
(9) Recovery level
ABOR abort (the error handler aborts
the process or the FMM that
caused the problem)
BOOT boot (the error handler triggers
the ROM BOOTSTRAP program to
execute the FAST TESTS)
NONE no recovery (the error handler
reports te error to LEA with an
anomaly message and returns back
to the caller)
REST restart (the error handler
triggers the ROM RESTART program)
TKOV take-over (the error handler
reports the error to LEA and
requests a switch-over to RAM
RESTART)
(10) Program identity FMM or SSM identity (if the
originator is not OSN), see SI
024 in case of FMM identity
(11) Process identity in hexadecimal format
(12) Code segment of the program counter in hexadecimal format
(13) Instruction pointer of the program counter in hexadecimal format
(14) User data bytes that contain extra information about the error 5 words (10 bytes), in
hexadecimal format
(15) User data bytes that contain extra information about the error 5 words (10 bytes), in
hexadecimal format
(16) User data bytes that contain extra information about the error 5 words (10 bytes), in
hexadecimal format
(17) User data bytes that contain extra information about the error 5 words (10 bytes), in
hexadecimal format
(18) User data bytes that contain extra information about the error 5 words (10 bytes), in
hexadecimal format
(19) User data bytes that contain extra information about the error 5 words (10 bytes), in
hexadecimal format
(20) User data bytes that contain extra information about the error 5 words (10 bytes), in
hexadecimal format
(21) User data bytes that contain extra information about the error 5 words (10 bytes), in
hexadecimal format
(22) User data bytes that contain extra information about the error 5 words (10 bytes), in
hexadecimal format
(23) User data bytes that contain extra information about the error 5 words (10 bytes), in
hexadecimal format
(24) Error detection time minutes
(25) Error detection time seconds
(26) Error detection time tenths of seconds
(27) Network address in hexadecimal format
(28) On-board Controller (OBC) identity in case of the error is generated
by OBC
(29) Error class
see param (7)
(30) Sequence number an indication where the error
occurs in the originating program
(31) Recovery level
see param (9)
(32) Contents of register if bootstrap originated by NMI (Non Maskable Interrupt) in hexadecimal format
(33) Contents of register if bootstrap originated by NMI (Non Maskable Interrupt) in hexadecimal format
(34) Contents of register if bootstrap originated by NMI (Non Maskable Interrupt) in hexadecimal format
(35) Contents of register if bootstrap originated by NMI (Non Maskable Interrupt) in hexadecimal format
(36) Contents of register if bootstrap originated by NMI (Non Maskable Interrupt) in hexadecimal format
(37) Contents of register if bootstrap originated by NMI (Non Maskable Interrupt) in hexadecimal format
(38) Contents of register if bootstrap originated by NMI (Non Maskable Interrupt) in hexadecimal format
(39) Contents of register if bootstrap originated by NMI (Non Maskable Interrupt) in hexadecimal format
(40) Contents of register if bootstrap originated by NMI (Non Maskable Interrupt) in hexadecimal format
(41) Contents of register if bootstrap originated by NMI (Non Maskable Interrupt) in hexadecimal format
(42) Network address in hexadecimal format
(43) Managed object class
(44) Event type
(45) event tag
(46) Error detection time: year
(47) Error detection time: month
(48) Error detection time: day
(49) Error detection time: hour
(50) Error detection time: minutes
(51) Error detection time: seconds
(52) Error detection time: microseconds
(53) Probable cause
(54) Perceived severity
(55) Multipath self routing switch error class
MEM-VIOLATION memory violation
NREC-CODE non recoverable code
NREC-DATA non recoverable data
NREC-HW non recoverable HW
OAM-REQ operations and maintenance
request
OPERATOR-REQ operator request
SW-SWAPPING-REQ request for software swapping
TAKE-OVER-REQ request for take over
(56) Multipath self routing switch board type
BB-AS access switch
BB-CCK central clock board
BB-PS1 plane switch stage 1
BB-PS2 plane switch stage 2
BB-RCD rack clock distribution board
BB-RLK rack controller link board
BB-RPU rack based processing unit board
BB-SCC supervision and control module
for cooling, board
BB-XLK link termination board
(57) Multipath self routing switch board number
(58) Soft context software component id mnemonic
(59) Soft context variant mnemonic
(60) Soft context software thread id
(61) Source context software component id mnemonic
(62) Source context software component id mnemonic
(63) Source context authority type
APPL application software
KERNEL kernel software
OAM operation and maintenance
(64) Source context recovery type
(65) Source context clearcase label
(66) Source context filaname
(67) Source context line number
(68) Source context sequence number
(69) Application writers interface context process instance selector
(70) Application writers interface context operation identity
(71) Performed recovery level boot boot level
COLD-DISK cold boot from disk
COLD-NETW cold boot via network
HOT-DISK hot boot from disk
HOT-NETW hot boot via network
HOT-RAM hot boot from random access
memory
MTR-EVENT boot after hardware watchdog
timer event
POWER-ON boot after power on
WARM-DISK warm boot from disk
WARM-NETW warm boot via network
WARM-RAM warm boot from random access
memory
(72) Performed recovery level data direct access data
RESET reset of data done
VALID data valid
(73) Performed recovery level data simplex memory write protected
data
RESET reset of data done
VALID data valid
(74) Performed recovery level data duplex memory write protected
data
RESET reset of data done
VALID data valid
(75) Performed recovery level data simplex memory disk data
RESET reset of data done
VALID data valid
(76) Performed recovery level data duplex memory disk data
RESET reset of data done
VALID data valid
(77) Performed recovery level data eeprom data
RESET reset of data done
VALID data valid
(78) Mandatory user data, number of meaningful words
(79) Mandatory user data word 1
(80) Mandatory user data word 2
(81) Mandatory user data word 3
(82) Mandatory user data word 4
(83) Mandatory user data word 5
(84) Mandatory user data word 6
(85) Mandatory user data word 7
(86) Mandatory user data word 8
(87) Optional user data text string