Skip to content
Projects
Groups
Snippets
Help
Loading...
Sign in
Toggle navigation
W
White Rabbit
Project
Project
Details
Activity
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
5
Issues
5
List
Board
Labels
Milestones
Merge Requests
0
Merge Requests
0
Wiki
Wiki
image/svg+xml
Discourse
Discourse
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Commits
Issue Boards
Open sidebar
Projects
White Rabbit
Commits
62f9c378
Commit
62f9c378
authored
Mar 06, 2015
by
Grzegorz Daniluk
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
documents/wrs_failures: adding notes on MIB organization for General and Expert status
parent
91a51bea
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
130 additions
and
72 deletions
+130
-72
snmp_exports.tex
...s/specifications/management/wrs_failures/snmp_exports.tex
+130
-72
No files found.
documents/specifications/management/wrs_failures/snmp_exports.tex
View file @
62f9c378
\section
{
SNMP exports
}
\section
{
SNMP exports
(WIP)
}
\label
{
sec:snmp
_
exports
}
\label
{
sec:snmp
_
exports
}
\subsection
{
Operator/basic objects
(WIP)
}
\subsection
{
Operator/basic objects
}
Objects providing basic status of the WR Switch. It should be used by control
Objects providing basic status of the WR Switch. It should be used by control
system operators and people without deep knowledge of the White Rabbit
system operators and people without deep knowledge of the White Rabbit
internals. These values report the general status of the device and high level
internals. These values report the general status of the device and high level
errors.
errors.
\\
\noindent
\rule
{
\textwidth
}{
2pt
}
{
\bf
Note
}
: We will need to change the SNMP code. There should be something like
{
\bf
Note
}
: Basically I think we should have another process monitoring various
a loop reading all information periodically (e.g. every 5s) from various SHM
stuff according to possible faults that may occur. This process should then be
areas (HAL, PPSi, SPLL), caching and calculating general status information.
used to report high-level information i.e. if this OK, is that OK, etc. At least
This way, when we receive SNMP request we can feed the information from our
for more complex stuff, e.g. we can simply export temperature or CPU load and
local SNMP cache. The same code could be later used to generate SNMP Traps.
\\
let NMS to decide when it's bad.
\\
\noindent
\rule
{
\textwidth
}{
2pt
}
\begin{itemize}
[leftmargin=0pt]
\noindent
{
\bf
General Status
}
:
\item
[]
\texttt
{
WR-SWITCH-MIB::status
}
\\
- general status word for WR Switch.
\begin{itemize}
%[leftmargin=0pt]
It is split into several 2-bit fields. Each of them describes one
\item
WRS general status - OK / Warning / Error
function of the WR Switch and can be:
\item
Timing Status
\begin{packed_items}
\item
Networking Status
\item
[]
{
\bf
"00"
}
- Status OK
\item
System Statue
\item
[]
{
\bf
"01"
}
- Status Warning
\item
Detailed status
\item
[]
{
\bf
"10"
}
- Status Failure
\begin{itemize}
\end{packed_items}
\item
Timing
\vspace
{
12pt
}
\begin{itemize}
\begin{tabular}
{
|c|l|
}
\item
PTP (TRACK
\_
PHASE, offset, RTT, fixed deltas, deamon crash,
\hline
servo
\_
update
\_
cnt)
bits
&
description
\\
\item
SoftPLL (DelCnt = 0; mode, SeqState, AlignState)
\hline
\hline
\item
Slave link down
1:0
&
PTP status
\\
\item
PTP frames flowing ?
3:2
&
SoftPLL status - OK if locked and aligned
\\
\item
(placeholder for Switchover)
5:4
&
Switching status
\\
\item
(placeholder for Holdover)
7:6
&
System status
\\
\end{itemize}
&
Redundancy status
\\
\item
Networking
\hline
\begin{itemize}
\end{tabular}
\item
(placeholder for Link down)
\item
SFPs (portSfpError.<x> ?)
\item
Endpoint status (2.2.2)
\item
Swcore status (2.2.3, 2.2.5)
\item
RTU status (2.2.4, 2.2.7)
\item
(placeholder for TRU)
\item
(placeholder for switchover or backup link state)
\end{itemize}
\item
System
\begin{itemize}
\item
Boot ok
\item
Free memory too low
\item
Temperature
\item
CPU load too high
\item
Disk space too low (?)
\end{itemize}
\end{itemize}
\item
Version (rewrite existing)
\begin{itemize}
\item
last date/time when firmware was updated
\\
(save current time on restart, when new firmware is in /update so that it can be exported with SNMP)
\item
contact info
\item
build by
\item
build date
\item
hash, HW, SW,
\item
(check what exists and add missing)
\end{itemize}
\end{itemize}
\item
[]
\texttt
{
WR-SWITCH-MIB::ptpMode
}
\\
\newpage
Synchronization mode: Grand Master / Free-running Master / Slave
\subsection
{
Expert/extended status
}
\item
[]
\texttt
{
WR-SWITCH-MIB::spllState
}
\\
Expert objects can be used by White Rabbit experts for the in-depth diagnosis of
\begin{packed_items}
the switch failures. These values are verbose and should not be used by
\item
[]
\texttt
{
WR-SWITCH-MIB::spllState.mode
}
: (Grand Master /
operators.
Free-running Master / Slave)
%\item [] \texttt{WR-SWITCH-MIB::spllState.locked}: is Helper/Main locked (true / false)
%\item [] \texttt{WR-SWITCH-MIB::spllState.aligned}: is it phase-aligned (true / false)
\item
[]
\texttt
{
WR-SWITCH-MIB::spllState.hover
}
: is in holdover (true /
false)
\item
[]
\texttt
{
WR-SWITCH-MIB::spllState.sover
}
: is it switched-over to a
backup link (true / false)
\end{packed_items}
\item
[]
\texttt
{
WR-SWITCH-MIB::ptpClockOffsetPs
}
\\
\begin{itemize}
Clock offset calculated by PTP/PPSi
\item
Operation Status
\begin{itemize}
\item
CPU Load (
\%
)
\item
current time
\begin{itemize}
\item
TAI
\item
date string
\end{itemize}
\item
Boot status
\begin{itemize}
\item
boot cnt
\item
restart reason
\item
boot status values
\\
(1 object for each: hwinfo readout, FPGA, LM32, kernel modules, userspace daemons, config retreived ok)
\item
config source (tftp, flash, as string?)
\end{itemize}
\item
Temperature
\begin{itemize}
\item
temp 1..4
\item
threshold 1..4
\end{itemize}
\end{itemize}
\item
[]
\texttt
{
WR-SWITCH-MIB::tempFPGA
}
\\
- SCB temperature below the FPGA
\item
Restart Counters
\item
[]
\texttt
{
WR-SWITCH-MIB::tempScbPsu.1
}
\\
- SCB temperature near the
\begin{itemize}
power supply circuit
\item
HAL
\item
[]
\texttt
{
WR-SWITCH-MIB::tempScbPsu.2
}
\\
- SCB temperature near the
\item
PPSi
power supply circuit
\item
RTUd
\item
[]
\texttt
{
WR-SWITCH-MIB::tempPLL
}
\\
- SCB temperature near the VCXO and
\item
(..)
PLLs
\item
SPLL
\end{itemize}
\item
SoftPLL state
\begin{itemize}
\item
mode, irqcnt, seqstate, alignstate, Hlock, Mlock, Block[18], Err[18], HY, MY, delCnt, holdover, holdoverTime
\item
spll version
\item
spll build date
\item
(...)
\end{itemize}
\item
[]
\texttt
{
WR-SWITCH-MIB::portLink.<n>
}
\item
Networking
\end{itemize}
\begin{itemize}
\item
VLAN table dump
\item
RTU table dump (check if management sw uses snmpwalk)
\item
SW core status
\begin{itemize}
\item
Free pages
\end{itemize}
\end{itemize}
\item
Pstats (pivot table, some of the counters should be used to fill
standard MIBs)
\item
PtpData (make it an array for later switch-over needs)
\begin{itemize}
\item
per instance/ which port
\end{itemize}
\item
Ports status (per-port information)
\begin{itemize}
\item
portEnable (enable/disable port via ifconfig)
\item
ptpTxFrames (per port or per instance, depending on implementation)
\item
ptpRxFrames (per port or per instance, depending on implementation)
\end{itemize}
\item
Configuration
\begin{itemize}
\item
PPS width
\end{itemize}
\begin{itemize}
\item
Uptime
\item
Firmware version
\item
Hardware version
\item
Manufacturer
\item
Serial number
\item
How WRS was configured (manually / .config fetched from server/...)
\item
Current WR time
\item
Last date/time when firmware was upgraded
\item
Contact info
\item
Link failure detected, switched over to a backup link No. X
\item
WRS has booted successfully, none of the steps has failed (reading HW
info, programming FPGA and LM32, loading kernel modules, starting daemons)
\end{itemize}
\end{itemize}
\newpage
\newpage
\subsection
{
Expert objects
}
\subsection
{
Expert objects (to be updated, was first draft
}
Expert objects can be used by White Rabbit experts for the in-depth diagnosis of
{
\bf
Note:
}
we will put here MIB file dump later.
the switch failures. These values are verbose and should not be used by
operators.
\subsubsection
{
PTP/WR parameters
}
\subsubsection
{
PTP/WR parameters
}
\begin{itemize}
[leftmargin=0pt]
\begin{itemize}
[leftmargin=0pt]
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment