Commits · 8e3dfe52c201f388cad23c60022fa08a271ae9f6 · Projects / PPSi

12 Jul, 2017 15 commits

audit porting: ported the patches of the audit chapter 3.3 · 8e3dfe52

Sven Meier authored Jul 12, 2017

Most of them were already ported. Added a check before returning the next delay
in case a timeout for a (p)delay request has to be taken into account.

8e3dfe52

audit porting: ported the patches of the audit chapter 2.4.5 · a8683138

Sven Meier authored Jul 12, 2017

Now everywhere the packet buffer is a void *buf and the length is int len

a8683138

audit porting: ported the patches of the audit chapter 2.4.4 · c105365a

Sven Meier authored Jul 12, 2017

Only the patches regarding the flag from the current master were
taken over; the other flags are still needed and used.

c105365a

audit porting: ported the patches of the audit chapter 2.4.3 · 17c4f21f

Sven Meier authored Jul 11, 2017

Streamlined length naming, other parts were already ported

17c4f21f

audit porting: ported the patches of the audit chapter 2.4.2 · b612bab6

Sven Meier authored Jul 11, 2017

No real porting was required, only some clarification; the rest was already ported

b612bab6

audit porting: ported the patches of the audit chapter 2.4.1 · b5cdb8a3

Sven Meier authored Jul 11, 2017

The patches from the audit chapter 2.4.1 were ported. Some of the
changes were not ported due to previous changes or different
foreign master handling, e.g. announce unpacking is still done since
this seems more in line with other message handling and is actually
still used for field extraction for the foreign master data set

b5cdb8a3

bmc: fixed announce timeout dataset update and initializing dataset update · ab3d3253

Sven Meier authored Jul 07, 2017

For the announce receipt timeout the datasets shall be updated depending on the other ports' states: the parent dataset is updated only if no other port is in slave state.
When a link was connected, the port ran through initializing, which wrongly updated the parent dataset and caused a short master-change condition and resyncing.

ab3d3253

msg: cleared out frames so all reserved fields are 0 · 84c4984a
Sven Meier authored Jul 05, 2017

84c4984a

p2p: only answer peer delay messages when in p2p mode · 78f90b18

Sven Meier authored Jul 05, 2017

Changed peer delay handling so that p2p messages are only answered when in this mode.
Added p2p messages to some states where they were missing.

78f90b18

bmc: timeouts fixed and announce handling changed · 53cb49ff

Sven Meier authored Jul 03, 2017

Message handling for the case of received back frames changed and timeouts for announces reset according to standard

53cb49ff

bmc: handshake started on state base and announce change · e1e3935d

Sven Meier authored Jun 29, 2017

The calibration handshake is now started on the slave state base
Announce messages from the same device are handled differently for a BC

e1e3935d

bmc: added hook for state machine extension · 0b21e2c6

Sven Meier authored Jun 28, 2017

A hook was added that handles the wr states, so that they don't get overwritten by bmc decisions.
The extension stays in the white rabbit states until a calibration is done.

0b21e2c6

bmc: added debug info and changed fsm, workaround for htons · 4dc9a628

Sven Meier authored Apr 28, 2017

The FSM order was changed in order to leave the fsm on a state change and re-enter the fsm directly afterward.
This makes sure no state decision is overwritten by the state handling and each state decision is handled at least once.
There seems to be an issue with htons on unaligned addresses, which is the case for stepsRemoved in an announce;
a workaround of a manual 16-bit conversion was added.

4dc9a628
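
The manual 16-bit conversion mentioned in this commit can be illustrated with a minimal sketch. The helper names `get_be16` and `put_be16` are hypothetical and not taken from the PPSi source; the idea is only that assembling the value byte by byte never issues a 16-bit load or store, so the field may sit at any address.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: read a big-endian (network order) 16-bit field
 * one byte at a time, so the pointer does not need to be aligned. */
static uint16_t get_be16(const void *p)
{
	const uint8_t *b = p;

	assert(b != NULL);
	return (uint16_t)((b[0] << 8) | b[1]);
}

/* Hypothetical helper: store a host-order value as a big-endian
 * 16-bit field, again one byte at a time. */
static void put_be16(void *p, uint16_t v)
{
	uint8_t *b = p;

	assert(b != NULL);
	b[0] = (uint8_t)(v >> 8);
	b[1] = (uint8_t)(v & 0xff);
}
```

Because the result is built from individual bytes, the same code gives the same answer on little-endian and big-endian hosts.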

bmc: more changes · 280580e7

Sven Meier authored Apr 27, 2017

Merged uncalibrated and slave state, added event handling to the individual states,
changed in all states the frame handling to table driven handling,
moved common handling from common to slave since it is actually not common,
fixed state passive to be according to standard,
added uncalibrated handling,
fixed listening and master frame handling,

280580e7
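
As a rough illustration of what table-driven frame handling can look like, the sketch below dispatches a received frame through a per-state table indexed by message type. The handler signature, the table name and the return convention are assumptions made for this example and are not the PPSi definitions; the message type values are the standard PTP ones.

```c
#include <assert.h>
#include <stddef.h>

/* PTP messageType values; MSG_TYPE_COUNT covers the 4-bit field. */
enum msg_type {
	MSG_SYNC = 0x0,
	MSG_PDELAY_REQ = 0x2,
	MSG_PDELAY_RESP = 0x3,
	MSG_FOLLOW_UP = 0x8,
	MSG_PDELAY_RESP_FUP = 0xa,
	MSG_ANNOUNCE = 0xb,
	MSG_TYPE_COUNT = 0x10
};

/* Hypothetical handler signature: returns 0 when the frame was consumed. */
typedef int (*frame_handler)(void *ctx, const void *buf, int len);

/* Example handler: counts announces in the int that ctx points to. */
static int on_announce(void *ctx, const void *buf, int len)
{
	(void)buf;
	(void)len;
	assert(ctx != NULL);
	++*(int *)ctx;
	return 0;
}

/* One table per state; message types without an entry stay NULL. */
static const frame_handler listening_table[MSG_TYPE_COUNT] = {
	[MSG_ANNOUNCE] = on_announce,
};

/* Look up the handler for this type; return 1 if the state ignores it. */
static int dispatch_frame(const frame_handler *table, unsigned type,
			  void *ctx, const void *buf, int len)
{
	if (type >= MSG_TYPE_COUNT || table[type] == NULL)
		return 1;
	return table[type](ctx, buf, len);
}
```

With this layout, adding a message to a state means adding one table entry instead of another branch in the state function.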

bmc: add aging, call periodically and data sets · cdf616d8

Sven Meier authored Apr 19, 2017

BMC fixed to be called periodically and data set comparisons fixed,
also changed the foreign master table adding and aging and some
minor adaptations in state pre-master

cdf616d8

06 Jul, 2017 1 commit

Revert "wr-servo: simplify servo_reset (do as init does)" - it breaks 1-PPS in master mode · ed394bc1

Grzegorz Daniluk authored Jul 06, 2017

This reverts commit a72f6bdc.

ed394bc1

03 Jul, 2017 1 commit

wrpc: remove warning caused by HAS_ABSCAL · 365e3e99

Adam Wujek authored Jul 03, 2017

Remove warning that HAS_ABSCAL is redefined
Signed-off-by: Adam Wujek <adam.wujek@cern.ch>

365e3e99

29 Jun, 2017 2 commits

Makefile: Propagate a CONFIG_ABSCAL from WRPC · 4a7e9fae

Adam Wujek authored Jun 29, 2017

Signed-off-by: Adam Wujek <adam.wujek@cern.ch>

4a7e9fae

time-wrpc: remove some more code when ABSCAL is not used · f39d9e79

Adam Wujek authored Jun 29, 2017

Signed-off-by: Adam Wujek <adam.wujek@cern.ch>

f39d9e79

23 Jun, 2017 3 commits

wr/wrpc: implement absolute calibration support · 0f688012

Alessandro Rubini authored May 18, 2017

This is a port of previous work by Peter Jansweijer from nikhef.

To perform absolute calibration, we need a grand-master look-alike
mode that sends sync once a second (and hopefully slightly after the
pps signal).

Using a special gateware that sends a pulse whenever a frame is
transmitted and received, users can correlate collected timestamps
(T1 and T4), this special pulse and the pps pulse of the node.

The procedure for absolute calibration is described in

http://www.ohwr.org/attachments/4542/WhiteRabbitAbsoluteCalibrationProcedure.pdf

Another commit, in wrpc-sw, adds "mode abscal" for this feature to be used.
Signed-off-by: Alessandro Rubini <rubini@gnudd.com>

0f688012

export msg_pack_sync to fsm code · 9b178a77

Alessandro Rubini authored May 18, 2017

This is used by absolute calibration, where we send sync and no f-up.
We may implement two-step flag, actually, but this is an easier choice.
Signed-off-by: Alessandro Rubini <rubini@gnudd.com>

9b178a77

diag: print send/recv stamp with added ps field · e94a665b

Alessandro Rubini authored May 18, 2017

format is "%9d.%09d.%03d". This is not properly a floating point number, but
counting 9 digits is already heavy, I'd better not have a 12-digit field
(which, btw, will be wrongly converted by 32-bit parsers).

This comes from a similar change by Peter Jansweijer from nikhef,
for absolute-calibration work.
Signed-off-by: Alessandro Rubini <rubini@gnudd.com>

e94a665b
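
A minimal sketch of the three-field stamp format quoted in this commit follows. The function name `fmt_stamp` and its parameters are hypothetical; only the format string comes from the commit message. Keeping seconds, nanoseconds and picoseconds as separate integer fields avoids a single 12-digit fraction, which would not fit a 32-bit integer.

```c
#include <assert.h>
#include <stddef.h>
#include <stdio.h>

/* Hypothetical helper: print a stamp as seconds.nanoseconds.picoseconds
 * using the format string from the commit message.
 * Returns what snprintf returns (the length that was needed). */
static int fmt_stamp(char *buf, size_t len, int sec, int nsec, int psec)
{
	assert(buf != NULL);
	return snprintf(buf, len, "%9d.%09d.%03d", sec, nsec, psec);
}
```

For example, calling `fmt_stamp(s, sizeof(s), 863, 333174267, 586)` produces the seconds right-aligned in nine columns, followed by `.333174267.586`.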

12 Jun, 2017 3 commits

pdelay: rework and extend prev commit · 31f08f19

Alessandro Rubini authored Jun 12, 2017

The previous commit is not enough as a fix.  This may happen:

    - we invalidate stamps after processing them
    - we send request
    - get reply, lose reply-fup
    - send request
    - lose reply, get f-up

So we now invalidate when sending the request. And invalidate t4 alone
as the beautifulness and symmetry of the previous commit is lost
anyways.

Note: there is no need to invalidate stamps in e2e mode, because checking
the sequence number to validate RX frames is enough.  But here all
replies match the sequence number, so the problem is not caught and
stamps from different tuples are mixed.

Example before this commit, with trimmed stamps (was 1497283863):

   diag-frames-1-wr1: SENT 54 bytes at 863.333173928 (pdelay_req)
   diag-frames-1-wr1: RECV 54 bytes at 863.334158796 (type 3, pdelay_resp)
   diag-frames-1-wr1: Drop received frame
   diag-frames-1-wr1: SENT 54 bytes at 864.479336104 (pdelay_req)
   diag-frames-1-wr1: Drop received frame
   diag-frames-1-wr1: RECV 54 bytes at 864.481095164 (type a, presp_follow_up)

   diag-servo-2-wr1: servo:t3 = 864:479336104:0
   diag-servo-2-wr1: servo:t4 = 863:333174267:586
   diag-servo-2-wr1: servo:t5 = 864:480295312:0
   diag-servo-2-wr1: servo:t6 = 863:334158796:773
   diag-servo-2-wr1: ->mdelay = -2:-292298352:359
Signed-off-by: Alessandro Rubini <rubini@gnudd.com>

31f08f19

pdelay: mark stamps as invalid after use · 223459dd

Alessandro Rubini authored Jun 12, 2017

The code is checking the sequence number of pdelay-rep and
pdelay-rep-fup, but we may miss the reply and get the f-up.

The result was something like this (first tuple is ok, next is wrong):

   diag-servo-2-wr1: servo:t3 = 1497279009:22584224:0
   diag-servo-2-wr1: servo:t4 = 1497279009:22584574:759
   diag-servo-2-wr1: servo:t5 = 1497279009:23564032:0
   diag-servo-2-wr1: servo:t6 = 1497279009:23564365:547
   diag-servo-2-wr1: ->mdelay = 0:684:306

   diag-servo-2-wr1: servo:t3 = 1497279009:663586672:0
   diag-servo-2-wr1: servo:t4 = 1497279009:22584574:759
   diag-servo-2-wr1: servo:t5 = 1497279009:683142000:0
   diag-servo-2-wr1: servo:t6 = 1497279009:23564365:547
   diag-servo-2-wr1: ->mdelay = -1:-300579732:306

Here, t4 and t6 are old. The former is the receipt of the request,
sent back to the "slave" in the pdelay-reply payload; the latter is
the receive time of such frame.

We now invalidate t4 and t5 when using the tuple. They are the two
"remote" times, one sent back in the response and the other sent back
in the response-fup.
Signed-off-by: Alessandro Rubini <rubini@gnudd.com>

223459dd

proto-standard: trivial: merge common code · dfeb1890

Alessandro Rubini authored Jun 12, 2017

Signed-off-by: Alessandro Rubini <rubini@gnudd.com>

dfeb1890

06 Apr, 2017 2 commits

send_and_log: treat drop events as success · 1c2b0811

Alessandro Rubini authored Apr 06, 2017

Dropping on tx is normal behaviour under test. state-listening is
still entering faulty (thus waiting 4 seconds before restarting) if tx
fails.
Signed-off-by: Alessandro Rubini <rubini@gnudd.com>

1c2b0811

general: use PP_RECV_DROP and PP_SEND_DROP instead of "-2" · 3a3ef679

Alessandro Rubini authored Apr 06, 2017

It was a hack of mine, I'd better call it by name. We must spit
no errors when injecting faults.
Signed-off-by: Alessandro Rubini <rubini@gnudd.com>

3a3ef679
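
The two commits of this day can be pictured with a small sketch: a named constant replaces the bare -2, and the caller treats an injected drop as success. The constant names come from the commit title. Giving both the value -2 is an assumption based on the "-2" they replace, and the helper `tx_failed` is hypothetical.

```c
#include <assert.h>

/* Names from the commit title; the shared value -2 is assumed here. */
#define PP_SEND_DROP (-2)
#define PP_RECV_DROP (-2)

/* Hypothetical helper: decide whether a send result is a real failure.
 * An injected drop is expected under test, so it is not an error. */
static int tx_failed(int ret)
{
	if (ret == PP_SEND_DROP)
		return 0;
	return ret < 0;
}
```

A caller would then enter its fault handling only when `tx_failed()` returns nonzero, so fault injection does not produce error messages.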

05 Apr, 2017 9 commits

wr-servo: trivial: set update_time (for snmp) in missing places · 75707f6a

Adam Wujek authored Mar 15, 2017

Signed-off-by: Adam Wujek <adam.wujek@cern.ch>

75707f6a

general fix: implement SYNCHRONIZATION_FAULT · a438acc9

Alessandro Rubini authored Mar 14, 2017

If we stopped sending to the master or the peer (for traffic or
whatever; in my case with "fault drop"), we wouldn't notice the
problem.

This looks like SYNCHRONIZATION_FAULT (9.2.6.12), so this reuses the
almost-unused TO_FAULTY, renaming it to a more generic TO_FAULT.

9.2.6.12 says we should reach uncalibrated, but since uncalibrated doesn't
exist (it is never entered, it's dead and untested code at this point),
I handle the problem just like the timeout receiving announce messages.

For wr, I reset the servo, so the problem can be seen.
Signed-off-by: Alessandro Rubini <rubini@gnudd.com>

a438acc9

trivial: rename a variable · 0bdf155f

Alessandro Rubini authored Mar 13, 2017

Signed-off-by: Alessandro Rubini <rubini@gnudd.com>

0bdf155f

wr-servo: make re-track much faster · 63cb8ae9

Alessandro Rubini authored Mar 14, 2017

There is no need to go to 0 phase at servo init. It is already 0
at the beginning of the world, but on re-track it can be the same
as it was.

With this change, if we lose track due to packet loss and timeout
(thanks to a few commits ago), we recover in 1..4 seconds as opposed
to 5..9 without this commit.

Tested with "fault drop 0 1000" and later "fault drop 0 0", and
a syslog server:

   Jan  1 00:00:10 192.168.16.229 (22:33:44:55:66:77) Node up since 10 seconds
   Mar 14 15:48:55 192.168.16.229 Tracking after 7.178 s
   Mar 14 15:49:07 192.168.16.229 Lost track
   Mar 14 15:49:11 192.168.16.229 2-th re-rtrack after 4.171 s
   Mar 14 15:49:30 192.168.16.229 Lost track
   Mar 14 15:49:32 192.168.16.229 3-th re-rtrack after 2.485 s
   Mar 14 15:49:49 192.168.16.229 Lost track
   Mar 14 15:49:51 192.168.16.229 4-th re-rtrack after 2.559 s
   Mar 14 15:50:13 192.168.16.229 Lost track
   Mar 14 15:50:16 192.168.16.229 5-th re-rtrack after 3.171 s
   Mar 14 15:50:31 192.168.16.229 Lost track
   Mar 14 15:50:32 192.168.16.229 6-th re-rtrack after 1.589 s

With the original commit for this, Adam found that by unplugging
and re-plugging the fiber our setpoint is always increasing, up to big
values. I checked, and the softpll is always using the modulus. I brought
the phase value up to hundreds of nanos both positive and negative, without
any issues.  So this version of the commit makes a modulus of the set
point, to avoid it getting too big and scaring a user watching the logs.
Signed-off-by: Alessandro Rubini <rubini@gnudd.com>

63cb8ae9
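
Taking the modulus of the setpoint, as described in the last paragraph of this commit, could look roughly like the sketch below. The function name `wrap_setpoint` is hypothetical, and the period is a parameter because the commit does not state the value used.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper: wrap a phase setpoint (in picoseconds) into
 * the range [0, period_ps). C's % keeps the sign of the dividend,
 * so negative remainders are shifted up by one period. */
static int32_t wrap_setpoint(int64_t setpoint_ps, int32_t period_ps)
{
	int64_t r;

	assert(period_ps > 0);
	r = setpoint_ps % period_ps;
	if (r < 0)
		r += period_ps;
	return (int32_t)r;
}
```

With an assumed period of 8000 ps, a setpoint of 8100 ps wraps to 100 ps and a setpoint of -100 ps wraps to 7900 ps, so repeated re-tracking cannot make the logged value grow without bound.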

wr-servo: simplify servo_reset (do as init does) · a72f6bdc

Alessandro Rubini authored Mar 14, 2017

Signed-off-by: Alessandro Rubini <rubini@gnudd.com>

a72f6bdc

wr-servo: trivial: set update_time (for snmp) in p2p mode · c24e0e0e

Alessandro Rubini authored Mar 14, 2017

Signed-off-by: Alessandro Rubini <rubini@gnudd.com>

c24e0e0e

wr-calibration: remove now-unused variable · 9c98c978

Alessandro Rubini authored Mar 13, 2017

Signed-off-by: Alessandro Rubini <rubini@gnudd.com>

9c98c978

wrpc: add faults in timestamps · 0a49d0dd

Alessandro Rubini authored Mar 13, 2017

Signed-off-by: Alessandro Rubini <rubini@gnudd.com>

0a49d0dd

Merge branch 'wrs-5.0.1' · 9ce04d1b

Alessandro Rubini authored Apr 05, 2017

This branch is a single commit which is used in wr-switch-sw, where
ppsi is a submodule.

Let's merge it here to avoid losing the hash.

9ce04d1b

29 Mar, 2017 1 commit

[BUG: 1551] proto-ext-whiterabbit: add a pre-master state to wr fsm table · ffcde2c5

Adam Wujek authored Mar 08, 2017

When two masters are connected to the same link, one of them tries to enter
the pre-master state. However, this state is not compiled in ppsi, so one
node gets stuck there forever. It is necessary to restart a node or re-establish
a link.

BUG introduced by a PPSI's commit:
2996dd7b compliance, 9.2.6.10: properly switch to MASTER or PRE_MASTER
Signed-off-by: Adam Wujek <adam.wujek@cern.ch>

ffcde2c5

20 Mar, 2017 1 commit

arch-wrpc: remove uart-sw and hack to use it · 31c0e23b

Alessandro Rubini authored Mar 17, 2017

It was by me, for me, and I'm not using it. Simplify.
Signed-off-by: Alessandro Rubini <rubini@gnudd.com>

31c0e23b

14 Mar, 2017 1 commit

Kconfig: disable ASSERTs by default · 325a2774

Adam Wujek authored Mar 14, 2017

Signed-off-by: Adam Wujek <adam.wujek@cern.ch>

325a2774

08 Mar, 2017 1 commit

[BUG: 1551] proto-ext-whiterabbit: add a pre-master state to wr fsm table · 6ff21382

Adam Wujek authored Mar 08, 2017

When two masters are connected to the same link, one of them tries to enter
the pre-master state. However, this state is not compiled in ppsi, so one
node gets stuck there forever. It is necessary to restart a node or re-establish
a link.

BUG introduced by a PPSI's commit:
2996dd7b compliance, 9.2.6.10: properly switch to MASTER or PRE_MASTER
Signed-off-by: Adam Wujek <adam.wujek@cern.ch>

6ff21382