ENG-010459
The purpose of this document is to explain how GPON optical protection works and to define the features and capabilities of Tellabs Type B PON protection switching. The GPON system and Type B protection are defined in ITU-T Recommendation G.984.1.
PON - Passive Optical Networking. PON is a point-to-multi-point architecture with passive infrastructure supporting distances of up to 20 km. The infrastructure between the OLT and ONT is all optical and completely passive with no electronics in between the two.
Primary PON - The PON that by default carries all the traffic when no failures exist in the system in revertive PON Protection Groups (QOIU7 only). This link has no special significance for non-revertive PPGs (OIU8), apart from a slight preference for the primary on startup.
Secondary PON - The PON that will carry the traffic in the event of a failure on the Primary PON in revertive PPGs (QOIU7). This link has no special significance for non-revertive PPGs (OIU8).
Active PON - The active PON is the PON which is currently carrying traffic.
Standby PON - The standby PON is the PON which is not carrying traffic and in the normal case is providing standby protection for the Active PON.
Type B PON Protection - Type B PON protection is part of the PON standards. It allows the OLT and the OLT connections to be redundant, providing high-reliability connections over the PON.
Revertive Switching - In revertive switching, there is a preferred PON and when the Primary PON failures clear, the traffic will switch back from the Secondary PON to the Primary PON. In the normal state traffic will always be on the Primary PON. The QOIU7 supports revertive switching only.
Non-Revertive Switching - When non-revertive switching is employed, there is no preference for a particular link. The traffic runs on the Active link until a failure occurs and then switches to the Standby link. It does not switch back when the failure clears on the previously failed link. The OIU8 supports non-revertive switching only.
PPG - PON Protection Group. The Protection Group consists of a primary PON and secondary PON used to protect traffic in the event of an OLT failure.
Intra-OLT PON Protection - Type B PON protection implemented within a single OLT for cost savings. It is not the preferred configuration, as some OLT hardware in the path is not redundant.

Figure 1 Intra OLT Type B protection
Inter-OLT PON Protection - Type B PON protection implemented between two OLTs, which provides full redundancy of all OLT hardware, uplinks and PON. This is the preferred configuration for reliability reasons and is the option recommended by Tellabs.
Figure 2 Inter OLT Type B protection
This document applies to Type B PON protection on the Tellabs 1131, 1134, and 1150 OLTs and ONTs.
Type B protection makes use of a 2:N splitter, which allows two inputs on the OLT side of the circuit (the OLT to splitter side). From the splitter, a single fiber goes to each of the ONTs on the PON. This gives both equipment and facility protection for the OLT side of the PON. With Type B PON Protection, OLT PON fiber path redundancy can be achieved up to the communication closet or zone box (e.g. a 2:32 splitter).
PON Protection interfaces can be:
This allows the user to choose the level of redundancy they require based on a cost/benefit tradeoff, usually driven by the importance of the data flowing over the PON network.
Tellabs supports the G.984.1 Type B PON protection on all of its OLTs and ONTs. This allows the user to fully protect the PON side of the network in a cost effective manner.
Type B protection makes use of a 2:N splitter which allows two inputs on the OLT side of the circuit. There is a single fiber going to each of the ONTs on the PON. This gives both equipment and facility protection for the OLT side of the PON.
As of the 29.0 release, the system can detect the failure of the PON within 500 ms and switch the traffic in under 5 seconds for the whole PON (up to 32 ONTs).
It should be noted that for the QOIU7 the protection switching is revertive, meaning that when the primary link is returned to service, the system will switch back after the wait to restore timeout has expired. The default wait to restore time is 60 seconds. The wait to restore time provides hysteresis and avoids thrashing if the Primary PON is transitioning up and down.
The OIU8 has improved hardware that enables non-revertive switching. When the OIU8 switches due to a PON failure, it will remain on that link as long as it is operational and will not switch back. This is less disruptive than the revertive switching used on the QOIU7 as it does not need to switch back when the primary PON goes back into service.
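The difference between the two behaviors can be illustrated with a small model. This is a minimal sketch, not the Tellabs implementation: the class and field names are hypothetical, and the only values taken from the text are the 60 second default wait to restore time and the rule that only revertive groups switch back.

```python
from dataclasses import dataclass


@dataclass
class ProtectionGroup:
    """Toy model of a PPG; names are illustrative, not Tellabs identifiers."""
    revertive: bool                   # True models QOIU7, False models OIU8
    wait_to_restore_s: float = 60.0   # default wait to restore time from the text
    active: str = "primary"
    primary_ok: bool = True
    secondary_ok: bool = True
    primary_ok_since: float = 0.0     # time at which the primary last became healthy

    def set_health(self, pon: str, ok: bool, now: float) -> None:
        if pon == "primary":
            if ok and not self.primary_ok:
                # Restart the wait to restore timer on every recovery.
                self.primary_ok_since = now
            self.primary_ok = ok
        else:
            self.secondary_ok = ok
        self.evaluate(now)

    def evaluate(self, now: float) -> None:
        # Fail over when the active PON is down and the other PON is healthy.
        if self.active == "primary" and not self.primary_ok and self.secondary_ok:
            self.active = "secondary"
        elif self.active == "secondary" and not self.secondary_ok and self.primary_ok:
            self.active = "primary"
        # Revertive groups return to the primary only after it has been
        # stable for the full wait to restore time.
        elif (self.revertive and self.active == "secondary" and self.primary_ok
              and now - self.primary_ok_since >= self.wait_to_restore_s):
            self.active = "primary"
```

Because the timer restarts on every recovery, a primary that flaps up and down never accumulates 60 stable seconds, which is the hysteresis effect described above.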
When a PON protection switch occurs, the system must flush the MAC tables for proper operation. MAC address relearning occurs upon the next message sent upstream on the protected PON. In the case of an intra-OLT PPG, the ESU cards of the OLT relearn the MAC addresses of the devices attached to the protected ONTs as coming from a different slot. In most cases this relearning is almost immediate, because the devices connected to the ONTs keep sending packets upstream, unaware that a change has occurred. This has the benefit of reducing the need for flooding. Any upstream switches are unaware of the change on the OLT. In the case of inter-OLT PPG switching, the same process occurs in the OLT, but the upstream switches must also learn the new path to the client device since it is now on a different OLT. For this reason, in an environment with redundant PONs the OLTs are typically connected to the core network via HSRP or VRRP, protocols that are designed for immediate response to path updates.
So, this behavior is normal and to be expected during a PON Protection Switch. Some flooding will occur downstream until the next message is sent upstream on the PON and the MAC is relearned.
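The flush, flood and relearn sequence follows standard learning-bridge behavior and can be sketched generically. The class below is a hypothetical illustration, not OLT code; the port names stand in for slots or PONs.

```python
class MacTable:
    """Minimal learning bridge table: MAC address -> port (e.g. a slot or PON)."""

    def __init__(self) -> None:
        self.entries: dict[str, str] = {}

    def learn(self, src_mac: str, ingress_port: str) -> None:
        # Learning happens on upstream frames from devices behind the ONTs.
        self.entries[src_mac] = ingress_port

    def flush(self) -> None:
        # Performed on a protection switch so that stale ports are not used.
        self.entries.clear()

    def egress_ports(self, dst_mac: str, all_ports: list[str]) -> list[str]:
        # Unknown destinations are flooded until the address is relearned.
        port = self.entries.get(dst_mac)
        return [port] if port is not None else list(all_ports)
```

After `flush()`, a downstream frame to a known device is flooded to every port until that device sends its next upstream frame and `learn()` records the new port.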
The Tellabs implementation is able to detect hard failures of the PON and failover even if the two PONs within the protection group have no communication. If all ONTs on a PON stop communicating, the PON link will fail over to the secondary link.

While no messaging is required for PON protection switches to occur, it is beneficial to communicate status information from the primary to the secondary PON to speed up switches and minimize service disruption. Outages of up to 1 minute can occur if no sync channel is present and the system blindly switches as all services must be reconfigured. With the sync channel present and sufficient time to sync, the two sides will have exactly the same configuration and switches will be in the 1-5 second range.
For example, if a port has been authenticated via 802.1x, the status can be shared with the protection PON rather than forcing the 802.1x supplicant to re-authenticate when the PON protection switch occurs. To do so, Tellabs has implemented a status update protocol between the primary and secondary PON. The status information is sent periodically, every few seconds, to enable rapid protection switches. It is meant strictly to minimize the traffic disruptions in various scenarios. 802.1x and MAB can lengthen the time needed to sync the Primary and Secondary configurations, as a good deal of status information is exchanged.
In the case of Intra-OLT protection, the sync channel is configured by the EMS behind the scenes and requires no user intervention. In the case of Inter-OLT protection, the user will be prompted to configure the Sync Channel ID.

The Sync Channel VLAN ID configuration defines the VLAN the Active and Standby PON will use to communicate. The Sync Channel VLAN ID will be used to exchange layer 2 Ethernet frames containing proprietary Tellabs messages used to synchronize state. The VLAN used must be configured in the VLAN table and uplink NET interfaces on BOTH OLTs so that the protection group members can communicate. The user only needs to specify the Sync Channel VLAN when it is an Inter-OLT protection group.
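The requirement that the sync channel VLAN exist in the VLAN table and on the uplink NET interfaces of both OLTs lends itself to a simple pre-check. The sketch below assumes the relevant configuration has already been exported into plain Python sets; the `OltConfig` structure and its field names are hypothetical and do not correspond to an EMS API.

```python
from dataclasses import dataclass, field


@dataclass
class OltConfig:
    """Hypothetical snapshot of the parts of an OLT config that matter here."""
    tid: str
    vlan_table: set[int] = field(default_factory=set)
    uplink_vlans: set[int] = field(default_factory=set)  # VLANs on uplink NET interfaces


def sync_vlan_problems(vlan_id: int, olt_a: OltConfig, olt_b: OltConfig) -> list[str]:
    """Return readable problems; an empty list means both OLTs look consistent."""
    problems = []
    for olt in (olt_a, olt_b):
        if vlan_id not in olt.vlan_table:
            problems.append(f"{olt.tid}: VLAN {vlan_id} missing from VLAN table")
        if vlan_id not in olt.uplink_vlans:
            problems.append(f"{olt.tid}: VLAN {vlan_id} missing from uplink NET interfaces")
    return problems
```

A check like this only covers the two OLTs. Layer 2 connectivity through the switches between them still has to be verified separately.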
After a switch the system will have a standing "PPG Sync Mismatch" alarm indicating that the two PPG PON ports are syncing status. Once sync is complete the alarm will clear. PPGs should not be switched with a standing PPG Sync Mismatch alarm, as it will always result in a slow switch. If a "PPG Sync Mismatch" alarm remains standing for long periods of time, it may indicate a problem communicating on the sync channel VLAN between the two OLTs. If this occurs, verify that the sync channel VLAN is set up properly throughout the network between the two OLTs.
To keep protection operations under 5 seconds, it is recommended that Protected PONs be paired so that the pairs are striped across multiple OIUs. This minimizes the time needed to recover the link by sharing the load across 4 OIU processors. The figure below illustrates an example with a QOIU7, but the scheme also applies to the OIU8.
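As a quick decision aid before a planned switch, the conditions described in this section can be reduced to a small helper. This is only a sketch using the figures quoted in the text (up to 1 minute without sync, 1-5 seconds with sync); the function name and return strings are illustrative.

```python
def expected_switch_time(sync_channel_present: bool, sync_mismatch_alarm: bool) -> str:
    """Classify the expected outage using only the figures quoted in the text."""
    if not sync_channel_present or sync_mismatch_alarm:
        # Without completed sync, services must be reconfigured after the switch.
        return "slow: outage of up to 1 minute"
    return "fast: roughly 1-5 seconds"
```

For example, `expected_switch_time(True, True)` reports a slow switch, which matches the guidance not to switch a PPG while the alarm is standing.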
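One way to reason about striping is to count how many standby PONs each card would have to recover. The sketch below assigns standby cards in round-robin order; this is an assumed scheme for illustration only, with made-up card and PON names, and it is not a reproduction of the pairing shown in the figure.

```python
from collections import Counter
from itertools import cycle


def stripe_standby(primaries: list[str], standby_cards: list[str]) -> dict[str, str]:
    """Assign each primary PON a standby card in round-robin order."""
    cards = cycle(standby_cards)
    return {pon: next(cards) for pon in primaries}


def standby_load(assignment: dict[str, str]) -> Counter:
    """Count how many standby PONs each card would have to recover."""
    return Counter(assignment.values())
```

With eight primaries and four standby cards, each card ends up protecting two PONs instead of one card protecting all eight.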

There are two key points associated with the optical budget of a protected PON network:
This 8 dB must be planned for within the optical budget if the system being deployed uses the QOIU7: 3 dB for the losses of the 2:N splitter plus 5 dB for the pad. The OIU8 uses a slightly different hardware algorithm and does NOT require the 5 dB pad; it only needs to accommodate the 3 dB losses of the 2:N splitter.
When using the QOIU7, the goal should be an optical loss of 22-23 dB or less on the Primary PON, so that with the addition of the 5 dB pad on QOIU7 cards the system stays below the recommended 28 dB upper limit while on protect.
Since the launch power of lasers can vary, it is very important on QOIU7-based systems to measure the actual laser outputs and confirm that there is a 5 dB differential in the signal levels.
The typical way this is accomplished is via insertion of a 5dB pad in the Standby PON optical path at the input to the 2nd splitter port. Ensure that the correct attenuator type is utilized (UPC if inserted at the OLT, APC if inserted at the splitter). The color of the attenuator stripe can be used to tell the connector type.
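The budget arithmetic in this section can be written out as a short check. The sketch treats the 22-23 dB and 28 dB figures as path loss values and treats the 28 dB limit as inclusive; both are assumptions about the intended meaning. It also assumes the Primary PON loss figure already includes the 2:N splitter. Function and constant names are hypothetical.

```python
SPLITTER_2N_DB = 3.0   # loss attributed to the 2:N splitter in the text
QOIU7_PAD_DB = 5.0     # attenuator in the Standby path (QOIU7 only)
MAX_LOSS_DB = 28.0     # recommended upper limit quoted in the text


def protection_overhead_db(card: str) -> float:
    """Budget to plan for protection: 8 dB on QOIU7, 3 dB on OIU8."""
    return SPLITTER_2N_DB + (QOIU7_PAD_DB if card == "QOIU7" else 0.0)


def standby_loss_db(primary_loss_db: float, card: str) -> float:
    """Loss on the Standby path, assuming primary_loss_db includes the splitter."""
    return primary_loss_db + (QOIU7_PAD_DB if card == "QOIU7" else 0.0)


def within_budget(primary_loss_db: float, card: str) -> bool:
    """True if the Standby path stays at or under the 28 dB limit (assumed inclusive)."""
    return standby_loss_db(primary_loss_db, card) <= MAX_LOSS_DB
```

A QOIU7 PON measured at 23 dB on the primary lands at exactly 28 dB on protect, while 24 dB would exceed the limit. The same 24 dB on an OIU8 needs no pad and stays within budget.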

It is critical that PON Protection be tested prior to adding any service to protected PONs. A sample ONT should be placed onto the PON and traffic verified on both the Primary PON and the Secondary PON. This is necessary to verify that the PPG configuration is correct, the Primary and Secondary PON are cabled correctly, and that both chassis have access to the uplink network. Failure to do this can result in failures that are not discovered until the first protection event. If the cabling and configuration of the secondary link do not agree, the PPG will fail to the wrong PON and likely all services will be lost until repairs can be made, or the configuration corrected. See the section on striping to ensure the best failover performance when multiple PONs are impacted.
During Upgrades, System Maintenance and other scenarios, it is often useful to be able to force the protection over to the Secondary Link. This can be done by disabling the Primary PON from the Ports View->PON Tab. Setting the context will show all relevant PONs within that context. Use the drop-down to set the PON to either Enabled or Disabled.

During Upgrades, System Maintenance and other scenarios, PPGs should be used to minimize outage times during the time period of software switches. The general outline of a normal upgrade should be:
PON Protection is configured by clicking on the PON protection button at the top of the GUI (the shield icon). Buttons exist to create, edit and delete PON Protection Groups. The Create button allows the creation of a new protection group. The menu for creating PON Protection groups can be reached in one of two ways:

For QOIU7 PPGs the prompts will be as follows:

Protection Group Creation
The Add button allows the creation of a new protection group.
Protection Group Name: Name given to the protection group.
Admin State: Enables or Disables the protection group from operation.
Protection Type: The only supported type for protection is Type B 2:N splitter protection. This type uses a 2:N splitter to allow a protection PON on the OLT side.
Sync Channel ID: Defines the VLAN used for synchronization of state between the primary and secondary PON of the protection group. While this channel is not required, it speeds up protection actions by synchronizing state information between the Primary and Secondary PONs. This Sync channel only needs to be configured for inter-OLT protection and will be auto-configured by the EMS for intra-OLT PPGs.
Wait to Restore Time: As noted before, the wait to restore time defines the time the Primary and Secondary must be active prior to switching back to the primary. This provides hysteresis for the switching events to ensure the primary is stable, available and ready to take over.
OLT TID/PON AID: These two fields define the PON ports that are connected to the 2:N splitter. Any two PONs of compatible types can be used. The only limitations are that the OLTs for both ports must be managed by the same EMS, the PONs must be of the same type (XGS-PON or GPON), and both must be on the same card type (QOIU7 and OIU8 cannot be mixed). The Primary and Secondary PON can be on the same OLT or on two different OLTs. If available, using a second OLT allows for full equipment and facility redundancy and is the best protection option. The PON AIDs offered are limited to PONs that are configured the same way and are on the same card type.
Primary: Defines which of the two selected PONs is the primary PON that will carry the traffic under normal conditions.
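The pairing limitations listed for the OLT TID/PON AID fields can be expressed as a validation routine. This is a minimal sketch with a hypothetical `PonPort` record; the EMS enforces these rules itself, so the code only illustrates the logic.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PonPort:
    """Hypothetical description of one PON port; not an EMS data type."""
    ems: str
    olt_tid: str
    pon_aid: str
    pon_type: str    # e.g. "GPON" or "XGS-PON"
    card_type: str   # e.g. "QOIU7" or "OIU8"


def pairing_errors(primary: PonPort, secondary: PonPort) -> list[str]:
    """Return the pairing rules from the text that the two ports violate."""
    errors = []
    if primary.ems != secondary.ems:
        errors.append("both OLTs must be managed by the same EMS")
    if primary.pon_type != secondary.pon_type:
        errors.append("PON types must match")
    if primary.card_type != secondary.card_type:
        errors.append("card types must match (QOIU7 and OIU8 cannot be mixed)")
    return errors


def is_inter_olt(primary: PonPort, secondary: PonPort) -> bool:
    """Inter-OLT groups are the ones that need a Sync Channel ID from the user."""
    return primary.olt_tid != secondary.olt_tid
```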
The EMS will copy any configuration on the Primary PON to the secondary PON ONTs automatically so they do not need to be managed separately. All changes must be made on the Primary PON ONTs.
Note also that the secondary PON for QOIU7 units should have a signal level 5 dB lower than the primary. This is often accomplished using an SC-style single-mode fiber optic attenuator inserted into the optical path prior to the splitter. This ensures that the Primary PON always wins at startup and is selected as the Active PON. This is not needed for OIU8 cards.
Once the PPG has been created and the secondary port enabled, the PPG configuration should be verified as follows:
1. Verify the ONTs can be seen on the Primary PON then pull the primary fiber or disable the primary PON port from the EMS.
2. Verify that all ONTs appear on the secondary PON. There should be ONT-OLT Link LOS alarms indicated for the ONTs previously active on the primary PON. The secondary PON should have an alarm indicating PON on Protect.
3. If traffic can be verified, do so. Traffic should restore during the switch in less than 5 seconds.
4. After a few minutes, switch the PON back by performing the reverse of step 1.
5. Verify traffic is restored and that alarms clear. In both cases, the switch should have taken less than 5 seconds.
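The pass criteria from the steps above can be recorded and checked in a consistent way. The sketch below assumes the observations are gathered manually or by an external test tool; the `SwitchObservation` fields are hypothetical and nothing here queries the EMS.

```python
from dataclasses import dataclass


@dataclass
class SwitchObservation:
    """Results noted during one protection switch test (all fields hypothetical)."""
    onts_on_primary_before: set[str]
    onts_on_secondary_after: set[str]
    outage_s: float              # measured traffic interruption
    pon_on_protect_alarm: bool   # alarm seen on the secondary PON


def verify_switch(obs: SwitchObservation, max_outage_s: float = 5.0) -> list[str]:
    """Return the checks that failed; an empty list means the switch passed."""
    failures = []
    missing = obs.onts_on_primary_before - obs.onts_on_secondary_after
    if missing:
        failures.append("ONTs missing on secondary: " + ", ".join(sorted(missing)))
    if obs.outage_s >= max_outage_s:
        failures.append(f"traffic outage {obs.outage_s:.1f} s is not under {max_outage_s:.1f} s")
    if not obs.pon_on_protect_alarm:
        failures.append("PON on Protect alarm not raised on secondary")
    return failures
```

The same check can be reused for the switch back in step 4 by swapping which PON is treated as the starting point.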
It should also be noted that the OLT will suppress alarms on the Standby PON, such that generally speaking, all the alarms will only occur on the Active PON.
For best protection switching performance, it is recommended to spread the standby PON ports among QOIU7s. See the section on PON striping.

Sync Channel VLAN
The system also supports Inter-OLT protection, where the Primary PON is on one OLT and the Secondary PON is on a different OLT. Both OLTs must be managed by the same EMS. When configuring Inter-OLT protection, you are prompted to configure a Sync Channel VLAN. Protection switching can occur without this VLAN, but it speeds up protection switching when a failure occurs by allowing the two sides to exchange synchronization and state information. Any Data VLAN that is present on the Primary PON can be used for this purpose as long as there is layer 2 connectivity between the two OLTs. The protection messages are Ethernet frames but are NOT IP and cannot cross a routed network.
The VLAN must be configured in the VLAN properties table as a dynamic VLAN and full bridged.
It should also be noted that when Intra OLT protection is used, the system picks the VLAN and the sync channel option will not be available.
The system also includes a PON PPG status screen, which shows the current status of the protection groups. It is accessed via the purple shield icon on the Panorama button bar.
This display shows the protection group status for all PPGs managed by the EMS and provides the ability to create, delete, or modify PPGs. The reload button on the left of the GUI dialog re-reads and updates the status from the OLTs.

The status display gives the full status of the PPG via PON-A Status (Primary PON) and PON-B Status (Secondary PON). Each status will have one of the following states:
PPGs can be shown in multiple colors on the PON Status screen:
The PONs are also shown in the common tree with the PPG status information:

The common tree display for PPGs can be interpreted as follows:

Introducing a PPG also means that you must be aware of the PPG when retrieving the status of an ONT or an ONT's UNI ports. The following rules apply.
The Primary PON, which should be used for all configuration actions, can be identified by its coloring in the GUI. The Primary PON is colored green; only configure PPG PONs if they are green.
When retrieving status, ONLY the Active PON should be used. This is the PPG PON that has a check in the box that is shown prior to the PPG name.
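These two rules are easy to mix up when scripting against exported inventory data, so a small helper can make the choice explicit. The `PpgView` record and the action names are hypothetical and only illustrate the rule.

```python
from dataclasses import dataclass


@dataclass
class PpgView:
    """Hypothetical summary of one PPG as shown in the GUI."""
    primary: str     # PON shown in green
    secondary: str
    active: str      # PON with the check box ticked


def pon_for(action: str, ppg: PpgView) -> str:
    """Pick the PON to use: Primary for configuration, Active for status."""
    if action == "configure":
        return ppg.primary
    if action == "status":
        return ppg.active
    raise ValueError(f"unknown action: {action}")
```

While a group is on protect, the two answers differ: configuration still targets the Primary PON, but status must be read from the Secondary PON because it is the Active one.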


Simply described, the Tellabs OLAN PON protection mechanism allows for a protection PON port to detect when the primary port is no longer communicating with the ONTs and, upon such detection, take over control of the PON.
PON Protection does not protect individual ONT fibers since there is a single fiber going to the ONT. Failures or breaks in this fiber cannot be protected with type B PON Protection.
Tellabs has also added PON protection mechanisms for handling some use cases where the Uplink fails in such a way as to isolate the OLT actively carrying traffic. In addition, Tellabs has designed failover software to detect most hardware failures on the board and fail over when they occur. Both use cases are detailed below. At this time Tellabs is the only PON provider we are aware of with these features.
Tellabs has implemented an enhancement to PON Protection to couple it to the spanning tree which is available as a part of SR30.1 and above. This feature enables the OLT to force a switch of the PON protection to the Standby link when the network access to the OLT is blocked on the OLT uplinks.
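The core decision behind this feature can be sketched as follows. This is a simplified illustration only: the function name is hypothetical, the uplink states use standard spanning tree port state names, and treating an empty uplink list as "do not switch" is an assumption. The actual SR30.1 logic covers more conditions than shown here.

```python
def should_force_switch(path_protection_enabled: bool, uplink_states: list[str]) -> bool:
    """True when path protection is on and no uplink of the Active OLT is forwarding."""
    if not path_protection_enabled or not uplink_states:
        # Assumption: with no uplink information, leave the PPG alone.
        return False
    return all(state != "forwarding" for state in uplink_states)
```

A single forwarding uplink is enough to keep traffic on the current Active PON, since the OLT still has network access.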
The new features allow for addressing protection under certain network conditions when in an inter-OLT PON Protection scheme:
The added logic is a huge improvement in the use cases that are addressed. There are still scenarios that may arise that do not result in the desired behavior:
The PON Path Protection configuration has been added to the PON Profile settings. It can be enabled by checking the box shown below, "Enable Path Protection".

The PPG software has been updated in SR30.1 and above to detect failures of all major components on the card and switch the traffic to the secondary link when a hardware failure is detected. The system will attempt to preserve unprotected links but, in some use cases, will reboot the PON line card. Some of the conditions detected are: