
The Current Operation of the DCI Network (Part One)

Introducing OTN technology into a DCI network adds a whole area of operational work that did not exist before. The traditional data center network is an IP network, which is a logical network technology. OTN in DCI is a physical-layer technology, and getting it to work with the IP layer in a friendly and convenient way is a long road for operations.

The purpose of OTN-based operation is the same as that of every other data center subsystem: to maximize the effectiveness of the resources invested in high-cost infrastructure and to provide the best support for upstream services. This means improving the stability of the underlying system, making operation and maintenance work efficient, and assisting in the rational allocation of resources, so that resources already invested deliver more value and resources not yet invested are allocated sensibly.

The operation of OTN mainly involves several parts: operation data management, asset management, configuration management, alarm management, performance management, and DCN management.

1 Operation Data

Compile statistics on fault data, distinguishing human, hardware, software, and third-party faults. Analyze the high-frequency fault types and formulate targeted handling plans, which paves the way for automated fault handling once handling has been standardized. The results of fault analysis also feed back into future work such as architecture design and equipment selection, reducing the cost of later operation and maintenance. For OTN, collect fault statistics for optical amplifiers, boards, modules, multiplexers, cross-device jumpers, trunk fibers, the DCN, and so on, and add dimensions such as manufacturer and third party. Multi-dimensional analysis yields more accurate data that reflects the actual state of the network.
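The multi-dimensional fault statistics described above can be sketched in a few lines of Python. This is only an illustration: the record fields (cause, component, vendor), the category names, and the sample data are assumptions made for the example, not an actual fault database schema.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class FaultRecord:
    # Hypothetical record layout, assumed for this sketch.
    cause: str      # "human", "hardware", "software", or "third_party"
    component: str  # e.g. "optical_amplifier", "board", "module", "trunk_fiber"
    vendor: str     # manufacturer or third party involved


def summarize_faults(records):
    """Count faults along each dimension and return one Counter per dimension."""
    return {
        "by_cause": Counter(r.cause for r in records),
        "by_component": Counter(r.component for r in records),
        "by_vendor": Counter(r.vendor for r in records),
    }


# Made-up sample data, for illustration only.
sample = [
    FaultRecord("hardware", "optical_amplifier", "vendor_a"),
    FaultRecord("hardware", "module", "vendor_b"),
    FaultRecord("human", "cross_device_jumper", "vendor_a"),
    FaultRecord("third_party", "trunk_fiber", "carrier_x"),
    FaultRecord("hardware", "module", "vendor_b"),
]

summary = summarize_faults(sample)

# The most frequent component is the first candidate for a targeted handling plan.
top_component, top_count = summary["by_component"].most_common(1)[0]
print(top_component, top_count)
```

With real data, the same counters could be cross-tabulated (for example component by vendor) to find which combinations fail most often.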


Compile statistics on change data, distinguishing changes by complexity and impact, and allocate personnel accordingly. Changes follow the process of demand analysis, change plan, window setting, user notification, execution, and summary review. Eventually, different changes can be assigned to different windows, and some can even be scheduled during the day, so that change personnel are allocated more reasonably, the pressure on work and life is reduced, and operations engineers are happier. The resulting statistics can also serve as a reference for personnel efficiency and ability. At the same time, this lets routine changes develop toward standardization and automation, reducing various expenses.

Collect statistics on OTN service distribution to keep track of network usage and to stay in control of network-wide channel and service distribution as business volume grows. At a coarse level, you can know which network service a single channel is carrying, such as the external network, intranet, HPC network, or cloud service network. At a fine level, you can combine this with a full-flow traffic analysis system to analyze the usage of specific business traffic. Bandwidth costs can then be apportioned to the different business departments, helping them optimize their traffic, and low-usage working channels can be reclaimed or adjusted at any time while high-usage channels are expanded.
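As a rough sketch of the apportioning and channel review described above, the following Python splits a total bandwidth cost across departments in proportion to the traffic they carried, and flags channels for reclaiming or expansion. The function names, the proportional-split rule, and the 20% / 80% utilization thresholds are assumptions chosen for illustration; a real deployment would set its own policy.

```python
def apportion_cost(total_cost, channel_usage):
    """Split total_cost across departments in proportion to carried traffic.

    channel_usage: list of (department, traffic) pairs, one per channel.
    Traffic may be in any unit as long as it is consistent.
    """
    totals = {}
    for department, traffic in channel_usage:
        totals[department] = totals.get(department, 0.0) + traffic
    grand_total = sum(totals.values())
    if grand_total == 0:
        return {department: 0.0 for department in totals}
    return {d: total_cost * t / grand_total for d, t in totals.items()}


def classify_channels(utilization, low=0.2, high=0.8):
    """Return (reclaim, expand) channel lists.

    utilization: dict of channel name -> utilization between 0 and 1.
    The low/high thresholds are illustrative assumptions.
    """
    reclaim = sorted(ch for ch, u in utilization.items() if u < low)
    expand = sorted(ch for ch, u in utilization.items() if u > high)
    return reclaim, expand


# Made-up sample data, for illustration only.
usage = [("external", 300.0), ("hpc", 100.0), ("cloud", 500.0), ("external", 100.0)]
shares = apportion_cost(1000.0, usage)
reclaim, expand = classify_channels({"ch1": 0.1, "ch2": 0.5, "ch3": 0.9})
print(shares)
print(reclaim, expand)
```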

Compile statistics on stability data. This is the main reference data for the SLA, and also the sword of Damocles hanging over every operation and maintenance engineer. Because OTN has its own protection mechanisms, its stability statistics need to make distinctions. For example: if a single route is interrupted but the total bandwidth at the IP layer is not affected, does it count against the SLA? If the IP bandwidth is halved but the business is not affected, does it count? Does a single channel failure count? If switching to the protection path increases delay, which does not affect network bandwidth but does affect the business, does it count? And so on. The general practice is to inform the business side of risks such as jitter and delay changes before construction. Afterwards, the SLA is calculated as the number of faulty channels multiplied by the bandwidth of a single faulty channel, divided by the total number of channels multiplied by the corresponding channel bandwidth, and then multiplied by the impact time. The resulting value is used as the basis for the SLA calculation.
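The calculation above can be read as a bandwidth-weighted impact time. The Python below is one possible interpretation, not a definitive formula: it sums the bandwidth of the faulty channels, divides by the total bandwidth of all channels, and multiplies by the impact time. The function names are hypothetical, and the final conversion into an availability ratio over a reporting period is an added assumption that the text does not spell out.

```python
def weighted_impact_minutes(faulty_bandwidths, all_bandwidths, impact_minutes):
    """Bandwidth-weighted impact time.

    faulty_bandwidths: bandwidth of each faulty channel (e.g. in Gbit/s).
    all_bandwidths: bandwidth of every channel on the link, same unit.
    impact_minutes: how long the fault lasted.
    """
    total = sum(all_bandwidths)
    if total <= 0:
        raise ValueError("total channel bandwidth must be positive")
    return sum(faulty_bandwidths) * impact_minutes / total


def availability(impact_minutes, period_minutes):
    """Assumed conversion: share of the period not lost to weighted impact."""
    return 1.0 - impact_minutes / period_minutes


# Made-up example: 40 channels of 100G, 2 of them down for 30 minutes.
impact = weighted_impact_minutes([100.0, 100.0], [100.0] * 40, 30.0)
monthly = availability(impact, 30 * 24 * 60)  # 30-day reporting period
print(impact, monthly)
```

In this example, losing 2 of 40 equal channels for 30 minutes counts the same as losing the whole link for 1.5 minutes.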

2 Asset Management

OTN equipment assets also need lifecycle management (arrival, going online, scrapping, fault handling), but unlike servers, network switches, and similar equipment, OTN equipment has a more complex structure. It involves a large number of functional boards, so a model for managing the full set of assets has to be designed. The main IP asset management platform in the data center is built around servers and switches and defines a master-slave device hierarchy. OTN asset management extends this into hierarchical management with more levels. The hierarchy is mainly network element -> subrack -> board -> module:

2.1. The network element (NE) is a virtual device with no physical counterpart. It is used for management, is the first logical point in the OTN network, and is the first-level unit in OTN network management. A physical equipment room may have one NE or several. An NE contains multiple subracks, such as optical-layer subracks and electrical-layer subracks; external multiplexers and demultiplexers are also treated as subracks. Subracks can be cascaded, and each is numbered within its network element. In addition, the NE has no asset SN, so this must be aligned with the management platform, especially with the purchase list and the later operation and maintenance platform, to avoid mismatches during asset audits. After all, the network element is a virtual asset.

2.2. The largest physical unit of OTN equipment is the chassis, that is, the subrack. It sits directly under the first-level network element and is the second-level unit; a network element has at least one subrack. Subracks come in different models from different manufacturers and have different functions, including electrical subracks, optical subracks, general-purpose subracks, and so on. A subrack has its own SN, but the SN cannot be obtained automatically through the network management platform and can only be checked on site. Subracks are rarely moved or changed after they go online. Various boards are inserted into the subrack.

2.3. Inside the second-level subrack there are numbered service slots, into which the various service boards of the optical network are inserted. These boards are the basis for supporting OTN services, and each board's SN can be queried through the network management system. Boards are the third-level units in OTN asset management. Service boards differ in size, in the number of slots they occupy, and in function. Therefore, when a board is assigned to a subrack, the asset platform must allow a single board to occupy multiple slots or half a slot, and must map it to the slot numbers on the subrack.

2.4. Optical module asset management. Modules depend on service boards. The asset platform must allow every service board to own optical modules, but not every OTN board has optical modules plugged in, so a board with no modules must also be allowed. Each optical module has an SN, and a module inserted in a board must be mapped to the board's port number so that it can be located easily.
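The four-level hierarchy in 2.1 to 2.4 can be modeled roughly as follows. This is a minimal Python sketch under stated assumptions: the class and field names, the string slot identifiers (which would let a half slot be labeled, for example, "7a"), and the location format are all hypothetical and do not come from any real asset platform.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass
class OpticalModule:
    sn: str


@dataclass
class Board:
    sn: str
    model: str
    slots: Tuple[str, ...]  # one or more slot IDs; string IDs allow half slots
    # Port number -> module, or None for an empty port. May be empty,
    # because not every board carries optical modules.
    ports: Dict[int, Optional[OpticalModule]] = field(default_factory=dict)


@dataclass
class Subrack:
    sn: str      # checked on site; not readable from the network management platform
    number: int  # numbering within the network element
    kind: str    # e.g. "optical", "electrical", "general"
    boards: List[Board] = field(default_factory=list)

    def insert_board(self, board):
        """Add a board, refusing slot IDs that are already occupied."""
        used = {s for b in self.boards for s in b.slots}
        clash = used.intersection(board.slots)
        if clash:
            raise ValueError(f"slots already occupied: {sorted(clash)}")
        self.boards.append(board)


@dataclass
class NetworkElement:
    name: str  # virtual asset: deliberately has no SN field
    subracks: List[Subrack] = field(default_factory=list)


def inventory(ne):
    """Yield (location, sn) for every physical asset under a network element."""
    for subrack in ne.subracks:
        subrack_loc = f"{ne.name}/subrack{subrack.number}"
        yield subrack_loc, subrack.sn
        for board in subrack.boards:
            board_loc = f"{subrack_loc}/slot{'+'.join(board.slots)}"
            yield board_loc, board.sn
            for port, module in sorted(board.ports.items()):
                if module is not None:
                    yield f"{board_loc}/port{port}", module.sn


# Made-up example data, for illustration only.
ne = NetworkElement(name="NE-1")
optical = Subrack(sn="SR0001", number=1, kind="optical")
ne.subracks.append(optical)
optical.insert_board(Board(sn="BD0001", model="amplifier", slots=("1",)))
optical.insert_board(Board(sn="BD0002", model="line_card", slots=("2", "3"),
                           ports={1: OpticalModule(sn="OM0001"), 2: None}))

for location, sn in inventory(ne):
    print(location, sn)
```

Keeping the SN off the network element and making the port map optional mirrors the two constraints stated above: the NE is a virtual asset, and a board with no modules is valid.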

All of this information can be collected through the northbound interface of the network management platform, and its accuracy can be maintained by combining online collection with offline verification and matching. OTN equipment also involves optical attenuators, short jumpers, and the like; these can be managed directly as consumables.

 


Post time: Dec-12-2022