SG7000 Emergency Maintenance (V200R005C02 - 02)
SG7000 Emergency Maintenance (V200R005C02 - 02)
V200R005C02
Emergency Maintenance
Issue 02
Date 2009-12-14
and other Huawei trademarks are trademarks of Huawei Technologies Co., Ltd.
All other trademarks and trade names mentioned in this document are the property of their respective holders.
Notice
The purchased products, services and features are stipulated by the contract made between Huawei and the
customer. All or part of the products, services and features described in this document may not be within the
purchase scope or the usage scope. Unless otherwise specified in the contract, all statements, information,
and recommendations in this document are provided "AS IS" without warranties, guarantees or representations
of any kind, either express or implied.
The information in this document is subject to change without notice. Every effort has been made in the
preparation of this document to ensure accuracy of the contents, but all statements, information, and
recommendations in this document do not constitute the warranty of any kind, express or implied.
Website: http://www.huawei.com
Email: [email protected]
Purpose
This section describes the organization, related versions, intended audiences, conventions and
update history of U-SYS SG7000 Signaling Gateway Maintenance Manual – Emergency
Maintenance.
Related Versions
The following table lists the product versions related to this document.
Intended Audience
The intended audiences of this document are:
l On-site engineers
l Maintenance engineers
l Commissioning engineers
Organization
This document consists of four chapters and three appendixes, and they are organized as follows.
Chapter Description
Chapter Description
5 Record Tables of Gives some samples for recording the emergency maintenance
Emergency operations.
Maintenance
6 Power Supply System Introduces the power supplies used in the SG7000.
of Cabinet
7 Acronyms and Lists the abbreviations and acronyms used in this manual and
Abbreviations the corresponding full expressions.
Conventions
Symbol Conventions
The following symbols may be found in this document. They are defined as follows.
Symbol Description
General Conventions
Convention Description
Convention Description
Boldface Names of files, directories, folders, and users are in boldface. For
example, log in as user root.
Command Conventions
Convention Description
GUI Conventions
Convention Description
Boldface Buttons, menus, parameters, tabs, window, and dialog titles are in
boldface. For example, click OK.
> Multi-level menus are in boldface and separated by the ">" signs. For
example, choose File > Create > Folder.
Keyboard Operation
Format Description
Key Press the key. For example, press Enter and press Tab.
Key 1+Key 2 Press the keys concurrently. For example, pressingCtrl+Alt+A means the
three keys should be pressed concurrently.
Format Description
Key 1, Key 2 Press the keys in turn. For example, pressing Alt, A means the two keys
should be pressed in turn.
Mouse Operation
Action Description
Click Select and release the primary mouse button without moving the pointer.
Double-click Press the primary mouse button twice continuously and quickly without
moving the pointer.
Drag Press and hold the primary mouse button and move the pointer to a certain
position.
Update History
Updates between document versions are cumulative. Therefore, the latest document version
contains all updates made to previous versions.
Contents
3 System Restart.............................................................................................................................3-1
3.1 Overview.........................................................................................................................................................3-2
3.1.1 Application Scope..................................................................................................................................3-2
3.1.2 Device Preparations................................................................................................................................3-2
3.1.3 Tool Preparations ..................................................................................................................................3-2
3.2 Restart Procedures...........................................................................................................................................3-3
Figures
Tables
This chapter describes the definition and classification of emergency faults. Emergency
maintenance refers to series of emergency measures used to minimize loss, remove the fault,
and recover the system or equipment when emergent faults occur.
1.1 Definitions of Emergent Faults
This section describes the definitions of emergency faults. Based on generation reasons, faults
can be classified into the equipment fault and the extremely heavy traffic fault.
1.2 Basic Principles of Emergency Maintenance
This section describes basic principles of emergency maintenance. The emergency faults are
dangerous and may cause call congestion and equipment breakdown, so you must follow the
basic principles during the maintenance.
In the default configuration, the main control cabinet refers to cabinet 0, the master service frame refers to frame
0 of the main control cabinet, and the slave service frame refers to all the service frames excluding the master
service frame.
BAM Breakdown
BAM breakdown is a severe fault resulting in the loss of all BAM functions. In actual
applications, any of the following conditions is regarded as a kind of BAM breakdown:
l Power failure of the BAM.
l Unable to start the BAM.
l Automatic and frequent resetting of the BAM.
l The CPU utilization of the BAM is near 100% for a long time.
l The Windows operating system of the BAM breaks down during the operation or it is unable
to boot when the BAM starts.
l Abnormal running of the "BAM Manager". For example, unable to start the BAM Manager,
unable to start all the service processes (their states are "Stopped"), or all the service
processes enter the "Exception" state after being started repeatedly.
Restart of Equipment
Restart the system when:
l The blue hot-swap indicator on the front panel of the board is on or all the indicators on
the front panel of the board are off.
l The indicator displayed is red when you open the system navigator window on the left of
the maintenance console and select System Setting > Board Position Management.
When a board is powered off unexpectedly, pull it out and insert it again to power it on. For
details, refer to section 4.2 Pulling Out and Inserting Boards.
To increase the efficiency of handling the emergent faults and minimize the loss, understand the
following basic principles before maintaining the equipment:
l The ultimate purpose of emergency maintenance is to recover the system operations and
service provisioning as soon as possible. To increase the efficiency of handling the
emergent faults, set up handling schemes based on the emergency maintenance manuals
and organize maintenance personnel to study and practice periodically.
l Maintenance personnel must attend the emergency maintenance training which is
mandatory for them before they handle the faults. They must learn the basic methods of
judging and handling emergent faults.
l When there is an emergent fault, check whether the equipment and the bearer network are
working normally. Then determine whether the emergent fault is caused by the equipment.
If yes, you can handle the fault according to the pre-prepared schemes or the procedures in
this manual.
l When emergent faults occur to the BAM, do not reinstall the system or format the hard
disk on the BAM. Consult Huawei for technical support to avoid the loss of important data.
l For immediate technical support, you can also contact the customer service center (see page
2 of the cover of this manual) or the regional office of Huawei.
l After handling an emergent fault, collect the alarm information related to this fault and send
the fault handling report, equipment alarm files and log files to Huawei for analysis.
This chapter describes the emergency maintenance flow. Emergency maintenance must be
performed based on a general flow. The maintenance handling flow various according to the
equipment fault and the extremely heavy traffic.
2.1 Overview of Emergency Maintenance Flow
This section describes the flow for handling emergent faults. In case of an unexpected alarm or
fault, you need to stay calm, observe the alarm or fault, and record it properly.
2.2 Handling Equipment Fault
This section describes the emergency measures taken when the equipment or BAM server breaks
down.
2.3 Handling Extremely Heavy Traffic Fault
This section describes the procedures for handling the extremely heavy traffic fault. Since the
equipment is designed with large capacity, extremely heavy traffic does not have much impact
on the transfer and translation of the traffic. Long-time extremely heavy traffic, however, will
cause faults such as rise of CPU utilization, generation of a large number of alarms, and no
response to delivered commands. To handle the faults, perform the following steps.
l Check if the power supply to the equipment is normal. That is, check if the power supply
to the cabinet, components within the cabinet, and frames is normal.
l Check if the components, including the LAN Switch and BAM, in the main control cabinet
are running normally.
l Check if the frames are running normally. In the system navigation window of the client,
select System Settings -> Board Position Management. Observe the running status of
the boards (front boards, back boards, and subboards), power frame, and fans.
Therefore, collect the fault information in time regardless of whether the fault has been
successfully handled or not. For detailed information about collecting fault information, refer
to section 2.1.2 Collecting Fault Information.
To shorten fault-handling time, record the emergency handling in detail, notify Huawei of the
replaced boards during the handling, and ask for corresponding standby boards. For the fax, you
can use the Emergency Maintenance Note. For details, refer to 5 Record Tables of Emergency
Maintenance.
You need to copy all the "*.dbg" and "*.log" files under the directory E:\MSSQL\SGData in the
BAM and save them to a new path in time. These files are "*.dbg", "debug.log", and
"DatamanError.log".
NOTE
In this version, the default storage path for the system debugging information is "E:\MSSQL\SGData". For the
actual path, refer to the actual installation directory of the system.
These debug log files are crucial to the location of the fault. When the size of these files reaches
a certain threshold, the system auto removes the information saved originally. Therefore, after
the emergent fault occurs, copy and save these files to another place in time.
You need to collect the alarm log information from 30 minutes before the occurrence of the fault
to 30 minutes after it. You can perform the query and saving operations on the alarm management
system. The operations are as follows:
1. Select Alarm Query -> Query on the menu bar of the alarm management system. Set the
type and time segment of the alarm to be queried on the pop-up window. Click OK.
2. On the output window of the query result displayed, right click and select Save as?on the
shortcut menu displayed. Then export the alarm log information and save it as a text file.
You need to collect the command log information from 30 minutes before the occurrence of the
fault to 30 minutes after it. You can run LST LOG on the client to export the related command
log information and save it as a text file.
Procedure
Step 1 Checking If Power Supply to Cabinet Is Normal
Power failure in the cabinet leads to the breakdown of the host. To check if the power supply to
the cabinet is normal, follow these methods:
l If the green "RUN" indicator on the front panel of the power distribution frame is on and
blinks once every two seconds, it indicates that the power supply to the cabinet is normal.
That is, the power supply to the power distribution frame and the circuit in it is normal.
l If the green "RUN" indicator on the front panel of the power distribution frame is off, it
indicates that the power supply to the cabinet is abnormal. That is, neither of the two channels
of the power supply to the power distribution frame has any voltage.
Step 2 Powering On and Restarting Cabinet After Repairing Power Supply System
Abnormal power supply to the cabinet is generally caused by faults of the power supply system
of the equipment room. In this case, to restore the power supply, perform the following steps:
1. To prevent faults, turn off all the power switches on the front panel of the power distribution
frame before the power supply system of the equipment room returns to normal.
2. Repair the power supply system of the equipment room immediately.
3. After the power supply system of the equipment room returns to normal, power on and
restart the cabinet based on the methods described in section 4.1 Power-on Operations.
4. When meeting problems during system start or restart, see chapter 3 System Restart for
solutions.
Step 4 Checking If HSYSs in Master Service Frame and Slave Service Frame Have Broken Down
The HSYS module supports the normal operation of service frames (OSTA frames). If all the
HSYSs break down, no boards in the frames can work normally.
l The "RUN" indicators on the front panels of the active and standby HSYSs are off or
constantly on.
l The "FAIL" indicator on the front panels of the active and standby HSYSs lights up.
l The high-speed bus status indicators "DOMA" and "DOMB" on the front panels of the active
and standby HSYSs are all off.
l The CPU utilizations of the active and standby HSYSs are close to 50% for a long time.
Regardless of whether the fault is removed or not, collect the fault information as described in
section 2.1.2 Collecting Fault Information.
----End
Procedure
Step 1 Checking If Power Supply to BAM Is Normal
A power failure in the BAM leads to a BAM breakdown. To check if the power supply to the
BAM is normal, perform the following steps:
l If the power indicator of the BAM is green, it indicates that the power supply to the BAM is
normal and the BAM is on.
l If the power indicator of the BAM is yellow, it indicates that the power supply to the BAM
is normal and the BAM is in the standby state.
l If the power indicator of the BAM is off, it indicates that the power supply to the BAM is
abnormal.
If the power supply to the BAM is abnormal, do not perform any operations before locating the
fault cause. Contact Huawei immediately for technical support.
When the status of a certain process in not Started, you can start the process manually. If it does
not work, proceed the next step.
1. If the BAM is in the standby state, press the power switch of the BAM to restart the BAM.
2. If the Windows operating system of the BAM is still running, use the restart function of
the operating system to restart the BAM.
3. If the Windows operating system of the BAM breaks down, press the Reset button of the
BAM to restart the BAM.
Step 4 Protecting Site Environment
If you cannot restore the normal operation of the BAM by restarting it, do not perform operations
to the BAM such as reinstalling the system or formatting the hard disk before technical support
personnel of Huawei arrive. Otherwise, you may lose important data on the hard disk of the
BAM.
Step 5 Collecting Fault Information
Regardless of whether the fault is removed or not, collect the fault information as described in
section 2.1.2 Collecting Fault Information.
----End
Context
NOTE
l Extremely heavy traffic may lead to the congestion of links, SBPU/SBPE, or HSYS, which further causes
mass congestion of links or boards. As a result, mass interruption of communication occurs. If the extremely
high traffic fault is not handled in time, the consequences can be severe.
l Extremely heavy traffic may cause no response for commands delivered. In the process of emergency
maintenance, if the system does not respond after a command has been delivered many times, contact Huawei
immediately for technical support.
Start
Extremely heavy No
traffic fault?
Yes
Clearing activation statistics, Other processing
ccanceling all message tracing flow
Yes
Deactivating link C
Yes
No Foreground/ background
communication times out?
Yes
Checking correpsonding board
End
Procedure
Step 1 Analyzing Fault Alarm
When a sudden alarm occurs, stay calm, observe the alarm carefully, record it, and analyze its
cause. After confirming that the alarm is caused by extremely heavy traffic, obtain technical
support and at the same time you can perform the following steps.
By using the DEL TRFACT command to clear activation statistics, you can reduce the load on
the SLPU, SBPU/SBPE, and HSYS in the foreground and reduce the CPU utilization. After the
statistics console is closed, the statistic data stored by the host in the input/output controller
(IOC) is not sent to the background terminal. Thus, you can reduce the communication load of
the BAM.
Run RST TRACE to cancel all message tracing tasks. Run STP ALLMTPTRAC, STP
ALLTRAC, STP M3TRAC, or STP SCTPTRAC to cancel the tracing of the MTP message,
SCCP, maintenance and debugging message, M3UA message, and SCTP message.
NOTE
If there is no response to the command, re-send the command several times. If there is still no response, contact
Huawei for technical support.
Step 4 Deactivating Links of Office Direction, Load of Which Constantly Exceeds 0.9 Erl
Sometimes, the load of all the links of an office direction constantly exceeds 0.9 erl, or the links
frequently receive TFP and TFA messages, which leads to frequent switchover of the links.
Therefore, you must run DEA N7LKS in the terminal maintenance and test system to deactivate
all the links in the office direction.
Step 5 Handling Faults on Links B and D in Peer Equipment and End Office
When links B and D of the equipment of the peer plane are faulty while links A and C are normal,
messages cannot be forwarded from the plane. At this time, the forwarded and translated
messages will be transferred to the equipment from link C. As a result, the load of the equipment
rises sharply. Therefore, deactivate link C.
If the end office statically shares the load of the equipment, the messages forwarded and
translated by the end office will be discarded when the peer equipment is faulty. Therefore,
modify the data of the end office, and direct the corresponding translation point to the local
equipment.
When the HSYS has a high CPU utilization, commands sent from the terminal workstation (WS)
such as one to deactivate a link often do not get a response, and the terminal prompts
"Communication Timeout". In this case, check engineering information to find the SLPU board
with high load or with links which receive TFP and TFA messages frequently that the links
switch over frequently. After that, pull out the corresponding board to deactivate the links and
reduce the load.
Regardless of whether the fault is removed or not, you must collect the fault information as what
is described in section 2.1.2 Collecting Fault Information.
----End
3 System Restart
This chapter describes the system restart. When an emergency fault occurs, you must start/restart
the equipment based on some conditions and general procedures.
3.1 Overview
This section describes the application scope of equipment restart, the tools required, and the
precautions.
3.2 Restart Procedures
This section describes the procedures for restarting the system. After the tools are available and
other conditions are ensured, you can restart the equipment based on equipment restart flow.
3.1 Overview
This section describes the application scope of equipment restart, the tools required, and the
precautions.
3.1.1 Application Scope
This section describes the process of equipment restart and the applicable circumstances.
3.1.2 Device Preparations
This section describes the conditions to be ensured before equipment restart.
3.1.3 Tool Preparations
This section describes the tools to be prepared before equipment restart.
l ESD-preventive glove
l Chip extraction tool
l Straight screwdriver
l Cross screwdriver
Context
Figure 3-1 shows the restart flow. See subsequent sections for details.
As for "BAM faults handling flow" in the figure, see section 2.2.2 Handling BAM
Breakdown for details. For the handling of other faults, see SG7000 Troubleshooting for details.
Yes
No
Device needs restart?
Yes
Yes
No Start and set FTP server
FTP server is started?
in BAM manager
Yes
Back up data
Power off and restart
from frame 0
Power off and restart from frame 0
No
Link and connection status
is normal?
Yes
Data is consistent in No
host and BAM?
Corresponding fault
Yes handling flow
End
Procedure
Step 1 Device Needs Restart?
If the system encounters hardware or power failure, you must handle it as soon as possible.
See section 3.1.1 Application Scope to decide whether to restart the system when a problem is
met.
Step 2 Software DIP Switches of All Boards Are Queried?
1. Run the following command to query the DIP switches of all boards.
DSP SFTSWT: MODE=ALL;
If the following contents are displayed, it indicates that the DIP switches of all boards are
queried.
RETCODE = 0 Operation succeeded
l Start: indicating the FTP server is stopped and you can start it directly.
l Abnormal: indicating the FTP server is abnormal. In this case, it is suggested to stop all
services and exit the BAM manager, and then start the BAM manager again.
Step 6 Start and Set FTP Server in BAM
In the BAM Manager, select FTP server and then File > Startor click the icon .
After starting the BAM manager, proceed as follows to set it.
Choose an FTP server task, click to open the FTP Server window. Then choose Configure
> Manage Userto display the window as shown in Figure 3-2.
The default user name is ioc_down and password is 123 (only *** is displayed). Enter D:
\SGDATA in the Work Path field.
Step 7 Set Software DIP Switches
l During system restart, if the program and data are not to be loaded from the BAM, proceed
the following settings:
SET SFTSWT: MODE=ALL, VAL=0;
l If the program and the data are to be load from the BAM, set the DIP switches based on the
contents in Table 3-1.
HSYS (main control frame) 224 (loading and saving program and data)
If nothing is entered in BKDIR field, the database and the registry will be backed up in the
same folder as that of the database log by default, that is, E:\MSSQL\SGDATA.
The name of the backup BAM database is Bam********.dat. In the file name, the symbols
******** stand for the current date, such as 20051221.
The name of the backup registry is BamReg.bak.
2. Run SQL server enterprise manager to back up the Alarm and Stats databases.
On both the BAM and the terminal, back up the BAM, STATS, and ALARM databases.
Choose Start > Program > Microsoft SQL Server > Enterprise Manage.
Choose a database to be backed up (such as Alarm), right click and then choose All tasks
> Backup Database, as shown in Figure 3-3.
In the displayed dialog box, click Add and enter the backup path and file name. Then click
OK.
After the backup, the system prompts the success. Click OK to finish the backup.
3. Back up loading folder and log folder.
a. Copy all files and sub-folders under the loading folder (default: D:\SGDATA) to a
PC for backing up files.
b. Copy all files and sub-folders under the log folder (default: E:\MSSQL\SGDATA)
to a PC for backing up files.
4. Back up the License file.
Before restart the system, you must back up the License file.
5. Back up the 128BAM.ini file.
The IP addresses of the HSYSs for communication between the host and the BAM are
recorded in the 128BAM.ini file, as shown in Figure 3-4. IP0 indicates the IP address of
the master HSYS and IP1 indicates that of the slave HSYS.
If the IP addresses of the HSYSs can be pinged on the BAM, it indicates that the
communication between the host and the BAM is normal.
l Authority group
l Statistics tasks (including activated and semi-permanent statistics tasks)
l Scheduled tasks (including sub-tasks)
l WS information
l User information
After restarting the system, check the current running data against that saved in the file. If
the data is missing, add it manually.
a. Querying the information of all authority groups (the following is an example for
querying one authority group):
LST CCG: CG=G_1;
b. Querying the information of all WSs:
LST WS:;
You can query the authorities and IP address of a WS based on its name. For example,
to query the authorities and IP address of a WS whose name is 11111, run the following
command:
LST WS: WS="11111";
c. Querying the information of all activated statistics tasks:
LST TRFACT:;
You can query an activated statistics task based on the task number, for example:
LST TRFACT: TSK=0;
d. Querying the information of all activated and semi-permanent statistics tasks:
LST TRFSEMI:;
You can query an activated and semi-permanent statistics task based on the statistics
type (such as LNK), for example:
LST TRFSEMI: OBJ=LNK;
To load the IO board, you must ensure that the system board works normally.
4. The dialog box as shown in Figure 3-6 is displayed. For the system board, the file name
is sysbios.hdr and for the IO board, the file name is iobios.hdr. Choose Xmodem for the
protocol. Then click Send. It takes about 30 seconds for the super terminal to load the file
to the board over the Xmodem protocol.
After successful loading, the super terminal prints the success prompt.
5. After three seconds, the board is auto reset and loaded with the stratum-2 BOOTROM.
During the reset, the system prompts to input "d" or "D" and you just ignore it. After loading
the stratum-2 BOOTROM, the board is restarted directly from the stratum-2 BOOTROM.
The system prompts "Booting from extended bootrom...." The file name of the stratum-2
BOOTROM of the system board is hsys.bin and that of the IO board is sbpx.bin.
6. After the board works normally, run DSP EXVER on the BAM to query whether the current
version is that to be loaded. If the loading fails, the ALM indicator is always ON. In this
case, you must locate the problem based on the printed information.
Step 12 Link and Connection Status Is Normal?
Proceed as follows to check the system status and handle the problem based on the specific flow
if there is any fault.
1. Query the board status.
The board status should be any of the following:
l Normal
l Master normal
l Backup normal
l Uninstall (for empty slot)
For example:
DSP FRM:;
For example:
DSP BRD: FN=0, SHN=0, SN=5, LOC=FPB;
For example:
DSP EXVERDETAIL: METHOD=ALL;
4. Query the status of the link.
For example:
DSP MULN7LNK: ST=0, EN=500;
For example:
DSP MULN7LKS: ST=0, EN=500;
For example:
DSP MULN7RT: ST=0, EN=500;
7. Query the status of the DSP.
For example:
DSP MULN7DSP: ST=0, EN=500;
%%ACT CNSCHK:;%%
RETCODE = 0 Operation succeeded
Result of data consistency check
------------------
Data of HSYS and BAM = All consistent
Data of backup HSYS and HSYS = All consistent
Data of MTP = All consistent
All data in tables = Consistent
--- END
If the data on the HSYS is inconsistent with that on the BAM, you can run RST DBTBL
to restore the equipment data of the BAM by using the data store in the FLASH of the active
HSYS.
If the data on the boards of the host is inconsistent, you can run RST TBL to restore the
equipment data of other boards by using the data store in the FLASH of the active HSYS.
NOTE
Only limited data can be restored when you run RST TBL each time. In this case, you must run the
command many times to restore all the data.
2. Check service data consistency.
Normally, all service data should be in "All consistent" state.
For example:
ACT SERCNSCHK: TBLID=ALL;
If the service data in the host is inconsistent with that on the BAM, you can run ACT
SERCNSCHK and set RESTORE to YES to restore the data on the BAM. After that, run
ACT SERCNSCHK to check whether the data is consistent.
----End
This chapter describes the related operations of the emergency maintenance, including power-
on operations, pulling out and inserting boards, resetting operations, BAM emergency handling,
and database backup and recovery.
4.1 Power-on Operations
This section describes the power-on operations. For the purpose of commissioning, or due to
equipment fault or power supply failure, you must restart the system by the power-on operation.
4.2 Pulling Out and Inserting Boards
This section describes the procedures for pulling out and inserting a board.
4.3 Resetting Operations
This section describes the resetting operations. Based on the reset object, resetting operations
can be classified into frame resetting and board resetting.
4.4 Handling BAM Failure
This section describes the procedures for handling the BAM failure. The EWS can act as a BAM
or a local WS. When the BAM is down, you can use the EWS to replace it.
4.5 Backing Up and Recovering Database
This section describes the procedures for backing up and recovering the BAM database. The
BAM database is also called the background database. It is an SQL database stored in the hard
disk of the BAM in the format of a data table. Its main function is to store all types of data
generated during data configurations.
Prerequisite
Cabinets should be powered on and restarted only in the following situations:
l During deployment or expansion, it is necessary to perform power-on test to the equipment
after completing the installation of all hardware.
l During important upgrading of software or hardware, it is necessary to power off the
devices, and then power them on again.
l Major faults of the power supply system cause power failures of the equipment. When the
power supply system becomes normal, it is necessary to power on the equipment.
Procedure
Step 1 Powering on the DC distribution cabinet
Turn on all switches on the DC power distribution box (PDB) of the equipment. The "RUN"
indicator on the panel of the PDBs should be on (green). It should blink twice every second,
indicating that the input power supply and internal circuits of the PDB are normal.
Step 2 Powering on PDB of main control cabinet
Turn on the switches on the PDB of the main control cabinet one by one, that is, to turn the
switches to "ON" in turn from SW1 to SW6(PDB DPD100-2-6) or from A1/B1 to A10/B10
(PDB DPD100-2-20). The power module indicators "PWR" on the panel of the frame monitoring
board "ALUI" should be on (green), indicating the power supplies for the frames are normal.
Step 3 Powering on LAN Switch 0 and LAN Switch 1
Turn on the switches of LAN Switch 0 and LAN Switch 1 in the main control cabinet one by
one, that is, to turn the switches of the LAN Switches to "ON". The "POWER" indicators on the
panel of the two LAN Switches should be on (green), indicating the power supplies for the LAN
Switches are normal.
Step 4 Powering on KVM/LCD converter
Turn on the switch of the KVM (keyboard, video, mouse)/LCD (liquid crystal display) converter
in the main control cabinet, that is, to turn the switch to the side with a "-" symbol. The "KVM/
PWR" indicator on the panel of the KVM should be on (green), indicating that the power supply
to the KVM/LCD converter is normal.
If the power indicator of the server is orange, it means that the server is in the standby state. You
can press the power switch of the server. The indicator of the switch will turn to green, indicating
a normal power supply of the server.
Turn on the switches on the PDB of the cabinet one by one, that is, to turn the switches on the
PDB to "ON" in the order of SW1 to SW6(PDB DPD100-2-6) or of A1/B1 to A10/B10(PDB
DPD100-2-20). The power module indicators "PWR" on the panels of the monitoring boards
(ALUIs) of the slave service frames should be on (green), indicating the power supply for the
frames is normal.
After the above operations, check the running status of the components in the cabinet carefully
to ensure there is no abnormality. This completes the power-on operation.
----End
Prerequisite
Frames should be powered on and restarted only in the following situations:
l The active and standby HSYSs in the frame are faulty or down.
l Major faults of the power supply system cause power failures of the frame. After repairing
the power supply system of the frame, power on the frame.
Procedure
Step 1 Confirming Electrical Controlling Relationships
Since the power supply of each frame is controlled by the two switches on the PDB, and the
controlling relationships are different, confirm electrical controlling relationships between the
PDB and the components in the cabinet carefully to avoid maloperations.
After confirming the electrical controlling relationships, turn off the two switches controlling
the power supply of the PDB one by one. The indicators on all the boards of the frame should
be off, indicating power-off of the frame.
Ten seconds after the frame is powered off, turn on the two switches controlling the power supply
of the frames on the PDB one by one. That is, turn the switches to "ON". The power module
indicators "PWR" on all the ALUIs of the frame should be on (green), indicating the frame has
been powered on again.
Step 4 End of Operation
After completing the power on operations, you can open the equipment panel on the client to
see whether the frame is powered on and working normally. If there is no abnormality, you can
complete the power-on operation.
----End
Context
NOTE
If the lower ejector lever of a board is pressed down or the hot-swap indicator (blue) is on, it indicates that the
board is powered off. At this time, if you press the lower ejector lever, you cannot power on the board. Instead,
you must pull out the board and then insert it into the backplane again. After that, press the lower ejector lever
to power the board on.
Procedure
Step 1 Wear an ESD-preventive wrist strap. Insert its grounding terminal into the ESD jack of the rack.
Step 2 Use a cross screwdriver to loosen the fixing screws in the upper and lower ejector levers of the
board, as shown by step (a) in Figure 4-1.
Step 3 Hold the upper and lower ejector levers with hands. Press down the red lockers on the ejector
levers with thumbs to release the ejector levers. After that, you can pull out the board from the
frame.
Step 4 Press outward the ejector levers hard with both hands. When the two ejector levers form an angle
of 45 degree, the board connectors break away from the backplane, as shown by step (b) in
Figure 4-1.
Step 5 Grasp the ejector levers with both hands, and pull out the board for one to two centimeters
smoothly along the slide rails of the frame until the board breaks away from the backplane.
Step 6 After confirming that the board has broken away from the backplane, insert it into the backplane
again. Stop pushing when the positioning pin on the front panel of the board touches the pin
positioning hole on the frame.
Step 7 Turn inward the ejector levers of the board hard with both hands. When the ejector levers are
vertical to the front panel, the locking keys lock the ejector levers. This indicates that the board
has been inserted into the frame.
Step 8 To check the status of the board, perform the following steps:
1. If the "RUN" indicator on the front panel of the board is on and blinks regularly, it indicates
that the board is running normally.
2. Open the system navigator pane on the left of the maintenance console window, and select
System Setting -> Board Position Management. If the board indicator is green or light
blue (when in the standby state), it indicates that the board is running normally. If the
indicator is red, it indicates that the board is running abnormally.
3. If the fault indicator "ALM" (yellow) on the front panel of the board is on for one second
and then off for one second, it indicates that the board is running abnormally.
----End
Postrequisite
If the board is running abnormally, replace it. For details, refer to SG7000 Parts Replacement
Guide.
Prerequisite
NOTE
If the board to be reset is an HSYS, use the BKP DB command to back up the BAM data before
resetting the board.
Procedure
Step 1 Three ways are available for resetting a board. You can choose a proper way based on actual
conditions.
l Resetting Boards Using RST BRD Command
During upgrade of the software, you can run RST BRD to reset boards. For example:
//Resetting the front board in slot 0 of frame 0 of rack 0.
RST BRD: FN=0, SHN=0, SN=0, LOC=FPB;
l Resetting Boards by Pressing "Reset" Buttons on Boards
During failure or down of the board, you can reset the board by pressing the "Reset" button
on the panel of the board.
l Resetting Front Boards by Powering It On and Off
Reset a board by using the PWR BRD command to power on and off the board.
----End
Prerequisite
NOTE
Before resetting a frame, use the BKP DB command to back up the BAM data.
Procedure
Step 1 Two ways are available for resetting a frame. You can choose a proper way based on actual
conditions.
l Resetting a Frame by Resetting All Boards in Frame
During failure or down of all the boards in the frame, you can reset the frame by resetting
all the boards in the frame according to the following steps:
1. Press the "Reset" button on the panel of the HSYS boards to reset the active and
standby HSYSs.
2. Press the "Reset" button on the panel of the service boards (front boards) to reset the
service boards.
l Resetting a Frame by Pulling Out and Inserting the Active and Standby HSYSs
When all boards in a frame are faulty or breakdown, you can reset the frame through pulling
out and then inserting the active and standby HSYSs if method 1 does not work.
----End
Procedure
Step 1 Restoring the Data in the EWS
Restoring the data in the EWS is the same as that in the BAM. For details, see section 4.5 Backing
Up and Recovering Database of this manual.
Step 2 Starting the BAM System of the EWS
After being restarted, the EWS does not start the BAM system automatically. You need to
proceed as follows to start the BAM system.
1. Start the BAM Manager.
Choose Start > Programs > SG7000 Administration System > SG7000 Server. The
BAM Manager window is displayed as shown in Figure 4-2.
Obtain the License file from a Huawei technical support engineer. Copy the file to D:
\SG7000\SERVICES.
3. Run the MML command (SET OPMODE) to enter the offline status.
4. Run the MML command (FMT) to convert the format of all data.
Proceed as follows to disconnect the BAM from the host and connect the EWS to the host:
l When the BAM is working, the EWS and other common WSs are connected to a LAN Switch. The host is
connected to the other LAN Switch. The BAM is connected to the two LAN Switches through two network
adapters.
l When the EWS replaces the BAM, the EWS and the host use the same LAN Switch.
l If inconsistency of board data in the host exists, run RST TBL to use the data saved in
the HSYS to recover the data of the other boards.
2. If service data exist, use ACT SERCNSCHK to check their consistency. If inconsistent
service data exist, set the parameter Whether restore it in the command to yes to recover
the data.
3. If inconsistency still exists, record the names of the inconsistent tables and contact a Huawei
technical support engineer.
4. After you have confirmed that all data tables and service tables are correct, run the command
(SET OPMODE) to start the EWS.
To verify whether the EWS has been started successfully, query the hardware configuration
panel and alarm information on the EWS.
----End
Procedure
Step 1 Backing Up the Data in the EWS
You can use BKP DB to back up system database and registry. The name of the database backup
file is BamYYYYMMDD.dat. The name of the registry file is BamReg.bak. The two files are
backed up to the E:\MSSQL\SGDATA path. Keep another copy of the two files in another
folder.
NOTE
Before starting the BAM again, you must back up the system data of the EWS and then restore the EWS to the
SGSmirror backup mode. The reason is as follows. After started, the SGSmirror process of the EWS checks the
backup files in the EWS and the source files in the BAM to ensure their consistency (mirror backup). That is,
when files are removed from or added to the BAM, the same files are removed from or added to the EWS.
Proceed as follows:
1. Stop BAM Manager. That is, click Exit on the left of the tool bar of BAM Manager. Or
choose File > Exit and then click OK when the system prompts "Are you sure that you
want to exit?"
2. Stop all application programs on the EWS.
3. Disconnect the EWS from the host. Change the IP address of the EWS back to the original
one.
4. Restore the network connections of the EWS with the original network. The EWS starts to
work as a common WS.
NOTE
Do not connect the BAM to the host before you have restored the data in the BAM.
Proceed as follows:
1. Copy the latest backup files of system data and registry files from the EWS to E:\MSSQL
\SGDATA of the BAM.
2. To restore the data in the BAM, see related contents in section 4.5 Backing Up and
Recovering Database of this manual.
Proceed as follows:
1. After restarting the BAM, convert the data format and set the online mode.
2. Insert the network cables of the BAM into the 100 Mbit/s Ethernet interfaces of the LAN
Switch(es) used by the host.
NOTE
l When the BAM is working, the EWS and other common WSs are connected to a LAN Switch. The host is
connected to the other LAN Switch. The BAM is connected to the two LAN Switches through two network
adapters.
l When the EWS replaces the BAM, the EWS and the host use the same LAN Switch.
To verify whether the BAM and emergency WS are working, query the hardware configuration
status and alarm information on them.
----End
At 4:30 a.m. (default time) every day, the system automatically backs up the BAM database and
the registry to the default directory E:\MSSQL\SGDATA.
NOTE
l In this version, the default storage path for the BAM database and registry is "E:\MSSQL\SGDATA". For
the specific path, refer to the actual installation directory.
l Ensure that the disk has adequate space.
l Twenty-two backup devices are required for routine backup (twenty-two erasable/writable compact disks
or twenty-two tape cassettes). Of the twenty-two devices, six are used to back up the data from Monday to
Saturday of the current week (ordinary backup), four to back up the data of the recent four Sundays (weekly
backup), and twelve to back up the data of the first day of each month of the current year (monthly backup).
l Automatic backup of the BAM database
l The backup of the BAM database is performed in a cyclic manner. Up to eleven database
files can be backed up. The backup file of the BAM database includes the following
contents.
l Saving the database backup of the past consecutive seven days from the current day (except
Sunday), "BamYYYYMMDD.dat" is the file name of these six files.
l Saving the database backup of the four Sundays of the past 28 days,
"BamYYYYMMDD.dat" is the file name of these four files.
l On the first day of each month, a BAM database backup has been stored in a file
"BamMonthBak.dat".
l Automatic backup of the BAM registry
l The system automatically stores the BAM registration information in the Windows NT
system registry every day. Only the latest backup is retained, with the file name of
"BamReg.bak".
NOTE
In the BAM.ini configuration file, you can set the start time of the automatic BAM database backup and registry
backup for the system by modifying the value of BkpDbStartHour (start hour of the database backup) and
BkpDbStartMin (start minute of the database backup) in DataMan. Restart the DataMan process for the new
setting to take effect after the modification of the content in the sub-item.
Backup Mode
Two modes are available for manual backup:
To perform the backup through the command is very easy. In the MML client of the service
maintenance system, run BKP DB to back up the current BAM database and the registry to the
default directory E:\MSSQL\SGDATA.
The following section details the mamual backup through the tool menu of the Enterprise
Manager of the SQL Server.
Procedure
Step 1 In the Windows NT interface of BAM, click Start -> Programs -> Microsoft SQL Server 7.0
-> Enterprise Manager to log in to the database server. The database server window is as shown
in Figure 4-3.
Step 2 Assume the BAM name is 2203 (Windows NT). Expand the navigation tree to open the node
Console Root -> Microsoft SQL Servers -> 2203 (Windows NT) -> Databases. Right-click
the node to display a floating menu as shown in Figure 4-4.
Step 3 Click All Tasks -> Backup Database to pop up a window as shown in Figure 4-5. Select
Bam from the Database field to select the database to be backed up. In the Backup field,
determine the method of data backup by selecting backup of all the data or of data that has been
modified. Select the default data backup file name of the system in the Destination field. Click
Remove to remove the value. Then click the Add to determine the file name and the path where
the backup file is located, as shown in Figure 4-6.
Step 4 As shown in Figure 4-7, select Overwrite existing media in the Overwrite area, which
indicates to overwrite the previously backed up data. If you do not want to overwrite the data,
select Append to media and click OK.
Step 5 The system backs up the data after confirmation. Moreover, an interface displaying the backup
progress appears as shown in Figure 4-8.
Step 6 When the system has successfully backed up the data, it displays a success prompt.
NOTE
l When BKP DB is carried out for the BAM database backup, the system does not back up the operation log.
When the Enterprise Manager Tool menu of the SQL Server is used for the data backup, the system will
back up the operation log.
l The file generated from manual backup is named based on the date. If several times of backup are necessary
within one day, the file name of the last backup has to be modified manually.
----End
Procedure
Step 1 Quitting All BAM Processes
Step 2 Restarting SQL Server Program
Stop the SQL Server program and then restart it. In this way, the recovery will not fail due to
the SQL Server's access to data in the BAM server. Its operation procedures are as follows:
1. In the terminal system of BAM server, click Start -> Programs -> Microsoft SQL Server
7.0 -> Service Manager. The system displays a dialog box as shown in Figure 4-9.
2. Click to stop the SQL Server program. Then the system displays a dialog box. Click
Yes to interrupt the communication between the foreground and the background.
4. If the equipment file displayed in Figure 4-10 is not the equipment data file to be restored,
proceed as follows:
5. Click Select Devices to display a window as shown in Figure 4-11.
6. Select the equipment file in the list. Click Remove All and Add. The system displays a
window as shown in Figure 4-12.
7. Click OK after selecting the equipment to be recovered based on the equipment name.
8. Select the data backup file to be recovered and click OK. The system will display a window
showing the progress of the data recovery.
9. After the data recovery is finished, set the BAM Service to the automatic startup mode and
restart the BAM.
----End
Postrequisite
After using the SQL Service Manager tool to recover BAM data, you must carry out the FMT
command, which converts the format of all data.
This appendix describes the usage of the records tables for the emergency maintenance.
5.1 Emergency Maintenance Note
This section describes the emergency maintenance note. If you failed to remove the fault, ask
for help from Huawei by phone or fax. In order to shorten the troubleshooting time, record your
operations in detail, notify Huawei of the replaced boards during the handling, and ask for
corresponding standby boards from Huawei according to the warranty in the contract.
5.2 Troubleshooting Record Table
This section describes the usage of the troubleshooting record table for the emergency
maintenance.
Auditor:
Stamp (your department):
Operator:
Date:
Unsolved problems:
Category of fault:
□ Hardware fault
□ Power supply fault
□ Clock fault
□ Hardware fault
□ Power supply fault
□ FE/E1 interface fault
□ Transmission network fault
Trunk link fault:
□ Fault in communication between foreground and background
□ User terminal fault
□ MTP link fault
Other faults:
Source of fault:
□ Customer complaint
□ Alarm system
□ Found in routine maintenance
Other sources:
Description of fault:
-48V2
PDB
…… ……
A1 A2 A10 B1 B2 B10
To increase the reliability of the power supplies in the cabinet, the PDB uses a large power diode
to combine the two input power supplies into one. Then the power supply is exported to the six
switches on the panel of the PDB through one power bus cable. These switches help to supply
the power for the parts in the cabinet through a power distribution system. In this way, when
one input power supply fails, the other can power the parts in the cabinet through the PDB.
different parts, their primary power supply system loops (namely -48 V power supply systems)
are different.
The LAN Switch 0, LAN Switch 1, and the KVMS in the basic cabinet are controlled by a power
switch, while the other parts are powered by the two switches on the PDB.
The relations between the PDB and the parts in the cabinet are as shown in Table 6-1.
FIREWALL 1 SW1 B4
FIREWALL 0 SW2 A4
LANSWITCH 3 SW1 B5
LANSWITCH 2 SW3 A5
LANSwitch0 SW2 A6
LANSwitch1 SW3 B6
KVM/LCD SW6 A7
A
AS Application Server
ALUI Alarm Unit Interface board
B
BAM Back Administration Module
D
DPC Destination Point Code
L
LAN Local Area Network
LCD Liquid Crystal Display
K
KVM Keyboard, Video, Mouse
M
MML Man-Machine Language
MTP Message Transfer Part
N
NGN Next Generation Network
P
PDB Power Distribution Box
S
SBPI Signaling Broadband Processing Interface
SBPU Signaling Broadband Processing Unit
SCCP Signaling Connection and Control Part
SCTP Signaling Control Transmission Protocol
SG Signaling Gateway
SLPU Signaling Low Processing Unit
SN Sequence Number
SQL Structured Query Language
STP Signaling Transfer Point
T
TFA TransFer-Allowed signal
TFP TransFer-Forbidden signal
W
WS Work Station