SGI® InfiniteStorage 15000 RAID User’s Guide 007-5510-002
Power Cooling Module (PCM) Caution Label: Do not operate with modules missing
Warning: To ensure your system has warning of a
NOTE: Telnet connections are clear text. If Telnet connections are used, you may expose controller passwords to third parties. For
- ROUTE GATEWAY=<aaa.bbb.ccc.ddd> sets the current gateway in the network routing table to the specified Interne
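The following is a minimal Python sketch, not part of the controller or any SGI tool, of how an administrator's script might check an address before composing the ROUTE GATEWAY command text described above. The helper name `route_gateway_command` is hypothetical; only the `ROUTE GATEWAY=<aaa.bbb.ccc.ddd>` syntax is taken from this guide.

```python
import ipaddress


def route_gateway_command(gateway: str) -> str:
    """Return the CLI text for setting the gateway (hypothetical helper).

    Raises ValueError if `gateway` is not a valid dotted-quad IPv4 address.
    """
    addr = ipaddress.IPv4Address(gateway)  # validates aaa.bbb.ccc.ddd form
    return f"ROUTE GATEWAY={addr}"


if __name__ == "__main__":
    print(route_gateway_command("192.168.0.1"))
```

Validating the address on the host side only catches typing mistakes; whether the controller accepts the gateway still depends on its own network settings.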
4.1.4 SNMP Set Up on Host Computer
The following procedure explains how to set up the host computer to monitor the controller’s event
4.1.4.2 Traps
• Generic Traps - coldStart (occurs when the controller is restarted)
• Enterprise Specific Traps - Any
4. Remove the module’s thumbscrews, then slide the module out of the bay.
5. Slide the module into the bay, making sure it is fully in
NOTE: If more than one disk has failed in a single tier, only the first disk to fail will be replaced by the spare disk
DISK INFO indicates which spare drive is replacing the failed drive (Figure 4–6).
Figure 4–6 Obtaining Disk Information
4.2.2.2 Ret
4.2.2.4 Changing the Rate of Rebuild
The commands TIER DELAY and TIER EXTENT control the percentage of processor time
4.2.3 Component Failure on Enclosures
The controller implements the SCSI Enclosure Services (SES) protocol to communicate with its dr
Chapter 5 Drive Enclosure System
5.1 The SGI InfiniteStorage 15000 Drive Enclosure
The SGI InfiniteStorage 15000 drive
Warning: Operation of the Enclosure with ANY modules missing will disrupt the airflow, and the drives will not receive sufficient
5.2 Enclosure Core Product
The drive enclosure design concept is based on a subsystem together with a set of plug-in modules and (as
5.2.1 Enclosure Chassis
The chassis assembly contains 60 drive bays at the front, each of which accommodates a plug-in
5.3.2 Input/Output (I/O) Module
The drive enclosure storage subsystem includes an enclosure with rear-facing bays that house two (2
Table 5–2 defines the LED indicators incorporated on the I/O module.
Figure 5–5 I/O Module
Host Port 1 Expansion
5.3.3 Drive Carrier Module and Status Indicator
The Drive Carrier Module comprises a hard disk mounted in a carrier (Figure 5–6). Eac
Note: The DEM is a serviceable PCB which may be replaced by trained personnel.
Each DEM connects to a single port of th
cards are fitted to drives. These LEDs indicate drive presence. The Front Display Panel LEDs are defined in Table 5–3.
Figure 5–9 F
Drive Fault AMBER Drive fault ON - one or more HDDs are faulted. SES must determine exact HDD. OFF - no detectable drive
5.4.2 Internal Indicators
Note: In some components, the failure LED is internal to the enclosure and visible only when the cover is o
5.5 Visible and Audible Alarms
The functional modules have associated status LEDs. The Display Panel (Table 5–3) show
• The electrical distribution system must provide a reliable earth ground for each unit and the rack.
• Each power supply in eac
5.6.2 Weight
5.6.3 AC INPUT PCM
5.6.4 DC INPUT PCM
Item Quantity per System LBS (kg) Extended Weight
Chassis Includes Base PCB & Midp
5.6.5 DC OUTPUT PCM
5.6.6 PCM Safety and EMC Compliance
5.6.7 Power Cord (minimum requirements)
Note: Power cord is not in
Chapter 6 Drive Enclosure Installation
6.1 Introduction
Caution: When connecting up the drive enclosure su
6.2.1 Enclosure Bay Numbering Convention
Warning: Operation of the Enclosure with ANY of the plug-in modules missing from the rear of
Figure 6–2 1x60 Drive Numbering Table
Figure 6–3 2x30 Drive Numbering Table
6.3 Enclosure Installation Procedures
Important: SGI InfiniteStorage 15000 drive enclosures should only be installed in SGI InfiniteSt
6.5 SAS DEM
The drive enclosure contains eight (8) top-loadable DEMs. The DEMs provide the SAS connectivit
Chapter 7 Drive Enclosure Operation
7.1 Before You Begin
Before powering up the enclosure, please ensure that
• If the subsystem is used with modules or blanking plates missing for more than a few minutes, the enclosure can overheat, cau
7.2.2 I/O Panel LEDs
The I/O Panel LED fault and status conditions are shown in Figure 7–1.
Table 7–1 defines the LED indicators incor
Chapter 8 Drive Enclosure Troubleshooting
8.1 Overview
The SGI InfiniteStorage 15000 Drive Enclosure incl
3. Check that there is a valid SAS signal present at the I/O connector. If there is no signal present, ensure the cable has been pro
• Under normal conditions, the LEDs should all be illuminated constant GREEN.
• If a problem is detected
8.3.5 Front Panel Drive Activity Indicators
The Front Panel Drive Activity Indicators show the aggregated status of all the modules.
8.4 Audible Alarm
When the Audible Alarm sounds, use the appropriate SES CLI command.
8.4.1 Top Cover O
Symptom Cause Action
If the ambient air is cool (below 25 °C) and the fans are observed to increase in speed, then some restriction on
8.5.2 Thermal Alarm
8.5.3 Thermal Shutdown
Important: An enclosure will shut down when a critical tempera
8.7 Continuous Operation During Replacement
Depending on how the subsystem is set up, if a disk unit fails, it can normally be replac
To remove an AC PCM - Thumb Screw Model:
Warning: Do not remove the faulty PCM unless you have a replace
8.8.2 I/O Module
Warning: Do not remove this module unless a replacement can be immediately added. The system must not be run without a
spin down the drives prior to removal. If this is not possible, we recommend that you perform All steps
Appendix A Controller Technical Specifications
The following are the technical specifications of the SGI InfiniteStorage 15000 controller.
NOTE: S
Physical, Power & Environmental
Controller chassis (H×W×D, Weight) 3.5" × 19.0" × 31", 45 lbs
Electrical/AC 100-120V /
Appendix B Drive Addressing
This appendix illustrates the drive addressing for the disk enclosures. There are two drive numbering options:
• 1 x
Figure B–3 Controller-Drive Enclosure Couplet (1x Configuration)
Figure B–4 Controller-Drive Enclosure Couplet (2x Configuration)
Appendix C Cabling Controllers and Drive Enclosures
This appendix illustrates the proper cabling of the following two configurations:
• 2 controlle
Chapter 1 Introduction
The SGI InfiniteStorage 15000 controller is an intelligent storage infrastructure device designed an
Figure C–1 Controller Cabling (2 Controllers and 5 Drive Enclosures)
Figure C–2 Drive Enclosure Cabling (2 Controllers and 5 Drive Enclo
Figure C–3 Controller Cabling (2 Controllers and 10 Drive Enclosures)
Figure C–4 Drive Enclosure Cabling (2 Controllers and 10 Drive Enclosures)
• Comprehensive, Centralized Management Capability
The controller provides a wide range of management capabilities: Configuration Mana
Figure 1–1 SGI InfiniteStorage 15000 IB - Front and Rear Views
The controller is a high-performance controller designed to
1.2.1 Power Supply and Fan Modules
Each controller is equipped with two (2) power supply modules and one (1) fan module. The PSU (power
1.2.2 I/O Connectors and Status LED Indicators
Figure 1–3 shows the ports at the back of the controller 4 Infiniband (IB) u
COPYRIGHT© 2008 SGI. All rights reserved; provided portions may be copyright in third parties, as indicated elsewhere herein. No permission is grante
Figure 1–4 Host Port LEDs
The four HOST ports are used for IB or FC-8 host connections. You can connect your host servers IB HCA port(s
There are two AC Fail LEDs. Each LED is connected to its power supply independently of the other supply. The LEDs are gree
Table 1–1 LED Indicators
Status Indicator LED Activity Explanation
IB Solid Green (Infiniband) Physical Connectivity with host
Solid Ambe
1.2.3 Uninterruptible Power Supply (UPS)
Using an Uninterruptible Power Supply (UPS) with the controller is highly recomme
Chapter 2 Controller Installation
These steps provide an overview of the controller installation process. The s
2.2 Unpacking the System
Before you unpack your controller, inspect the shipping container(s) for damage. If you
2.2.3 Connecting the Controller
To set up the disk enclosures and connect them to the controller, do the follo
Configuration of disks in the enclosures must be in sets of complete tiers (Channels A through P). Allocating o
2.2.7 Powering On the Controller
NOTE: Systems that have dual controllers (couplets) should have the controlle
1 Introduction ...
2.3 Configuring the Controller
This section provides information on configuring your controller.
NOTE: The conf
The controller supports various disk drive enclosures that can be used to populate the 10 <ABCDEFGHPS> d
For example, to change the system date to March 1, 2009, enter: date 3 1 2009<Enter>
To set the system
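As a hedged illustration of the date example above, the following Python sketch formats a calendar date into the month, day, year argument order that the guide's example uses (`date 3 1 2009` for March 1, 2009). The helper name `date_command` is hypothetical and is not part of the controller CLI.

```python
from datetime import date


def date_command(d: date) -> str:
    """Format a date as the `date <month> <day> <year>` text from the example."""
    return f"date {d.month} {d.day} {d.year}"


if __name__ == "__main__":
    print(date_command(date(2009, 3, 1)))  # prints "date 3 1 2009"
```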
2.3.6 Checking Tier Status and Configuration
Use the tier command to display your current tier status. Figure
Figure 2–6 Current Tier Configuration
2.3.6.1 Heading Definitions
• Total LUNs. LUNs that currently reside on the
Figure 2–7 Dual Controller Configuration
If you require multi-pathing to the LUNs, enable cache coherency. If y
NOTE: In dual mode, LUNs are “owned” by the controller unit where they are created. Hosts only see the LUNs on
After you have initiated LUN format, the message Starting Format of LUN is displayed. You can monitor the form
2.3.9 Setting Security Levels
After you have formatted all the LUNs, you can define users’ access rights. Confi
2. Check to ensure that the LUN Zoning chart is empty (Figure 2–11).
Figure 2–11 LUN Zoning Screen
To add a use
3.4 Security Administration ...
10. Enter a new unique LUN mapping for this user. Options are shown in Table 2–2 on page 26.
11. Connect user 2
2.3.9.2 Host Port Zoning (Anonymous Access)
Host Port Zoning (Anonymous Access) should only be used for non-SAN
Chapter 3 Controller Management
3.1 Managing the Controller
The controller provides a set of tools that enable adm
3. With the controller ready, press <Enter> to get the controller prompt.
NOTE: To change the baud rate on the controller, see sect
NOTE: The default administrator account name is “admin” and its password is “password.” Similarly, the default u
Figure 3–4 WHOAMI Screen
3.2 Configuration Management
The controller provides uniform configuration management across heterogeneous SA
The PORT=X|ALL parameter specifies the host port(s) (1 to 4) to be affected when used in combination wi
HOST STATUSCLEAR resets the error counts.
Figure 3–7 IB Host Ports Status Screen
3.2.1.3 Host IB Users
HOST ibusers displays additional
Figure 3–9 Disk Channel Screen
If the channel status is “acquiring loop synchronization,” this may indicate a cha
5.6.3 AC INPUT PCM ...
Figure 3–11 Disk Defect List Screen
DISK FAIL=<tier><channel> instructs the system to fail the specified disk at the physi
Table 3–2 PHY Link Error Status Block Information
NOTE: SATA drives have an Active/Active MUX (AAMUX) installed.
Figure 3–12 Disk PLS Tier 1 Status Screen
For other DISK parameters, see section 3.2.10 "Automatic Drive Rebuild" in this gu
3.2.2.2 Tier Configuration
TIER CONFIG displays the detailed tier configuration information for all of the tiers
Figure 3–14 Tier Configuration ALL Screen
3.2.2.3 LUN View
Entering the LUN command displays the current status of the LUNs. “Ready
Figure 3–16 LUN List Screen
3.2.2.4 LUN Configuration
LUN CONFIG displays the configuration information for all t
3.2.2.7 Status of Drive Enclosures
The SES command displays the failures reported by the enclosure (Figure 3–19), through the SCSI En
TIER MAP displays the current mapping mode for the disks in the array.
TIER CHANGEMAP changes the current tier m
NETWORK TELNETPORT=<port number> changes the Telnet port number for the current controller. The system must be restarted before
3.2.4.4 Displaying and Editing the Routing Table
The ROUTE command displays the current routing table of the sys
8.9 Replacing the DEM ...
RESTART DUAL restarts both units.
RESTART KILL stops a timed restart that is in progress.
3.2.5.2 System Shutdown
SHUTDOWN shuts down t
3.2.6.2 System Time
TIME displays the current system time.
You can also change the system time. At the prompt, ty
The system will halt all I/O requests, delete all the LUNs, and restore all the parameters to their default values. This is a des
To add a 32-bit LUN that will not exceed 2 TB, type: lun add=X<Enter>
To add a 64-bit LUN that e
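The 2 TB boundary mentioned above can be illustrated with a short arithmetic sketch in Python. It rests on assumptions that this guide does not state: that "32-bit" refers to the width of the block address, that blocks are 512 bytes, and that 2 TB means 2 × 2^40 bytes. Under those assumptions, 2^32 blocks × 512 bytes is exactly 2 TB. The helper name `lun_address_width` is hypothetical.

```python
BLOCK_SIZE = 512                       # assumption: 512-byte blocks
MAX_32BIT_BYTES = (2 ** 32) * BLOCK_SIZE  # 2^41 bytes under that assumption


def lun_address_width(capacity_bytes: int) -> int:
    """Return 32 if the capacity fits within the 2 TB limit, else 64."""
    return 32 if capacity_bytes <= MAX_32BIT_BYTES else 64


if __name__ == "__main__":
    one_tb = 2 ** 40
    print(lun_address_width(1 * one_tb))  # fits in a 32-bit LUN
    print(lun_address_width(3 * one_tb))  # needs a 64-bit LUN
```

If the controller uses a different block size or a decimal definition of TB, the boundary value changes accordingly; the comparison logic stays the same.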
Figure 3–24 Changing a LUN Label Screen
3.2.9.6 Moving a LUN (Dual Mode Only)
To change the ownership of a LUN from one controller to
Use the TIER command to display the current setting (Figure 3–26). The rebuild operation can take up to several
SMART ENABLE enables SMART on all the disk drives installed in the system and updates the parameter blocks on the disk. This enables
SMART TEST=ON enables the test bit in the Information exception mode page for all the disks installed. Setting
For example, a user is given access to internal LUNs 5, 6, and 7, which are mapped to external LUNs 0, 1, and 2, respectively. contro
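The mapping in this example can be pictured as a small lookup table. The Python sketch below is only an illustration of the idea, using the LUN numbers from the example above; the names `external_to_internal` and `resolve_lun` are hypothetical and do not correspond to any controller command.

```python
# Internal LUNs the user may access, in the order they are exposed.
internal_luns = [5, 6, 7]

# External LUN numbers 0, 1, 2 map to internal LUNs 5, 6, 7 respectively.
external_to_internal = dict(enumerate(internal_luns))


def resolve_lun(external: int) -> int:
    """Return the internal LUN behind an external LUN number.

    Raises KeyError for an external LUN the user has not been given.
    """
    return external_to_internal[external]


if __name__ == "__main__":
    for ext in sorted(external_to_internal):
        print(f"external LUN {ext} -> internal LUN {resolve_lun(ext)}")
```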
Figure 3–28 Labeling a Controller Unit
3.2.12.4 Singlet
The DUAL SINGLET command sets the system in the singlet mo
Preface
What is in this guide
This user guide gives you step-by-step instructions on how to install, configure, and connect the S
Figure 3–29 Cache Setting Screen
You can use the LUN=x option to specify which LUN to change. If no LUN is specified, changes will be
If the MF (Multiplication Factor) parameter is OFF, the system will only prefetch the number of blocks specified
3.3.2 Audio/Visual Settings of the System
The audio and visual (AV) settings of the system and the disks can be tuned to provide bette
RC=ON|OFF enables the Read Continuous (RC) option for Audio/Video streaming data; the system starts the data tr
For example, LUNs 0 to 3 are locked in cache and all 50% of the total cache has been filled by data from LUNs 0, 1, and 2. When a host
Figure 3–33 System Performance Statistics Screen
Read Hits shows the percentage of Read I/O requests where the da
STATS DELAY displays a histogram of the time it takes for the host and disk I/O requests to complete in 100 msec intervals (Figure 3–
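To make the idea of a histogram in 100 msec intervals concrete, the following Python sketch bins a list of completion times into 100 ms buckets. It is an illustration only: the sample values are invented, the function name `delay_histogram` is hypothetical, and the exact bin edges the controller uses are not given in this guide (bucket n here covers n × 100 ms up to, but not including, (n + 1) × 100 ms).

```python
from collections import Counter

INTERVAL_MS = 100  # bucket width taken from the "100 msec intervals" wording


def delay_histogram(completion_times_ms):
    """Count I/O completion times per 100 ms bucket (bucket index -> count)."""
    return dict(Counter(int(t // INTERVAL_MS) for t in completion_times_ms))


if __name__ == "__main__":
    samples = [50, 120, 199, 250]  # invented sample data, in milliseconds
    for bucket, count in sorted(delay_histogram(samples).items()):
        low = bucket * INTERVAL_MS
        print(f"{low:4d}-{low + INTERVAL_MS - 1:4d} ms: {count}")
```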
STATS HOSTDELAY displays a histogram of the time delay between when the last data transfer is set ready and the
STATS DISK displays a histogram of the disks in the system that have taken an unusually long time to complete an I/O request (Figure
STATS DUAL displays the statistics for the dual mode messages (Figure 3–38).
Figure 3–38 Dual Message Statistics
European Regulations
This equipment complies with European Regulations EN 55022 Class A: Limits and Methods of Measurement of R
STATS OFFSET displays a histogram of the offset of the host I/O requests into the cache segments (Figure 3–40). Host I/O requests wit
TIER DELAY=x sets the system rebuild/format delay. This value is in 100-millisecond increments. The ra
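Because the TIER DELAY value is expressed in 100 millisecond increments, converting between the setting and an actual delay is simple arithmetic. The Python sketch below shows that conversion; the helper names are hypothetical, and the permitted range of x is not checked because it is not reproduced here.

```python
INCREMENT_MS = 100  # each TIER DELAY unit is 100 milliseconds


def tier_delay_ms(x: int) -> int:
    """Convert a TIER DELAY setting into milliseconds."""
    return x * INCREMENT_MS


def tier_delay_command(x: int) -> str:
    """Return the CLI text for a given setting (TIER DELAY=x syntax)."""
    return f"TIER DELAY={x}"


if __name__ == "__main__":
    setting = 3
    print(tier_delay_command(setting), "->", tier_delay_ms(setting), "ms")
```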
TIER VERIFY displays a summary of verifications.
To enable Tier Verify (Figure 3–42):
1. At the prompt, type TIER VERIFY=ON <Enter>
Figure 3–43 Tier Verify OFF Screen
TIER VERIFY=X verifies the specified tier if possible.
These API and CLI
communication, such as a cable failure, can cause the controller to fail numerous disk drives. Once the loss of communication is res
To display the information about the rebuild journal, use the TIER JOURNAL command (Figure 3–44). To display the
NOTE: Improper use of the SES M_WAIT command can prevent the SES monitors from detecting an enclosure fault before the enclosure auto
3.4.1 Monitoring User Logins
The AUDIT function continuously monitors logins to the controller and provides aler
Figure 3–50 Edit Zoning Configuration Screen
3.4.3 User Authentication
The controller creates correspondence between users (World Wide
3.5 Firmware Update Management
SGI periodically releases firmware updates to enhance features of its products.
Chassis Warning Label: Weight Hazard
• Do not lift the drive enclosure by the handles on the power cooling module (PCM); they a
Figure 3–53 Downloading Controller Firmware
6. Enter the TFTP server’s IP address: TFTP <IP_address>
7. Enter the firmware path
Figure 3–54 Telnet Statistics
The administrator is strongly advised to perform any commands affecting the system’
Within the CLI subshell, the TELNET command allows the administrator to view information regarding the currently active Telnet sessi
3.7 System Logs
3.7.1 Message Log
All controller events are logged and saved in non-volatile memory. The log will
To display the number of LUN array parity errors detected by the system, use the FAULTS ARRAYPARITY command. The system saves the cou
3.8 Other Utilities
3.8.1 APC UPS SNMP Trap Monitor
APC_UPS displays the status of the APC UPS SNMP trap monitor
Figure 3–64 Displaying the Serial Console Setting
CONSOLE BAUD changes the baud rate of the CONFIG port of the controller (Figure 3–65
3.8.6 Disk Reassignment and Miscellaneous Disk Commands
The DISK REASSIGN=tc 0xh command allows for the reassigni
test so it will not affect the performance of the system. Any changes applied to delay will affect tests in progress as well as fut
Chapter 4 Controller Remote Management and Troubleshooting
4.1 Remote Management of the Controller
The controller can b