This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Workstations where static sensitive equipment is handled must be equipped with the following equipment:
Bench top grounding points ESD protective table top or mat Wrist strap and grounding points Protective clothing, including hair protection ESD safe floor area ESD safe chair Ionized air blower ESD safe containers
• Near a Properly grounded, three-pronged power outlet for 200-240V
• Clean and relatively free of excess dust
• Well-ventilated and away from sources of heat, with the ventilating openings on the server kept free from obstructions
• Maximum ambient temperature shouldn't exceed 35 oC ( 95 oF)
• Isolated from strong electromagnetic fields and noise caused by electrical devices such as elevators, copy machines, air conditioners, large electric motors, radio and TV transmitters, and high frequency security devices
• Access space provided for handling the cables connected to the server
A typical Serial Attached SCSI system would consist of the following basic components: An Initiator is a device that originates device service and task management requests to be processed by a target device and receives responses for the same requests from other target devices. Initiators may be provided as an on-board component on the motherboard (as is the case with many server-oriented motherboards) or as an add-on host bus adapter. A Target is a device containing logical units and target ports that receives device service and task management requests for processing and sends responses for the same requests to initiator devices. A target device could be a hard disk or a disk array system. A Service Delivery Subsystem is the part of an I/O system that transmits information between an initiator and a target. Typically cables connecting an initiator and target with or without expanders constitute a service delivery subsystem. Expanders are devices that are part of a service delivery subsystem and facilitate communication between SAS devices. It facilitates connection of multiple SAS End devices to a single initiator port.
A set of chips that provides the interfaces between all of the PC's subsystems. It provides the buses and electronics to allow the CPU, memory and input/output devices to interact. Most Intel chipsets, which are contained on two to four chips, also include built-in EIDE support. In the past, Intel used the name "Triton" for its chipsets. It also used the name PCI set for PCI-based chipsets. Desktop Chipset:810/845/845GV/865/910/945G/Q35 Server Chipset Family:3000/5000/7000
IBM xSeries/SystemX continues to leverage decades of experience in designing, building, and supporting servers to provide the most innovative industry standard servers in the market today. Starting with our x-Architecture Blueprint we delivered innovations like LightPath Diagnostics, Extended PFA, Integrated Service Processor, Software Rejuvenation, etc…
ServeRaid is IBM nomenclature for IBM Raid Controllers. There are different types of ServeRaid controllers available which includes, Host Raid, ZCR, SCSI, SATA/SAS.
Below is table of commonly used ServeRaid models and its connection type for reference.
4L 4Lx 4M 4Mx 4H 5i 6M 6i/6i+ 7t 7e 7k 8e 8i 8k-l 8k
SCSI SCSI SCSI SCSI SCSI SCSI SCSI SCSI SATA SCSISATA
For full range of ServeRaid controller & its compatibility with picture can be found on the link given below. It also consists full technical details as supported raid level, Voltage, Bus etc.
IBM Server Guide IBM Server Guide is a tool that simplifies the process of installing and configuring IBM System x and Blade Center servers. Server Guide goes beyond mere hardware configuration by assisting with the automated installation of Windows server operating systems, device drivers and other system components, with minimal user intervention. The goal of Server Guide is to simplify and shorten installation. The built-in intelligence of Server Guide recognizes machine types and models as well as Serve RAID adapters and other hardware criteria. Based on the gathered hardware information, Server Guide offers installation and configuration choices tailored to your system. To Use it only we need to boot with server guide CD and follow the instruction. IBM Server Guide Can Be downloaded from IBM site. Support link has been given in last slides of this training module. Note: Installation from Server Guide CD will erase all existing data on the server, should not be used if required to format only one partition on raid subsystem.
• POST error messages and error logs which can be viewed using DSA or on system startup.
• System error LED and Light Path diagnostics
• F2 Diagnostics/Enhanced Diagnostic CD
•Diagnostic programs error messages – DSA
We can use BMC/ISMP/IPMI/RSA logs to analyze problem with the server hardware. All these logs can be collected through DSA on condition that Management controller (BMC/ISMP/IPMI/RSA etc.) is properly installed/updated on the system OS while trying to collect DSA logs on top of OS.
System error LED & Light Path Diagnostics System error LED & Light Path Diagnostics
System Error LED
Information LED
• Light Path panel is visible from the front of the unit.
This revolutionary technology helps expedite repairs and minimize downtime by quickly and clearly identifying those components that need maintenance. You are alerted to problems via an indicator light on the front of the server.
To troubleshoot hardware problems IBM has inbuilt (as a bootable CD in old systems) system utility called F2 Diagnostics. We can check system health using this tool. It includes all hardware test included in base system.
To run F2 Diagnosis follow below mentioned steps:
•Boot from Enhanced Diagnosis CD or press F2 when the message F2 for Diagnostics is displayed. •Select either Extended or Basic from the top of the screen. •From the diagnostic screen, select the test you want to run from the list and then follow all instructions.
It will show the result as passed or with some error codes. On hardware related errors troubleshoot with the troubleshooting information from the below link : http://www-947.ibm.com/systems/support/supportsite.wss/selectproduct?taskind=1&brandind=5000008&taskind=2
IBM Dynamic System Analysis (DSA) collects and analyzes system information to aid in diagnosing system problems. Additionally, DSA creates a merged log that allows users to easily identify cause-and-effect relationships from different log sources in the system. DSA also compares device driver and firmware on the system. DSA Portable: DSA Portable Edition runs from the command prompt on a supported system without altering any system files or system settings. It expands to temporary space on the target system, runs, and deletes all intermediate files after execution completes.
DSA Installable: DSA Installable Edition provides a permanent installation of DSA onto a system. This installation shares a similar command prompt interface with the portable edition.
DSA Bootable: DSA Bootable can be created by ISO downloaded from IBM Site to boot from the same on unavailability of OS.
To Run DSA Follow below steps: Installable edition: Go to StartProgramsIBMDSA Portable Edition: Click on the exe file downloaded as IBM Director Portable Edition. Bootable edition: Boot from the CD. It will auto start and will ask to View/Save the logs. Save in external media (e.g. FDD, USB Stick)
After running DSA (Installable or Portable) logs will be saved in folder c:\IBM_Support\ with system serial number time e.g. 8648IBS_99BAXM1_20080930-183757.xml.gz or 8648IBS_99BAXM1_20080930-183757.zip. Logs will be saved in both .zip and .xml format. Zip folder can be extracted and opening file index.html will show logs in web format. Files with .xml extension needs to be extracted only on the systems where DSA is installed by using following command. This will also save the file in c:\IBM_Support folder with above specified file name format.
IBM UpdateXpress can help reduce your cost of computing by providing an effective and simple way to update device drivers, server firmware and firmware of supported options contained within the server on most of your System x and BladeCenter products. If you have purchased an IBM server, UpdateXpress is available for download at no additional charge. IBM Update Xpress is ISO downloadable and very easy to update the server with minimal user intervention. We need to boot from the CD and choose components we want to update. This CD automatically recognize server type and updates the codes accordingly. It minimizes wrong updating of codes.
Important points to be considered before updating codes: •There may be some latest updates for particular server equipments. Need to cross verify on IBM site for latest updates and update it manually. Please read readme file before updating firmware codes to ensure all prerequisites. •Ensure to install latest device drivers first before updating firmware codes on a server. Drivers also can be updated with UpdateXpress CD. •To check & download UpdateXpress compatibility and version list visit below mentioned site: http://www-304.ibm.com/systems/support/supportsite.wss/docdisplay?lndocid=SERV-XPRESS&brandind=5000008
You must have below mentioned information handy to get required help exploring IBM site:
Machine Product family (e.g. IBM x345)
Machine Type number (IBM 4 digit machine identifier e.g. 8840)
Machine model number (32x)
Machine Serial number (7 Digit Serial Number)
Verify your warranty status - This data will assist IBM in quickly providing a solution to your issue, and ensuring you receive the appropriate level service for which may have contracted. Warranty Status you can check with IBM Site “Warranty lookup” .
Basic Diagnostic: - a. Check the Front panel for errors. b. Check the LPD panel for Errors. c. Check the Beep codes. d. Check the Component LED’s for Errors. e. Check for Cable loose connections & Switches etc. Extended Diagnostic: - a. Press NMI Switch and try b. Reset BIOS to defaults c. Reseat memory and add-ons. d. Start server with minimum configuration to identify the faulty component.
a. Replace the Spare identified, if any b. Take DSA Reports and analyze – Pre-update DSA c. Run F2 Diagnostics. d. Check System event log and Diagnostic log for errors. e. Update drivers, BIOS and Firmware – (Optional if there is no error reported in System Event log, DSA Log and Diagnostic Log.) f. Take DSA Report and analyze the results – Post-update DSA g. Run Standard peripherals test. i. CD / DVD – Read , Floppy Drive – Read & Write ii. Tape Drives – Read & Write. iii. Network – Ping to Gateway iv. External Storage access – Drive access v. Save and Clear the Existing Hardware event logs. Tools and Utilities Required: DSA Utility IBM Update Express CD - Reference Documents: IBM Server Hardware Maintenance Manual
Basic Diagnostic: - a. Identify the Error from Error message b. Check the Error / POST Codes c. Check the System Event Log through F2 Diagnostics. d. Check the Front panel for errors. e. Check the LPD panel for Errors. f. Check the Beep codes. g. Check the Component LED’s for Errors. Extended Diagnostic: - a. Reset BIOS to defaults b. Update System BIOS & Firmware using update express c. Start server with Minimum Configuration and identify the faulty component.
a. Replace the Spare identified, if any b. Take DSA Reports and analyze – Pre-update DSA c. Run F2 Diagnostics. d. Check System event log and Diagnostic log for errors. e. Update drivers, BIOS and Firmware – (Optional if there is no error reported in System Event log, DSA Log and Diagnostic Log.) f. Take DSA Report and analyze the results – Post-update DSA g. Run Standard peripherals test. i. CD / DVD – Read , Floppy Drive – Read & Write ii. Tape Drives – Read & Write. iii. Network – Ping to Gateway iv. External Storage access – Drive access v. Save and Clear the Existing Hardware event logs. Tools and Utilities Required: DSA Utility IBM Update Express CD
Basic Diagnostic: - a. Diagnose through DSA Log b. Run IBM diagnostics c. Run OS Stress tests d. Check the Front panel for errors. e. Check the LPD panel for Errors. f. Check the Component LED’s for Errors. Extended Diagnostic: - a. Update Drivers , BIOS and Firmware b. Check OS Patches and updates c. Observe system performance d. Eliminate present installed OS.
Resolution & Functionality Check a. Replace the Spare identified, if any b. Take DSA Reports and analyze – Pre-update DSA c. Run F2 Diagnostics / IBM Diagnostics d. Check System event log and Diagnostic log for errors. e. Update drivers, BIOS and Firmware – (Optional if there is no error reported in System Event log, DSA Log and Diagnostic Log.) f. Take DSA Report and analyze the results – Post-update DSA g. Run Standard peripherals test. i. CD / DVD – Read , Floppy Drive – Read & Write ii. Tape Drives – Read & Write. iii. Network – Ping to Gateway iv. External Storage access – Drive access v. Save and Clear the Existing Hardware event logs. Tools and Utilities Required: DSA Utility IBM Update Express CD
Basic Diagnostic: - a. Diagnose through Raid manager ( From Installed Raid Manager , ServRaid Manager CD or IPSCEND Utility ) b. Diagnose through Raid Log ( DSA or Dump Log ) Extended Diagnostic: - a. Run Stress tools and try to simulate ( SQL IO Stress , HP Data creator , Multiple DD commands in Linux / Unix) b. Update Raid Driver & Firmware, Backplane Firmware & HDD Firmware, and then Observe. Resolution & Functionality Check: - a. Replace the Spare identified, if any b. Update Raid Card Drivers and then Firmware, if the replaced spare is Raid Card. c. Update backplane Firmware d. Run a disk subsystem stress tool like SQL IO Stress / HP Data creator / Multiple DD Commands.
e. Take the Raid Logs after stress test and analyze. f. Ensure there is no predictive failure in any HDD, through raid manager / dumplogs. g. Run DSA and confirm there is no error. h. Run Standard peripherals test. i. CD / DVD – Read , Floppy Drive – Read & Write ii. Tape Drives – Read & Write. iii. Network – Ping to Gateway iv. External Storage access – Drive access i. Save and Clear existing event logs Tools and Utilities Required: DSA IBM Update Express CD IBM ServRaid Manager HP Data Creator and Reader
Server Health Check Basic Health Check : - a. Visual Inspection for bend , dent, breakage etc.. b. Open the Server and ensure the components and add-ons are fixed properly c. Power on and check for Display d. Check LED’s and LPD Panel for Errors e. Run F2 Diagnostics. f. Check System event log and Diagnostic log for errors. g. Update drivers, BIOS and Firmware – (Optional if there is no error reported in System Event log and Diagnostic Log. & Must for the new systems @ Logistics ) h. Configure Raid – If raid card is available and not configured already. i. Load Supported Windows / Linux OS - If OS is not loaded already.. j. Take DSA Report and analyze the results for errors and predictive failure. k. Run Standard peripherals test. i. CD / DVD – Read , Floppy Drive – Read & Write ii. Tape Drives – Read & Write. iii. Network – Ping to Gateway
l. Install IBM Director and run Real time diagnostics m. Run Stress test for CPU/Memory Subsystem – Linpack can be used directly on Linux servers and for Windows Servers, boot from KNOPIXX Linux CD and run the Linpack from HDD or PEN drive. n. Run Microsoft memory Diagnostics - Windiag o. Run Stress test for Disk Subsystem – HP Data Creator & Reader / SQL IO etc p. Save and Clear the existing Event logs Tools and Utilities Required: DSA IBM Update Express & ServRaid Manager HP Data Creator and Reader Intel Linpack for Windows and Linux Windiag utility – ( Img & ISO file available in Internal KB site )
PERC is Dell nomenclature for its RAID controller. Among the standard hardware components of a server, Dell is having proprietary PowerEdge-specific PERC (PowerEdge Expandable RAID Controller). The related software in the PERC Fault Management Suite offers facilities such as the Background Patrol read, which aims to fix bad sectors on online RAID disks running under some of the more recent PERC controllers. Below is the table for different PERC controller model and connection type and chipset used.
As other MNC servers, best way to deploy new Dell server is using Dell Server Assistant CD. This CD comes along with the server. Dell Server Assistant (DSA) delivers simple, automated deployment to help you minimize the time to value of your IT infrastructure. Shipped with every Dell PowerEdge server, this bootable, stand-alone CD-ROM guides you quickly through system setup, configuration and operating system installation. It provides Dell optimized and tested drivers, diagnostics and utilities.
The tool is easy to use with a series of simple-to-follow basic or advanced interview questions through an intuitive Graphical User Interface (GUI). Plus, reboots are minimized for faster deployment. DSA delivers quick deployment capabilities enabling you to quickly set up new servers or rebuild downed systems. It also includes replication abilities to help you automatically deploy multiple identical servers. DSA features increased platform, operating system and browser support over the previous version, including 9th generation PowerEdge™ server platforms, Vmware® ESX Server™ 3.0 and Windows® Storage Server.™
Dell PowerEdge™ Diagnostics is a suite of diagnostic programs, or test modules, that run locally on your system. When you start PowerEdge Diagnostics, the devices on your system are discovered. You select diagnostics tests to run from the Diagnostic Selection tree containing the hardware that PowerEdge Diagnostics discovers on your system. You can select tests for various parts of a system and run them by clicking the Run Tests button in the Tests Selected tab. You can select to run the diagnostic tests in normal or Quick Test mode.
In the Results tab you can view the results for each individually selected test. To view the entire result message for a particular test, double-click the device and test. A message box appears displaying the entire test result message.
You can view the saved results by extracting the .html file from the .zip file into a directory you create. When you open the .html file, you will get a screen similar to the one above:
Dell Server E-Support Tool (DSET) provides the ability to collect hardware, storage and operating system information of a Dell PowerEdge server. This information is consolidated into a single System Configuration Report that can be useful for troubleshooting or inventory collection of a system. The browser user interface provides a convenient means to view specific data through hierarchical menu trees.
DSET is intended to be a small, non-intrusive tool that does not require a reboot of the system to provide basic functionality. Immediately after installation, DSET can collect information about Windows drivers, services, network settings, etc. It will also collect basic information about the system's storage such as active drives, RAID containers, etc. DSET will also collect extended hardware information such as processors, memory, PCI cards, ESM log, BIOS/firmware versions and system health (fan/voltage levels).
* This tool only available for Windows and Linux OS.
For any downloads like firmware/manuals/drives etc visit http://support.dell.com. All you need is service tag of the server or exact type of the server.
To know about server generation is very important for support through hp site. Each HP server model is known through generation prefix to the model e.g. DL380 G5. You can find generation by looking in front of the server. Another important aspect of HP server is to know HP serial number and part number. You can find those details on the asset tag sticker of HP servers.
One of the above mentioned controllers are coming by default in DL 380 series server, Like DL380 , DL 380 G2, DL 380 G3 DL 380 G4.Optional Smart array Controller is mentioned Below.
Smart Array 532
Smart Array 642
Smart Array 6400
The HP Smart array Controller Comes in Dual & Single Port.
Note: For More information about different Smart Array Controllers, refer manuals.
SmartStart is an essential single-server deployment tool that simplifies HP ProLiant server setup, providing a repeatable way to deploy reliable and consistent single-server configurations.
HP Deployment tool - SmartStart
SmartStart is delivered with every ProLiant ML and DL 300, 500, and 700 series server and supports ProLiant BL servers to provide step-by-step ProLiant deployment assistance and is included in the HP Insight Foundation Suite for ProLiant. From array configuration and OS installation to the update of optimized ProLiant server support software, SmartStart ensures a stable and reliable configuration. The SmartStart CDs contain server system utilities and operating system support software for ProLiant servers.
HP Insight Diagnostics is a proactive server management tool, available in both offline and online versions, that provides diagnostics and troubleshooting capabilities to assist IT administrators who verify server installations, troubleshoot problems, and perform repair validation. HP Insight Diagnostics Offline Edition performs various in-depth system and component testing while the OS is not running. To run this utility, launch the SmartStart CD. HP Insight Diagnostics Online Edition is a web-based application that captures system configuration and other related data needed for effective server management. Available in Microsoft® Windows® and Linux versions, the utility helps to ensure proper system operation.
The HP Insight Diagnostics Online Edition provides the capability to use non-intrusive system level checks to diagnose Smart Array SCSI hard drives. Diagnosis supports SCSI, SATA, and SAS hard drives that are attached to a Smart Array controller and configured as part of a logical volume. Diagnosis is not component testing, but is a combination of drive history and controller error status. This Diagnosis feature should be used to: • To troubleshoot why a hard drive is in a particular state. • When HP SIM reports a hard drive pre-failure or that a hard drive has failed. • When data corruption or storage problems occur, but no physical indications are evident. • When conflicting errors exist. • When repeated failures occur.
HP Insight Diagnostics provides survey functionality that gathers critical hardware and software information on ProLiant servers. If a significant change occurs between data-gathering intervals, the survey function marks the previous information and overwrites the survey data files to reflect the latest changes in the configuration. Survey functionality is installed with every SmartStart-assisted HP Insight Diagnostics installation, or it can be installed through the HP PSP .
Integrated Management Log The IML records hundreds of events and stores them in an easy-to-view form. The IML timestamps each event with 1-minute granularity. You can view recorded events in the IML in several ways, including the following: From within HP SIM From within Survey Utility From within operating system: o For Windows®: IML Viewer o For Linux: IML Viewer Application From within the iLO and iLO 2 user interface From within HP Insight Diagnostics .
The process of updating system or option firmware is referred to as a flash process or flashing the ROM. A firmware or ROM flash removes the existing version of firmware from the ROM and replaces it with a more recent version. Update the firmware to do the following: • Support new hardware, such as a processor revision • Support new features • Correct problems in a previous firmware version
Several tools are available for updating firmware. HP recommends the following methods: • ROMPaq utility • HP Smart Update Manager • Online ROM flash components • ROM Update Utility The ROM update utility is being replaced by the HP Smart Update Manager on the Firmware Maintenance CD version 7.50 and later.
Although there are different methods to update firmware on HP systems, but it is recommended to proceed with HP firmware maintenance CD. Here we will be discussing about firmware updates using Firmware maintenance CD only.
HP Smart Update Manager deployment
The HP Smart Update Manager utility is available from the ProLiant Firmware Maintenance CD. This CD and others can be downloaded free-of-charge from the HP website . To update please follow the steps: •Boot the server from the Firmware Maintenance CD. •At the prompt, select a language and keyboard and Click Continue. •Read the End-User License Agreement, click Agree . •Click the Firmware Update tab. •Click Install Firmware. The HP Smart Update Manager is initiated. Select and install the desired components For more information, see the HP ProLiant Firmware Maintenance CD User Guide on the Firmware Maintenance CD.
HP ProLiant Support Packs: PSPs represent operating system-specific bundles of ProLiant optimized drivers, utilities, and management agents. HP Systems Insight Manager : HP Systems Insight Manager is the clear choice for managing HP servers and storage by being the easiest, simplest and least expensive way for HP system administrators to maximize system uptime and health. Version Control: The VCRM (version Control Repository Manager) and VCA(Version Control Agent) are Web-enabled Insight Management Agents. HP Systems Insight Manager uses these Insight Management Agents to facilitate software update tasks.
For more information about version control tools and HP SIM, refer to the HP Systems Insight Manager Help Guide and the Version Control User Guide on the HP Systems Insight Manager
HP Warranty Lookup: http://h20000.www2.hp.com/bizsupport/TechSupport/WarrantyLookup.jsp?country=&prodSeriesId=454811&prodTypeId=12454 HP Part Information: http://partsurfer.hp.com/search.aspx HP ProLiant Support Site http://h20180.www2.hp.com/apps/Nav?h_pagetype=s-001&h_lang=en&h_cc=us&h_product=241435&h_client=S-A-R163-1&h_page=hpcom&lang=en&cc=us
Basic diagnostics a. Check the Front panel LED. b. Check the physical connectivity. c. Check the Power and reset switch (some times reset switch struck). d. Check the Beep codes. e. Check the Amber LED’s on the motherboard and SMPS’s backside. Extended Diagnostics a. Reset BIOS to defaults (using system maintenance switch) b. Check the SMPS output power by multi-meter,(check the power good signal) c. Run F10 Diagnostics for G1 and f9 for next generation severs d. Remove the extra ram ( swap and check the first slot ram also) e. Start server with Minimum Configuration and identify the faulty component. f. Clear bios thru dip switch g. If still no display Do the bios recovery
a. Intend the spare with part number , server Generation and serial number(optional) otherwise you couldn’t identify the correct spare b. Check the part number of the spare and server serial number. c. Replace the Spare what you identified d. Run the bios setup and Run inspect utility, we can find out which spare gone bad. e. Update drivers , BIOS and Firmware
Resolution and functionality check
Error on POST or Not Booting Basic Diagnostics a. Identify the Error from POST error message b. Identify the error which it is related. c. Check the System bios Event Log through F2 Diagnostics. d. Check the array controller & array log e. Check the Front panel Lpd (light path diagnostics) for errors. f. Check the LCD panel for Errors.(optional) g. Check the Beep codes. h. Check the mother board Amber LED’s for Errors.
Extended Diagnostics a. Identify the problem due to hardware (or) Operating system b. Reset BIOS to defaults c. Do bios recovery d. Update System BIOS & Firmware through firmware update CD e. Check the generation of the server- use this Url-HP Firmware f. Start server with Minimum Configuration and identify the faulty component.
a. Check the Replaced spare part number and configuration b. Check the part number through part surfer c. After replaced the Spare check iml log file. d. Remove the replaced spare and check e. Update drivers, BIOS and Firmware f. Run setup inspect Diagnostics g. Check and Clear the existing System Event Log
Basic diagnostics a. Diagnose through IML Log b. Run OS Stress tests c. Check the motherboard and check the component over heat , d. Install bios and firmware suitable version. e. Check all boards if represent any amber LED Extended diagnostics a. Update Drivers , latest BIOS and Firmware b. Update the controller firmware c. Check OS Patches and updates d. Observe system performance e. Eliminate present OS and check with new os
j. Run setup inspect Diagnostics k. Check the IML log and ensure there is no error. l. Run the OS Stress test and ensure system is working fine. F. Run Standard peripherals test. i. CD / DVD – Read , Floppy Drive – Read & Write ii. Tape Drives – Read & Write. iii. Network – Ping to Gateway iv. External Storage access – Drive access
Resolution & Functionality Check:
Compaq Smart Array & HDD related issues Basic diagnostics a. Diagnose through Raid Log ( use smart start cd , view log file) b. Diagnose through smart array diagnose(ADU) utility c. Run smart start cd and view the log file d. Update Raid Driver & Firmware, Backplane Firmware & HDD Firmware, and then Observe.(before update check the i2c cable with back plane) e. Use specific scsi cable mentioned by hp use for the back plane f. If you have an error code check proliant server error codes
a. Run Stress tools and try to simulate (HP Data creator, Multiple DD commands in Linux / Unix) b. Update Raid Driver & Firmware, Backplane Firmware & HDD Firmware, and then Observe.
Extended Diagnostic
Resolution & Functionality Check a. Replace the Spare identified, if any b. Update the HDD firmware to the latest one, check the part number before replaced (confirm with old replaced spare). c. Update Raid Card Drivers and then Firmware, if the replaced spare is Raid Card. d. Update backplane Firmware e. Check the IML log and ensure there is no error. f. Run the OS Stress test and ensure system is working fine. g. Run Standard Peripherals test. i. CD / DVD – Read , Floppy Drive – Read & Write ii. Tape Drives – Read & Write. iii. Network – Ping to Gateway iv. External Storage access – Drive access
Basic Health Check a. Visual Inspection for bend , dent, breakage etc.. b. Open the Server and ensure the components and add-ons are fixed properly c. Power on and check for Display d. Check LED’s and Panel for Errors e. Run Inspect utility and check for errors. f. Check IML log for errors. g. Update drivers, BIOS and Firmware – (Optional if there is no error reported in System Event log and Diagnostic Log. & Must for the new systems @ Logistics ) h. Configure Raid – If raid card is available and not configured already. i. Load Supported Windows / Linux OS - If OS is not loaded already.. j. Take IML Report and analyze the results for errors and predictive failure. k. Run Standard peripherals test. i. CD / DVD – Read , Floppy Drive – Read & Write ii. Tape Drives – Read & Write. iii. Network – Ping to Gateway
l. Install Insight manager and run diagnostics and survey utility. m. Run Stress test for CPU/Memory Subsystem – Linpack can be used directly on Linux servers and for Windows Servers, boot from KNOPIXX Linux CD and run the Linpack from HDD or PEN drive. n. Run Microsoft memory Diagnostics - Windiag o. Run Stress test for Disk Subsystem – HP Data Creator & Reader/SQL IO etc. p. Save and Clear the existing IML logs
Extended Health Check
01, Smart Start CD – configure array, deploy os, generate log file Visit For Generation identification: Generation 1 server – Use Smart start 5.5 Generation 2 Server – Use Smart start 7.7 Above all model server – latest version 02, firmware maintenance cd- you can update bios, firmware.: For Generation 1 server – Firmware CD 7.3 Generation 2&above server – latest version
If you need to save on restore time and hassle in the event of a disaster, choose a plan that includes full and differential backups. In this strategy, you perform a regular backup of all files (weekly, bi-weekly, etc.), then a more frequent backup (daily) of all files that have changed since the last full backup session. This full/differential backup method helps the restore process run more efficiently, because only one full backup tape and one differential backup tape are required for a complete restore of the system. However, this method is slower on the backup process because more files are copied daily.
o Standardized protocol dealing with electrical, mechanical and signaling interfaces.
o Concerned with the transmission of bits
Data Link Layer:
o Logical Link Control (LLC) defines how data is transferred over the cable and provides data link service to the higher layers.
o Medium Access Control (MAC) defines who can use the network when multiple computers are trying to access it simultaneously (i.e. Token passing, Ethernet [CSMA/CD], etc...).
o Handles errors in the physical layer.
o Groups bits into frames and ensures their correct delivery.
Network:
o Concerned with the transmission of packets.
o Choose the best path to send a packet (routing).
o It may be complex in a large network (e.g. Internet).
o Shortest (distance) route vs. route with least delay.
o Most popular protocol widely used is x.25 and TCP/IP
o Network layer does not deal with lost messages.Transport layer ensure reliable service. It breaks the message (from sessions layer) into smaller packets, assigns sequence number and sends them.
o TCP/IP is widely used for network/transport layer. UDP (Universal Datagram Protocol): Internet connectionless transport protocol.
Session & Presentation Layer:
o Just theory! Very few applications use it.
Application Layer:
o Collection of miscellaneous protocols for high level applications
o Electronic mail, file transfer, connecting remote terminals, etc.
o E.g. SMTP, POP, IMAP, FTP, Telnet, SSH, HTTP, HTTPS, SNMP, etc...
o A repeater connects two segments of your network cable. It retimes and regenerates the signals to proper amplitudes and sends them to the other segments.
Hub:
o A type of repeater used on several network architectures which usually connects several stations.
Switch:
o A network switch is a computer networking device that connects network segments.
o The term commonly refers to a network bridge that processes and routes data at the data link layer (layer 2) of the OSI model. Switches that additionally process data at the network layer (layer 3 and above) are often referred to as Layer 3 switches or multilayer switches.
o The term network switch does not generally encompass unintelligent or passive network devices such as hubs and repeaters.
o A bridge reads the outermost section of data on the data packet, to tell where the message is going. It reduces the traffic on other network segments, since it does not send all packets. Bridges can be programmed to reject packets from particular networks. Bridging occurs at the data link layer of the OSI model, which means the bridge cannot read IP addresses, but only the outermost hardware address of the packet. In our case the bridge can read the Ethernet data which gives the hardware address of the destination address, not the IP address. Bridges forward all broadcast messages. bridges use one of:
• Transparent Bridging - They build a table of addresses (bridging table) as they receive packets. If the address is not in the bridging table, the packet is forwarded to all segments other than the one it came from. This type of bridge is used on Ethernet networks.
• Source route bridging - The source computer provides path information inside the packet.
o A router is used to route data packets between two networks. It reads the information in each packet to tell where it is going. If it is destined for an immediate network it has access to, it will strip the outer packet, readdress the packet to the proper Ethernet address, and transmit it on that network. If it is destined for another network and must be sent to another router, it will re-package the outer packet to be received by the next router and send it to the next router. The section on routing explains the theory behind this and how routing tables are used to help determine packet destinations. Routing occurs at the network layer of the OSI model. They can connect networks with different architectures such as Token Ring and Ethernet. Although they can transform information at the data link level, routers cannot transform information from one data format such as TCP/IP to another such as IPX/SPX. Routers do not send broadcast packets or corrupted packets. If the routing table does not indicate the proper address of a packet, the packet is discarded.
o A router is used to route data packets between two networks. It reads the information in each packet to tell where it is going. If it is destined for an immediate network it has access to, it will strip the outer packet, readdress the packet to the proper Ethernet address, and transmit it on that network. If it is destined for another network and must be sent to another router, it will re-package the outer packet to be received by the next router and send it to the next router. The section on routing explains the theory behind this and how routing tables are used to help determine packet destinations. Routing occurs at the network layer of the OSI model. They can connect networks with different architectures such as Token Ring and Ethernet. Although they can transform information at the data link level, routers cannot transform information from one data format such as TCP/IP to another such as IPX/SPX. Routers do not send broadcast packets or corrupted packets. If the routing table does not indicate the proper address of a packet, the packet is discarded.
Internet Protocol Because TCP/IP networks are interconnected across the world, each computer on the Internet must have a unique address (called an IP address) to make sure that transmitted data reaches the correct destination. (e.g. 192.168.1.1) Classes of IP
Class A addresses can have up to 16,777,214 hosts on a single network. Class A addresses are in this range: 1.x.x.x to 126.x.x.x.
Class B addresses can have up to 65,354 hosts on a single network. Class B addresses are in this range: 128.1.x.x to 191.254.x.x.
Class C addresses can have up to 65,354 hosts on a single network. Class B addresses are in this range: 192.0.1.x to 223.255.254.x.
Netmask In each of the address classes previously described, the size of the two parts (network address and host address) is implied by the class. This partitioning scheme can also be expressed by a netmask associated with the IP address. A netmask is a 32-bit quantity that, when logically combined (using an AND operator) with an IP address, yields the network address. For instance, the netmasks for Class A, B, and C addresses are 255.0.0.0, 255.255.0.0, and 255.255.255.0, respectively. By looking at the addressing structures, you can see that even with a Class C address, there are a large number of hosts per network. Such a structure is an inefficient use of addresses if each end of a routed link requires a different network number. It is unlikely that the smaller office LANs would have that many devices. You can resolve this problem by using a technique known as subnet addressing.
Netmask Contd… The following table lists the additional subnet mask bits in dotted-decimal notation. To use the table, write down the original class netmask and replace the 0-value octets with the dotted-decimal value of the additional subnet bits. For example, to partition your Class C network with subnet mask 255.255.255.0 into 16 subnets (four bits), the new subnet mask becomes 255.255.255.240. The following table displays several common netmask values in both the dotted-decimal and the mask-length formats.
VLAN & Its Usage A virtual LAN, commonly known as a VLAN, is a group of hosts with a common set of requirements that communicate as if they were attached to the same broadcast domain, regardless of their physical location. A VLAN has the same attributes as a physical LAN, but it allows for end stations to be grouped together even if they are not located on the same network switch. Network reconfiguration can be done through software instead of physically relocating devices. VLANs are created to provide the segmentation services traditionally provided by routers in LAN configurations. VLANs address issues such as scalability, security, and network management. Routers in VLAN topologies provide broadcast filtering, security, address summarization, and traffic flow management. By definition, switches may not bridge IP traffic between VLANs as it would violate the integrity of the VLAN broadcast domain. This is also useful if someone wants to create multiple Layer 3 networks on the same Layer 2 switch. For example, if a DHCP server (which will broadcast its presence) was plugged into a switch it will serve any host on that switch that was configured to get its IP from a DHCP server. By using VLANs you can easily split the network up so some hosts won't use that DHCP server and will obtain link-local addresses, or obtain an address from a different DHCP server
ARP ARP (Address Resolution Protocol) is used to resolve the ethernet address of a NIC from an IP address in order to construct an ethernet packet around an IP data packet. This must happen in order to send any data across the network. An IP address alone cannot be used to deliver data from one LAN device to another. To send data between LAN devices, you must convert the IP address of the destination device to its MAC address. Each device on an Ethernet network has a unique MAC address, which is a 48-bit number assigned to each device by the manufacturer. The technique that associates the IP address with a MAC address is known as address resolution. Internet Protocol uses the Address Resolution Protocol (ARP) to resolve MAC addresses. If a device sends data to another station on the network and the destination MAC address is not yet recorded, ARP is used. An ARP request is broadcast onto the network. All stations (computers, for example) on the network receive and read the request. The destination IP address for the chosen station is included as part of the message so that only the station with this IP address responds to the ARP request. All other stations discard the request.
Layer 2 Networks: Basically a layer 2 switch operates utilizing Mac addresses in it's caching table to quickly pass information from port to port. A layer 3 switch utilizes IP addresses to do the same. A layer 2 switch is essentially a multiport bridge. A layer 2 switch will learn about MAC addresses connected to each port and passes frames marked for those ports. It also knows that if a frame is sent out a port but is looking for the MAC address of the port it is connected to and drop that frame. Whereas a single CPU Bridge runs in serial, today's hardware based switches run in parallel, translating to extremely fast switching. It includes terms like, VLAN, Spanning Tree etc.
Layer 3 Networks: The network layer provides the functional and procedural means of transferring variable length data sequences from a source to a destination via one or more networks, while maintaining the quality of service requested by the Transport layer. The Network layer performs network routing functions, and might also perform fragmentation and reassembly, and report delivery errors. Routers operate at this layer sending data throughout the extended network and making the Internet possible. This is a logical addressing scheme values are chosen by the network engineer. The best-known example of a layer 3 protocol is the Internet Protocol (IP). It manages the connectionless transfer of data one hop at a time, from end system to ingress router, to router to router, and from egress router to destination end system.
Ethernet Straight Cabling: The UTP cable contains eight conductors, arranged in four twisted pairs, and is terminated with an RJ45 type connector. A normal straight-through & Cross Over UTP Ethernet cable follows the EIA568B standard wiring as described below.
Note that here on the other end of cable we are using pin 3 & 6 in replacement of 1 & 2 and vice versa
These are infections that infect both the program files and boot
records. These are difficult to remove, the infections from both the
locations should be removed or else they might re-infect any one of the
locations back again. Example:-One_Half,Emperor.
Macro viruses:-
These type of Viruses infect data files. They Have cost Corporations millions. The Evolution Of Visual Basic in 97,a macro Virus can be written that can infect data files. It infects MS Word, Excel, Power point. They are circulating good numbers across the world because of the ease with which they can be created. Ex:-W97M.Mellisa,W97M.NiceDay.
Case I: - Unknown Infection •Suspicious activity observed and no files detected •Activities such as pop ups, file creation, URL redirection, network congestion are observed. Troubleshoot/Locate the files responsible •Check netstat output for connections made to any unknown IP addresses. •View files based on their modified dates to check for recent changes made on the system ,typically in folders like /Windows/System32 , /Windows/System32/drivers •Check the registry HKeyLocalMachine\SOFTWARE\Microsoft\Windows\CurrentVersion\Runfor any startup entries made by the virus •Check the task manager for any unknown, random name processes running •Enable view for hidden files and folders
Case II: - Unable to clean •Virus detected but unable to clean/delete/quarantine •As seen in the virus logs files are detected but not cleaned/removed from the affected machines. Troubleshooting •If files are in the temporary internet folder then would be in use and may not get deleted. We recommend you to delete the contents of the temporary internet files folder. •Boot machine in safe mode and run a manual scan •Use tools from recognized AV vendors like Symantec, Mcafee and Trend Micro like:- •Mcafee Avert Stinger. •Trend Micro House call 7.1 •Norton Security Scanner from Symantec.
Case III: - Re-infection •Virus detected but re‐infection occurring in the network. •As seen in the virus logs files are detected and cleaned/removed from the affected machines but re‐infection occurring through shared folders/mapped drives or infected USB drives. Troubleshooting •Since malware are detected and taken care of in such a way that you may need to check the sources of re‐infection and security holes in the network responsible for them. •Check virus logs to look for the infection source of the malware. •Infection level in the network will not reduce until infection sources are eliminated.