- Topic ID: task_czp_f1d_zhb
- Version: 4.0
- Date: Nov 8, 2021 5:12:29 AM
Z8G4 Recon GPU Card Replacement
Prerequisites
Complete Recon GPU Card Troubleshooting before ordering and replacing the Recon GPU card.
Overview
This procedure shall be followed when replacing the Recon GPU card in a Z8G4 computer (Open Console).
Figure 1. Recon GPU card
1 Preparation
Procedure
- Shut down the system. Select one of the following methods to power off the console:
  - If applications are up, click the Shut Down button on the desktop display and select Shutdown.
  - If applications are down, open a terminal window. Type: halt , then press ENTER.
  - When the halt command has finished, power off the console at the front panel switch.
- Apply LOTO. See Equipment Service - Lockout-Tagout-PPE procedure.
2 Recon GPU Card Replacement
Procedure
- Open and remove the module small side cover.
Figure 2. Remove module small side cover

- Locate the Recon GPU card using the illustration below.
Figure 3. Z8G4 Component Location
- Disconnect the power cable from the Recon GPU card, press the card slot release lever, and pull out the old Recon GPU card.
Note: Lift up the card latch when removing the card (see Figure 4).
Figure 4. PCI card latch

- Replace the existing Recon GPU card with the new one, inserting it in slot #4.
- Carefully push the GPU card into slot #4, avoiding interference with the cables.
Figure 5. Install GPU Card

- Connect the power cable to the GPU card, confirm that it is firmly connected, and secure it so that it cannot vibrate or work loose.
- Close the PCI card retention clamp.
- Reinstall all removed components, and reconnect any cables that have been disconnected.
3 Restore the Console
Procedure
- Remove LOTO from console.
- Reinstall the module small side cover.
4 Finalization
Procedure
- Confirm that the Host computer powers up when console power is turned on.
- Check that the GPU card is installed. Open a shell, then type:
{ctuser@hostname} ls /proc/driver/nvidia/gpus | wc -l

  - If “2” displays, the Recon GPU card is installed.
  - If “1” displays, the Recon GPU card is not installed or is not detected.
- Ensure GPU ECC state is ON:
- Open a Unix shell and log on as root.
- Type: su - [ENTER].
- Type the root password [ENTER].
- Type: nvidia-smi [ENTER].
- Check the GPU ECC status in the output:
- If the GPU ECC is ON, the output looks like the example boxed in green:
- If the GPU ECC is OFF, the output looks like the example boxed in red:
- To turn ECC back on:
- Type: nvidia-smi -g 0 --ecc-config=1 [ENTER]
- A message will show that ECC is enabled and that a reboot is required:
- After the reboot, check that ECC is ON by repeating the previous steps.
- Perform the Functional Checks → System Scanning Test instructions from the procedure list.
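The two software checks in Finalization (GPU count and ECC state) can be sketched as a small shell helper. This is an illustration under stated assumptions, not part of the official procedure: the function names count_gpus, describe_gpu_count and gpus_without_ecc are hypothetical, and the ECC check assumes that the installed nvidia-smi supports the --query-gpu=index,ecc.mode.current query with CSV output and reports Enabled for a GPU whose ECC is ON.

```shell
#!/bin/sh
# Hypothetical helpers; names and messages are illustrative only.

# Count the entries in the NVIDIA driver's per-GPU directory.
# The directory is an optional argument so the function can be tried without a GPU.
count_gpus() {
    gpu_dir="${1:-/proc/driver/nvidia/gpus}"
    ls "$gpu_dir" 2>/dev/null | wc -l | tr -d ' '
}

# Map the GPU count to the outcome described in the Finalization steps.
describe_gpu_count() {
    case "$1" in
        2) echo "Recon GPU card is installed" ;;
        1) echo "Recon GPU card is not detected" ;;
        *) echo "unexpected GPU count: $1" ;;
    esac
}

# Read "index, mode" lines on standard input (assumed CSV query output)
# and print the index of every GPU whose ECC mode is not "Enabled".
gpus_without_ecc() {
    while IFS=, read -r idx mode; do
        [ -n "$idx" ] || continue              # skip empty lines
        mode=$(echo "$mode" | tr -d ' ')       # drop the space after the comma
        [ "$mode" = "Enabled" ] || echo "$idx"
    done
}
```

On the console, as root, the helpers might be used like this: describe_gpu_count "$(count_gpus)" reports the card status, and nvidia-smi --query-gpu=index,ecc.mode.current --format=csv,noheader | gpus_without_ecc lists any GPU index that still needs ECC turned on. The functions take their input as arguments or on standard input, so they can be exercised without GPU hardware.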