Description
As a Data Center Operations Technician at xAI, you will be responsible for the health of our server and network infrastructure for Data Centers and Global Points of Presence. You will be responsible for our two most important data center operations metrics: mean time to detect (MTTD) and mean time to repair (MTTR).
Your primary responsibilities will include:
- Reporting to the job site during initial construction and reporting back to the engineering team as required.
- Performing troubleshooting and monitoring of the servers and network in our data centers and global points of presence.
- Rack and stacking of data center network equipment.
- Maintaining Warehouse inventory and asset management using our internal application.
- Labelling and troubleshooting for fibre/optics cables.
- Power supply cabling, installation, troubleshooting and repair.
- Installation of racks, servers and switches; this includes staging racks in place, cabling, power up and handoff of hardware to the provisioning team for customer capacity allocation.
- Managing, responding and resolving of data center operations tickets used cross functionally within xAI via Jira.
- Creating and maintaining documentation of tasks and standard operating procedures.
- Receipt and decommissioning of data center hardware.
- Vendor returns for infrastructure under and out of warranty.
- Managing spare parts inventory within the data center.
- Defining, designing, and implementing network layouts and solutions within our data centers.
To be successful in this role, you will need:
- A high school diploma or equivalency certificate.
- 2+ years of experience working with server, storage, compute and network hardware.
- 2+ years of experience troubleshooting and repairing servers and networking infrastructure.
- 2+ years of experience in Inventory Management, and ordering, receiving and shipping server and network equipment.
- Strong Linux skills, including navigating the system's directories and filing system, manipulating files in the Linux shell, user permission configuration, package installation and software management.
- Ability to identify and apply different filesystem types, using Linux commands for process management, basic troubleshooting and debugging, and Bash or other scripting.
- Experience being on-call and ability to respond to critical events as needed.
- Experience leading Data Center Infrastructure projects.
- Curious to always learn new things within the Data Center World.
- Excellent prioritization and time management skills.
- Able to work in a fast-paced environment.
- Detail-oriented.
- Oracle Experience.
- Inventory Management.
- 4+ years of experience in Structured Cabling Copper/Fibre.
- 4+ years of experience in Power and Cooling concepts inside the data center.
This listing is enriched and indexed by YubHub. To apply, use the employer's original posting:
https://job-boards.greenhouse.io/xai/jobs/4741579007