Senior Data Center Deployment Engineer
full-time
senior
Posted 1 month ago
About this role
Why work at Nebius
Nebius is leading a new era in cloud computing to serve the global AI economy. We create the tools and resources our customers need to solve real-world challenges and transform industries, without massive infrastructure costs or the need to build large in-house AI/ML teams. Our employees work at the cutting edge of AI cloud infrastructure alongside some of the most experienced and innovative leaders and engineers in the field.
Where we work
Headquartered in Amsterdam and listed on Nasdaq, Nebius has a global footprint with R&D hubs across Europe, North America, and Israel. The team of over 1400 employees includes more than 400 highly skilled engineers with deep expertise across hardware and software engineering, as well as an in-house AI R&D team.
The role
Nebius operates large-scale, GPU-dense AI infrastructure across mission-critical data center environments. As a Senior Data Center Deployment Engineer, you will own the end-to-end delivery, deployment, and production readiness of next-generation GPU platforms inside our data centers. This role sits at the intersection of hardware, Linux systems, and operational execution. You will lead on-site rack bring-up, validate NVIDIA-based AI systems, coordinate repairs, and ensure GB-series infrastructure moves from installation to fully operational production environments with precision and reliability. You will collaborate closely with hardware engineering, networking, and infrastructure teams to deploy and stabilize H200 and B200-based GPU systems at scale.
Your responsibilities will include:
Lead end-to-end deployment of GB-series racks within data center environments
Oversee installation, bring-up, validation, and production readiness of NVIDIA H200 and B200-based servers
Troubleshoot complex hardware, firmware, Linux OS, and networking issues
Execute structured testing and validation procedures during deployment
Develop and maintain basic Linux-based hardware health-check and diagnostic scripts
Coordinate on-site hardware repairs, part replacements, and vendor escalations
Drive root cause analysis and ensure corrective actions are implemented
Manage and prioritize deployment timelines across multiple concurrent rollouts
Provide technical leadership and guidance to on-site engineers and technicians
Partner with networking and infrastructure teams to ensure seamless integration
Document deployment processes, validation standards, and operational runbooks
What we expect you to have:
Strong hands-on experience deploying and operating data center infrastructure
Deep familiarity with GPU-dense systems, ideally NVIDIA H-series platforms
Experience working with high-density rack deployments (GB-series or similar)
Solid Linux experience, including troubleshooting and scripting
Ability to diagnose issues across hardware, OS, firmware, and network layers
Experience coordinating field repairs and working directly with hardware vendors
Proven experience leading technical teams or overseeing field operations
High ownership mindset and ability to operate in production-critical environments
Clear communication skills and ability to collaborate across distributed teams
It will be an added bonus if you have:
Experience deploying AI or HPC clusters at scale
Familiarity with automated provisioning or infrastructure lifecycle systems
Background in hardware qualification, burn-in testing, or factory validation
Experience supporting rapid infrastructure expansion
Exposure to ARM-based or heterogeneous compute environments
Working conditions:
Collaboration with globally distributed engineering and operations teams
Key employee benefits:
Health insurance: 100% company-paid medical, dental, and vision coverage for employees and families
401(k) plan: up to 4% company match with immediate vesting
Parental leave: 20 weeks paid for primary caregivers, 12 weeks for secondary caregivers
Remote work reimbursement: up to $85/month for mobile and internet
Disability & life insurance: company-paid short-term, long-term, and life insurance coverage
Compensation
We offer competitive salaries ranging from $125k to $180k base, plus quarterly performance bonuses.
Join Nebius today and help build the software that powers the next generation of AI infrastructure.
What we offer
Competitive salary and comprehensive benefits package.
Opportunities for professional growth within Nebius.
Flexible working arrangements.
A dynamic and collaborative work environment that values initiative and innovation.
We’re growing and expanding our products every day. If you’re up to the challenge and as excited about AI and ML as we are, join us!