Product Manager, Compute NPI

FluidStack · San Francisco, CA · $150k - $250k
full-time senior Posted 1 month ago

About this role

ABOUT FLUIDSTACK At Fluidstack, we build the compute, data centers, and power that will fuel artificial superintelligence. We work with Anthropic, Google, Meta, AMI Labs, and Black Forest Labs to deploy gigawatts of compute at industry defining speeds. We are investing tens of billions of dollars in US infrastructure. In 2026, we will deploy 1GW. In 2027, 10GW. Our team is small, fast, and obsessed with quality. We own outcomes end-to-end, challenge assumptions, and treat our customers' problems as our own. No task is beneath anyone here. There are a few thousand people who will shape the trajectory of superinteligence. Come and be one of them. ABOUT THE ROLE We're hiring a Product Manager to lead NPI (New Product Introduction) for GPU infrastructure, working closely with datacenter, infrastructure, and networking teams to introduce new GPU SKUs and compute offerings. You'll define how Fluidstack evaluates, qualifies, and brings new GPU generations to market—from NVIDIA Blackwell and Rubin to AMD MI300X and future accelerators. This is a highly cross-functional role requiring deep technical judgment, vendor relationship management, and an understanding of how hardware capabilities map to customer workload requirements. You'll ensure Fluidstack maintains its competitive edge by offering the right mix of compute options optimized for training, inference, and specialized AI workloads. WHAT YOU'LL DO - Own the NPI roadmap for GPU SKUs, including evaluation criteria, qualification timelines, and go-to-market strategy for new hardware generations - Partner with datacenter teams to define requirements for power delivery (HVDC/LVDC), cooling (liquid vs. air), rack architecture, and physical infrastructure needed for next-gen GPUs - Work with infrastructure engineers to validate hardware performance across key dimensions: training throughput (MFU), inference latency (TTFT, TBT), memory bandwidth, interconnect topology (NVLink, InfiniBand) - Drive vendor engagement with NVIDIA, AMD, and emerging XPU providers—conducting technical deep dives, negotiating supply agreements, and managing early access programs - Define product specifications for system configurations: single-GPU instances, multi-GPU nodes, full rack deployments, and megacluster topologies - Analyze customer workload profiles to determine optimal GPU mix: H100 for large model training, L40S for inference, B200 for frontier research, MI300X for cost-sensitive workloads - Build business cases for new SKU introductions, including CapEx requirements, depreciation models, utilization forecasts, and competitive pricing analysis - Create technical documentation and benchmarking reports that help customers select the right GPU for their use case - Monitor GPU availability, supply chain constraints, and allocation strategies to ensure Fluidstack can meet customer demand while maintaining healthy margins - Collaborate with networking teams to ensure interconnect fabric (RoCE, InfiniBand) scales with GPU performance and supports distributed training patterns ABOUT YOU - 5+ years product management experience with at least 3 years focused on infrastructure, hardware platforms, or cloud compute services - Strong technical background in GPU architecture, accelerator performance characteristics, and AI workload requirements - Experience managing NPI processes from evaluation through production deployment—including vendor relationships, qualification testing, and rollout planning - Deep understanding of datacenter infrastructure: power distribution, thermal management, rack design, and high-density deployment constraints - Track record of making build vs. buy decisions on hardware platforms based on TCO analysis, competitive positioning, and customer demand signals - Familiarity with GPU performance metrics (TFLOPS, HBM bandwidth, TDP, MFU) and how they translate to real-world training and inference performance - Ability to work with engineering teams to debug hardware issues, analyze telemetry data, and identify root causes of performance degradation - Experience conducting competitive analysis of cloud GPU offerings from AWS, GCP, Azure, CoreWeave, Lambda Labs, and other specialized providers - Comfortable navigating supply chain complexity, allocation negotiations, and procurement timelines with hardware vendors - Bonus: Experience with networking topologies (fat tree, rail-optimized), storage systems (NVMe, Ceph), or HPC infrastructure design COMPENSATION To provide greater transparency to candidates, we share base pay ranges for all US-based job postings. Our compensation package includes base salary, equity, benefits, and for applicable roles, commissions plans. Our cash compensation range for this role is $150,000-$250,000. Final offers vary based on geography, candidate experience, relevant credentials, and other factors. Outstanding candidates may be eligible for adjusted terms plus meaningful equity. W

Similar Jobs

Related searches:

On-site Jobs Senior Jobs On-site Senior Jobs Senior Backend & SystemsSenior AI Infrastructure AI Jobs in San Francisco Backend & Systems in San FranciscoAI Infrastructure in San Francisco distributed-systemsgpu