ML Compiler Engineer

HuggingFace · Remote (Global) · $180k - $320k

full-time senior Posted 3 months ago

Apply Now Stand out: build a proof-of-work pitch →

Free GitHub-based preview. Direct apply stays one click away.

Get weekly job alerts like this →

Hiring for this role?

AI Market Demand Pack · $29 one-time

Compare this role's skills with the full AI hiring market. Get ranked demand, salary bands, leading companies, public source URLs, and a decision brief.

See the live sample →

ml-compiler python c++ cuda quantization optimization inference

About this role

Optimize model inference for HuggingFace's inference API. Work on model compilation, quantization, and hardware-specific optimization. Make models run faster and cheaper for millions of users.

Requirements

Experience with ML compilers (TVM, XLA, TensorRT) or model optimization. Strong C++/Python. Understanding of hardware architectures (GPU, TPU).

Job Details

Company: HuggingFace
Location: Remote (Global)
Workplace: Remote
Hiring region: Global - international candidates welcome
Type: full-time
Level: senior
Salary: $180k - $320k

Similar Jobs

Senior Software Engineer - Robotics, Perception (C++, Python)

api-designroboticsc++python

Infrastructure Engineer — GPU Cloud

San Francisco, CA / New York, NY · $200k - $350k

pythondistributed-systemsgpucudainfrastructurecloud

ML Engineer — Pre-Training

Paris, France · $180k - $350k

pytorchdistributed-trainingllmpythoncudatransformerspre-training

Staff Python / PyTorch Developer — Frontend Inference Compiler – Dubai

pytorchllmgenerative-aitensorflowagentspythonfrontendinference

Related searches:

Remote Jobs Senior Jobs Remote Senior Jobs Senior AI Infrastructure Senior Backend & Systems ml-compiler python c++cuda quantization optimization inference

Get jobs like this delivered weekly

Free AI jobs newsletter. No spam.