Description

About the Team

The Model Deployment & Inference Solutions team in GM AV deploys machine learning models from training frameworks (e.g. PyTorch) onto autonomous vehicle hardware. Our mission is two-fold: build the ML deployment platform that makes model rollouts fast and predictable, and optimize models so they meet the real-time latency and memory budgets required to run on-vehicle. Our work is on the critical path of GM's publicly committed launch of eyes-off (hands-free, eyes-free) autonomous driving in 2028, debuting on the Cadillac Escalade IQ, building on Super Cruise's billion-plus hands-free miles.

About the Role

This role sits in the team's Platform pillar. We own the unified ML deployment platform that automates the path from a trained model to inference on the vehicle, along with the developer-experience and agentic-tooling layer that makes deployment self-serve for every ML model development team at GM.

What You’ll Be Doing (Responsibilities)

Design, build, and operate the ML deployment platform that automates the path from trained model to on-vehicle inference.

Drive cross-organization model deployments to the autonomous vehicle stack, partnering with model development teams to take high-value models from training to production on-vehicle.

Build agentic tools that diagnose and fix deployment-blocking issues, automating workflows currently performed manually by engineers.

Build the developer experience that ML model development teams use day to day: tooling, dashboards, automation, and observability.

Drive shift-left validation that surfaces deployment risk (compile, runtime, parity, latency) early in the model development cycle.

Build platform tools that integrate the work of our sister teams (kernels, compiler, reduced precision and parity) so their optimization wins land directly in the deployment workflow.

Partner with the team's Performance pillar and model development teams across the AV organization.

Your Skills & Abilities (Required Qualifications)

BS, MS, or PhD in Computer Science or a related technical field.

3+ years of relevant industry experience.

Strong fundamentals and excellent coding ability in Python.

Experience building or operating production platform or infrastructure systems where reliability, observability, and extensibility matter.

Experience with ML model deployment, inference integration, model optimization workflows, or model serving infrastructure, with at least one prior context where you owned the path from a trained model to a running inference workload.

Experience using coding agents (Cursor, Claude Code, GitHub Copilot, or equivalent) as part of your engineering workflow.

Experience designing clean, well-tested software with clear interfaces and good abstractions.

Strong cross-team collaboration skills.

What Will Give You A Competitive Edge (Preferred Qualifications)

Experience building agentic or LLM-powered developer tooling.

Experience with ML or workflow orchestration frameworks (Airflow, Temporal, Flyte, Ray, Kubeflow, or equivalent).

Familiarity with the NVIDIA GPU stack at the integration level (CUDA-aware Python, TensorRT, Triton Inference Server, torch.compile, ONNX).

Experience with inference-serving frameworks (Triton, TorchServe, Ray Serve, vLLM) or edge-deployment toolchains.

Experience with low-latency or real-time systems.

Experience in autonomous vehicles, robotics, or other safety-critical ML deployment domains.

Open-source contributions to PyTorch, Ray, Airflow, Temporal, vLLM, TensorRT, or related projects.

Compensation: The compensation information is a good faith estimate only. It is based on what a successful applicant might be paid in accordance with applicable state laws. The compensation may not be representative for positions located outside of New York, Colorado, California, or Washington.

The salary range for this role is $128,700 to $261,300. The actual base salary offered to a successful candidate within this range will vary based on factors relevant to the position.

Bonus Potential: An incentive pay program offers payouts based on company performance, job level, and individual performance.

Benefits: GM offers a variety of health and wellbeing benefit programs. Benefit options include medical, dental, vision, Health Savings Account, Flexible Spending Accounts, retirement savings plan, sickness and accident benefits, life insurance, paid vacation & holidays, tuition assistance programs, employee assistance program, GM vehicle discounts, and more.

#GM-AV-1

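This role is remote-based. However, if the selected candidate lives within a specific distance of a GM hub, they are expected to report to that location three times per week, or at another frequency set by their manager.

The selected candidate will be required to travel less than 25% of the time for this role.

This role may be eligible for relocation benefits.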

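Diversity Information

General Motors is committed to being a workplace that is not only free of unlawful discrimination, but one that genuinely fosters inclusion and belonging. We believe that a diverse and inclusive environment allows our employees to thrive and develop better products for our customers. We therefore encourage interested candidates to review the key responsibilities and qualifications for each position and to apply for any position that matches their skills and abilities. During the hiring process, applicants must successfully complete a role-related assessment (where applicable) and/or a pre-employment screening. For more information, see GM's guide to the hiring process.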

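Equal Employment Opportunity Statement (U.S.)

General Motors is proud to be an equal opportunity employer. Qualified applicants receive consideration for employment without regard to race, color, sex, sexual orientation, gender identity, national origin, disability, or protected veteran status.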

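Accommodations (U.S. and Canada)

General Motors offers employment opportunities to all job seekers, including individuals with disabilities. If you need a reasonable accommodation to assist with your job search or application for employment, email us at [email protected] or call us at 800-865-7580. In your email, please include a description of the specific accommodation you are requesting, along with the job title and requisition number of the position for which you are applying.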

Senior ML Inference Engineer - Platform
