설명

General Motors is a global leader in advanced driver assistance, with Super Cruise hands-free technology in more than 500,000 equipped vehicles on the road and over 700 million hands-free miles driven—demonstrating that automation can be trusted, intuitive, and helpful while reaching everyday drivers at unprecedented scale. Within GM AV, the Model Deployment & Inference Solutions team deploys machine learning models from training frameworks (e.g., PyTorch) onto autonomous-vehicle hardware; our two-fold mission is to build the ML deployment platform that makes model rollouts fast and predictable, and to optimize models so they meet the real-time latency and memory budgets required to run on-vehicle. Our work sits on the critical path for GM’s publicly committed launch of eyes-off (hands-free, eyes-free) autonomous driving in 2028 on the Cadillac Escalade IQ, and we’re hiring engineers to help deliver the next generation of safe, delightful personal autonomous-vehicle experiences.

About the Role

As an early career Engineer on the Model Deployment & Inference Solutions team, you’ll contribute across both sides of our mission: building the ML deployment platform and optimizing models for on-vehicle inference. You’ll work with and learn from senior engineers on real production deployments, platform features, and model-optimization workflows that ship to GM’s Super Cruise fleet at large scale, with structured mentorship and a clear onboarding plan. You’ll also collaborate closely with our sister teams (kernels, compiler, reduced precision, and parity) on the end-to-end path that takes trained models from research frameworks to ultra-efficient, safety-critical inference on the car. This is an early-career / new graduate role designed for candidates who have recently or will be completing their degree by June 2026.

What You’ll Do (Responsibilities)

Contribute production code across the ML deployment platform, model-optimization workflows, and inference benchmarking/profiling infrastructure.

Pair with senior engineers on deployment workflows, performance investigations, model-optimization experiments (e.g., quantization, pruning, distillation), and platform tooling.

Build, test, and maintain platform tools (e.g., validators, performance probes, parity and sensitivity analyzers, agentic specialists) with technical guidance and code review support.

Investigate and help root-cause production deployment or performance issues; learn and apply the diagnostic playbook for compiler, kernel, runtime, and parity bugs.

Collaborate with cross-functional teams across the AV organization; including kernels, compiler, reduced-precision, parity, and model-development groups—to plan and execute model deployments to the AV stack, working under the guidance of senior engineers

Participate in code reviews, design discussions, and technical documentation to ensure reliability, correctness, and clear abstractions in a large-scale codebase.

Learn and follow secure coding, safety, and compliance practices required for on-vehicle autonomous driving software.

Your Skills & Abilities (Required Qualifications)

Recently completed or completing a Bachelor’s or Master’s degree by Spring 2026 in Computer Science, ECE, or a related technical field. (Degree must be completed before your start date.)

Strong computer science fundamentals (e.g., data structures, algorithms, operating systems, computer architecture) and solid coding skills in Python and/or C++, demonstrated through coursework, internships, or substantial projects.

Hands-on experience in AI/ML (e.g., machine learning, deep learning, computer vision, NLP, or ML systems) via classes, research, internships, or personal projects.

Depth in at least one of: computer architecture, operating systems, distributed systems, or compilers.

Demonstrated software-engineering experience (internships, coursework, open-source, research code, or competitions) showing good judgment around r eliability, correctness, and clean abstractions.

Experience with—or strong interest in—using coding assistants/agents (e.g., Cursor, Claude Code, GitHub Copilot) as part of your workflow.

Ability to work effectively in collaborative, cross-functional teams and communicate clearly—both in writing and verbally—including explaining technical work partners

What Will Give You a Competitive Edge (Preferred Qualifications)

Internship, research, or advanced coursework in ML systems, ML compilers, GPU programming (CUDA, OpenAI Triton), inference optimization, or distributed training/serving infrastructure.

Familiarity with PyTorch and modern ML compiler/runtime stacks (e.g., torch.compile, TensorRT, ONNX, Triton Inference Server, vLLM, or equivalent).

Exposure to model optimization (quantization, pruning, distillation) or GPU profiling tools (Nsight Systems, Nsight Compute, PyTorch Profiler).

Familiarity with workflow/ML platforms such as Airflow, Temporal, Flyte, Ray, or Kubeflow.

Experience building agentic or LLM-powered tools or workflows.

Open-source contributions related to PyTorch, TensorRT, vLLM, OpenAI Triton, or similar projects.

Coursework, projects, or publications touching ML systems (e.g., MLSys, OSDI, ASPLOS, HPCA, NeurIPS systems track).

Familiarity with a systems language (e.g., C++) and development in a Linux environment.

Location

Sunnyvale, CA

This role is categorized as hybrid. This means the selected candidate is expected to report to a specific location at least 3 times a week.

This job may be eligible for relocation benefits

Compensation

The compensation information is a good faith estimate only. It is based on what a successful applicant might be paid in accordance with applicable state laws. The compensation may not be representative for positions located outside of New York, Colorado, California, or Washington.

The salary range for this role is $119,250 to $150,850. The actual base salary a successful candidate will be offered within this range will vary based on factors relevant to the position.

Bonus Potential : An incentive pay program offers payouts based on company performance, job level, and individual performance.

Benefits: GM offers a variety of health and wellbeing benefit programs. Benefit options include medical, dental, vision, Health Savings Account, Flexible Spending Accounts, retirement savings plan, sickness and accident benefits, life insurance, paid vacation & holidays, tuition assistance programs, employee assistance program, GM vehicle discounts and more.

다양성 정보

General Motors는 법적으로 금지된 차별을 배제하는 것은 물론 포용성과 소속감을 진정으로 장려하는 직장이 되기 위해 노력하고 있습니다. 당사는 다양성이 보장되는 환경에서 직원들이 역량을 발휘하고 우리 고객을 위한 더 좋은 제품을 개발할 수 있다고 믿습니다. 따라서 입사에 관심 있는 사람이 있다면 포지션별 주요 업무와 자격을 확인하고 본인이 보유한 기술과 능력에 부합하는 모든 포지션에 적극적으로 지원하기를 장려합니다. 지원자는 채용 과정에서 역할 관련 평가(해당하는 경우) 및/또는 채용 전 스크리닝을 통과해야 합니다. 자세한 정보는 GM 채용 과정 안내를 참고하십시오.

공평한 취업 기회 선언 (미국)

General Motors는 공평한 기회를 제공하는 고용주임을 자부합니다. 자격을 만족하는 지원자는 인종과 피부색, 성별, 성적 지향, 성별 정체성, 국적, 장애, 재향 군인 보호법 적용 여부와 상관없이 채용 후보로서 심사를 받습니다.

숙소 (미국 및 캐나다)

General Motors는 장애인을 포함한 모든 구직자들에게 취업 기회를 제공합니다. 구직이나 취업 지원에 도움이 되는 합리적인 숙소가 필요한 경우 [email protected]으로 이메일을 보내시거나 800-865-7580으로 전화주십시오. 이메일에, 귀하가 요청하는 특정한 숙소에 대한 설명과 귀하가 지원하는 직무와 채용 요청서 번호를 포함해주세요.

Machine Learning Engineer, AI Inference Solutions (University Grad)

설명

다양성 정보

공평한 취업 기회 선언 (미국)

숙소 (미국 및 캐나다)