설명
The Staff Data Engineer, AI and Robotics will join the AI Research team within the Autonomous Robotics Center (ARC) . This role sets the technical direction for the robotics data backbone that enables scalable robot learning in manufacturing — from data capture and curation through versioning, serving, and auditing. Your work will make model development reproducible, testable, and production-ready, while establishing the infrastructure standards and operating patterns that accelerate robotics AI across programs.
This is a senior technical leadership role in robotics and machine learning infrastructure , focused on multimodal robotic datasets and continuous model iteration. You will work across AI research, robotics engineering, manufacturing, and validation teams to turn real-world robot behavior and failures into high-quality training data, robust production systems, and durable platform capabilities used broadly across the organization.
What You’ll Do
-
Define and drive the technical vision for multimodal robotics data infrastructure spanning vision, depth, force/torque, joint states, events, and metadata across lab and plant-adjacent environments.
-
Architect and scale reliable data capture, ingestion, and serving pipelines that support robot learning workflows from experimentation through production deployment.
-
Establish reproducible data logging and replay frameworks, including ROS 2 bagging where applicable, to enable debugging, regression testing, root-cause analysis, and dataset creation at scale.
-
Own the strategy for dataset lifecycle management, including versioning, lineage, provenance, governance, retention, and quality gates, to support trustworthy model training and evaluation.
-
Lead the integration of experiment tracking, model/data traceability, and auditability patterns so teams can compare runs, reproduce results, and understand system changes over time.
-
Design and implement MLOps automation patterns, including CI/CD/CT-style pipelines for ML systems, that reduce manual effort and improve deployment confidence for robotics AI updates.
-
Partner with AI/ML, planning, validation, and plant teams to define data contracts such as schemas, labeling standards, and failure taxonomies, and convert field failures into curated training datasets and measurable learning loops.
-
Influence architecture across adjacent systems and mentor engineers on best practices in data engineering, ML infrastructure, observability, and production reliability.
-
Drive cross-functional technical decisions, balancing research velocity with platform robustness, governance, and long-term maintainability.
What You’ll Need (Required Qualifications)
-
B.S. or M.S. in Computer Science, Computer Engineering, Data Engineering , or a related field.
-
8+ years of experience building production data systems and/or ML infrastructure, including practical experience supporting training pipelines end-to-end.
-
Strong proficiency in Python and at least one of: C++ , Scala , or Java .
-
Demonstrated engineering discipline in testing, documentation, system design, and operational reliability.
-
Experience with dataset versioning, lineage, and reproducibility tooling such as DVC or equivalent approaches.
-
Experience with experiment tracking and model registry patterns such as MLflow or equivalent tools.
-
Experience designing technical systems that support multiple stakeholders and use cases, with the ability to influence architecture beyond an individual project.
-
Ability to work onsite with hardware and robotics teams, and to design pipelines that handle real-world robotic logging constraints such as bandwidth limits, dropped frames, and timing drift.
What Will Give You a Competitive Edge (Preferred Qualifications)
-
Hands-on robotics logging and replay experience, including ROS 2 bags and system telemetry pipelines.
-
Experience with simulation-to-real data workflows and dataset synthesis strategies.
-
Familiarity with data governance requirements and auditability in safety-adjacent or safety-critical systems.
-
Experience building tools to support data labeling workflows, quality assurance, and active learning loops.
-
Experience serving as a technical lead, setting engineering standards, and mentoring senior or mid-level engineers across complex initiatives.
다양성 정보
General Motors는 법적으로 금지된 차별을 배제하는 것은 물론 포용성과 소속감을 진정으로 장려하는 직장이 되기 위해 노력하고 있습니다. 당사는 다양성이 보장되는 환경에서 직원들이 역량을 발휘하고 우리 고객을 위한 더 좋은 제품을 개발할 수 있다고 믿습니다. 따라서 입사에 관심 있는 사람이 있다면 포지션별 주요 업무와 자격을 확인하고 본인이 보유한 기술과 능력에 부합하는 모든 포지션에 적극적으로 지원하기를 장려합니다. 지원자는 채용 과정에서 역할 관련 평가(해당하는 경우) 및/또는 채용 전 스크리닝을 통과해야 합니다. 자세한 정보는 GM 채용 과정 안내를 참고하십시오.
공평한 취업 기회 선언 (미국)
General Motors는 공평한 기회를 제공하는 고용주임을 자부합니다. 자격을 만족하는 지원자는 인종과 피부색, 성별, 성적 지향, 성별 정체성, 국적, 장애, 재향 군인 보호법 적용 여부와 상관없이 채용 후보로서 심사를 받습니다.
숙소 (미국 및 캐나다)
General Motors는 장애인을 포함한 모든 구직자들에게 취업 기회를 제공합니다. 구직이나 취업 지원에 도움이 되는 합리적인 숙소가 필요한 경우 [email protected]으로 이메일을 보내시거나 800-865-7580으로 전화주십시오. 이메일에, 귀하가 요청하는 특정한 숙소에 대한 설명과 귀하가 지원하는 직무와 채용 요청서 번호를 포함해주세요.
