설명
Description:
At BrightDrop, we are reshaping e-commerce by developing smarter, greener, and more efficient ways to deliver goods and services to your door, while delivering a brighter future for the cities where we live. We are building an ecosystem of all-electric, zero-emissions delivery solutions – from electric vehicles, to ePallets and software that leverages real-time data to drive intelligent optimizations for e-commerce. To deliver on our mission we are growing fast and building a team, based in Palo Alto, with offices in Atlanta and Detroit, that is customer-focused, agile and passionate about innovating for a more sustainable future.
From engineering to product management and operations, BrightDrop is looking for people who can combine a passion for technology and sustainability with high doses of curiosity and rigorous thinking to deliver a better future.
Backed by General Motors, BrightDrop is striving to improve the communities where we live and deliver a better future for generations to come. We hope you’ll join us.
In this Senior Site Reliability Engineer role, you will develop and maintain key elements of the infrastructure, platform health and reliability monitoring for BrightDrop’s new electrified fleet. We are an innovation first team, and we need your help to ensure we meet the highest standards. Come join us and let’s innovate!
What you get to do in this role:
-
Implement scalable, reliable, secure SRE and Observability platform to monitor health of our production system, and provide a holistic view of the environment.
-
Deliver tools/software to improve the reliability, scalability and operability of services.
-
Collaborate with engineering teams to analyze and provide inputs in architecture, infrastructure resources, observability to achieve reliability and scalability goals.
-
Collaborate with engineering teams to conduct production readiness reviews, deployment, operation and refinement
-
Partner with stakeholders to ensure data and observability tools are effectively integrated with other systems and processes.
-
Partner with stakeholders to identify, measure and monitor availability, latency and overall service health.
-
Participate in on-call engineering duty to support production.
-
Perform initial incident root cause analysis, carryout incident postmortem.
-
Build run books, tooling to carry out production support activities.
-
Actively participate in technical discussions and deep dives with Architectural group
Additional Description
Qualifications:
-
5+ years of hands-on SRE experience (software development, systems monitoring) with at least one of the public cloud providers – Azure(preferred), AWS, GCP
-
Experience operating high-availability, fault-tolerant, scalable, distributed software in production: Building monitoring, defining alerts, writing run books, establishing dashboards etc.
-
Experience with monitoring and log aggregation frameworks, such as Data Dog(preferred), Splunk, Elasticsearch, Kibana, Logstash.
-
Strong working knowledge of Docker, Kubernetes, Terraform, Chef or Ansible
-
Experience troubleshooting JVM based applications.
-
Strong experience in scripting/programing – Python, Java, PowerShell, Bash.
-
Experience with configuration and management of SSO, Big Data/ No-SQL in cloud infrastructure.
-
CI/CD automation frameworks knowledge - Jenkins/Azure DevOps
-
Strong understanding of public cloud networking components.
-
Working experience with Git and source control management tools, such as Bitbucket, GitHub
-
Experience with IoT stack is a big plus
-
BS/MS in Computer Science/Engineering preferred
The compensation information is a good faith estimate only. It is based on what a successful applicant in the California Bay Area which includes the following counties: Marin, Contra Costa, San Francisco, Alameda, San Mateo, Santa Clara, and Santa Cruz might be paid in accordance with the California law.
The compensation may not be representative for positions located outside of the California Bay Area.
The annual salary range for this role is $146,900 - $225,000. The actual base salary a successful candidate will be offered within this range will vary based on factors relevant to the position.
Bonus Potential: An incentive pay program offers payouts based on company performance, job level, and individual performance.
Benefits: GM offers a variety of health and wellbeing benefit programs. Benefit options include medical, dental, vision, Health Savings Account, Flexible Spending Accounts, retirement savings plan, sickness and accident benefits, life insurance, paid vacation & holidays, tuition assistance programs, employee assistance program, GM vehicle discounts and more.