System Hardware Reliability Engineer
Job Overview
-
Date PostedFebruary 14, 2026
-
Location
-
Expiration date--
Job Description
2026-01-21T10:00:04.410Z
81517409516561094
Minimum qualifications:
- Bachelor’s degree in Reliability Engineering, Electrical Engineering, Industrial Engineering, or Mechanical Engineering, or equivalent practical experience.
- 5 years of experience in cloud quality and reliability engineering, computing or network infrastructure hardware, or relevant experience.
- Experience in Design Failure Mode and Effect Analysis (DFMEA), Design of Experiments (DOE), derating analysis, test plans, Ongoing Reliability Test (ORT), reliability block diagrams, and other system level reliability analysis tools.
Preferred qualifications:
- Master’s degree or PhD in Reliability Engineering, Electrical Engineering, Industrial Engineering, or Mechanical Engineering, or equivalent practical experience.
- Experience with failure analysis and fault isolation techniques and how to apply them to find root causes of failure.
- Experience leading cross-functional problem-solving teams using practical approaches.
- Experience in training on system level reliability tools such as Reliability Block Diagrams (RBDs), Mean Cumulative Function (MCF).
- Understanding of quality and reliability engineering principles.
About the job
Behind everything our users see online is the architecture built by the Technical Infrastructure team to keep it running. From developing and maintaining our data centers to building the next generation of Google platforms, we make Google’s product portfolio possible. We’re proud to be our engineers’ engineers and love voiding warranties by taking things apart so we can rebuild them. We keep our networks up and running, ensuring our users have the best and fastest experience possible.
Responsibilities
- Lead analysis of system hardware designs to enable proactive design evaluations and product de-risk at an early stage of development.
- Lead system reliability efforts by working with other organizations to define reliability goals and reliability plans, securing the resources needed to execute the plan.
- Implement the reliability plan and lead all efforts to assess and mitigate risk of failure early during New Product Introduction (NPI).
- Drive reliability test plans and collect, analyze, and synthesize the test data to enable verification of the design reliability goals.
- Lead system reliability monitoring efforts (availability, repair trends) and alert product teams on unwanted system behavior, working on mitigation strategy definition and implementation.
Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. See also Google’s EEO Policy and EEO is the Law. If you have a disability or special need that requires accommodation, please let us know by completing our Accommodations for Applicants form.