Software Engineer, Data Center Infrastructure Management Cooling
Job Overview
-
Date PostedApril 11, 2026
-
Location
-
Expiration date--
Job Description
2026-04-03T17:00:05.469Z
103917606632596166
Minimum qualifications:
- Bachelor’s degree or equivalent practical experience.
-
2 years of experience with coding in C++ and Python, or 1 year of experience with an advanced degree.
- 1 year of experience with distributed computing.
- 1 year of experience with SQL.
Preferred qualifications:
-
Master’s degree or PhD in Computer Science, or a related technical field.
- Experience working with two or more from the following: data analysis, data processing pipelines, distributed and parallel systems, machine learning, information retrieval, web application development, Unix/Linux environments, mobile application development, natural language processing, networking, developing large software systems, or security software development.
- Experience with monitoring systems and cooling systems.
About the job
The DCIM (Data Center Infrastructure Management) Cooling team’s mission is to deliver reliable, efficient, and intelligent cooling solutions for Google data centers, enabling the future of technology. The team owns the life-cycle management of all cooling devices deployed in Google’s data center that includes telemetry collection, monitoring health and alerting Data Center Operations teams to take action on them. The DCIM Cooling team operates one of the large-scale monitoring systems at Google, reading telemetry from thousands of devices in every Google data center. Our issues include handling the rapid growth and diversification of the Google fleet and hardware, new use cases for critical monitoring of third-party facilities, and retiring technical debt.
In this role, you will work with your teammates to design, code, and put into production very large-scale distributed monitoring systems and work with your team and partner teams to enable new use cases for large-scale telemetry gathering. You will also create various system monitoring dashboards, defining Service Level Objectives (SLOs), documentation and playbooks. You will have the opportunity to take onsite trips to one or more of Google’s data centers each year to work with new systems and data center technical staff in person.
Responsibilities
- Write code for large-scale distributed systems.
- Participate in design of new products and features for data center cooling monitoring, reliability, and performance.
- Work with teammates and users on defining requirements for new use cases.
- Create system monitoring, dashboards, SLOs, documentation, and playbooks.
- Participate in the team’s interrupts rotation (e.g., business hours only, we don’t carry pagers).
Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. See also Google’s EEO Policy and EEO is the Law. If you have a disability or special need that requires accommodation, please let us know by completing our Accommodations for Applicants form.