Software Engineer, Site Reliability Engineering, Buying Platform Ads
Join to apply for the Software Engineer, Site Reliability Engineering, Buying Platform Ads role at Google
About the Job
Site Reliability Engineering (SRE) combines software and systems engineering to build and run large‑scale, massively distributed, fault‑tolerant systems. SRE ensures that Google Cloud’s services – both internally critical and externally visible – have reliability, uptime appropriate to customer needs, and a fast rate of improvement. The role focuses on optimizing existing systems, building infrastructure, automating work, and addressing complex challenges of scale unique to Google Cloud.
Minimum Qualifications
Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
5 years of software development experience in one or more programming languages.
3 years of experience designing, analyzing, and troubleshooting distributed systems.
2 years of experience leading projects and providing technical leadership.
Preferred Qualifications
Master’s degree in Computer Science or Engineering.
Experience with problem‑solving and analyzing global‑scale distributed systems.
Experience in Kotlin, Java, Python, Go, or eagerness to learn new technologies and languages.
Strong background in algorithms, data structures, complexity analysis, software design, performance, and application issues.
Ability to troubleshoot complex, distributed systems from first principles, focusing on deep dives into code, applications, and storage systems.
Passion for AI and awareness of emerging tools, market trends, and strategic opportunities within the AI landscape.
Responsibilities
Employ AI techniques to scale impact by 10x, revolutionizing developer velocity, system reliability, and machine‑resource efficiency.
Deliver on near‑ and long‑term high‑priority projects from inception to completion with minimal supervision.
Collaborate across teams, sharing expertise in building tools and solutions for system configuration, maintenance, and scaling; improve SLOs, monitoring, and production‑health compliance of critical Buying Platform systems.
Apply SRE principles of reliability, predictability, convergence, and efficiency to proactively identify and solve issues and provide on‑call coverage for key Buying Platform systems and flows.
Participate in production and design reviews, post‑mortem reviews, Disaster Recovery Testing (DiRT), and serve as a reliability expert, guiding and supporting partner engineering teams.
Additional Information
Location: London, England, United Kingdom. Employment type: Full‑time. Job function: Information Technology and Engineering. Seniority level: Not Applicable.
Google is an equal‑opportunity employer and an affirmative‑action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or veteran status. We consider qualified applicants regardless of criminal histories, consistent with legal requirements.
If you have a disability or special need that requires accommodation, please complete our Accommodations for Applicants form.
#J-18808-Ljbffr