Software Engineer, TPU Infrastructure, Google Cloud

India Entry Posted 2026-04-21

Don't apply into the void — reach the hiring manager

ResuMail finds the recruiters and hiring managers behind this Software Engineer, TPU Infrastructure, Google Cloud role at Google, drafts a personalised outreach email, and schedules the send — so your application actually gets seen.

Reach the hiring manager ›

About this role

Design and build scalable software capabilities to manage the availability, scheduling, and reliability of the Cloud TPU Hypercomputer stack (VMs, Networking, Storage, GKE etc.). Architect infrastructure solutions to ensure industry-leading availability guarantees for large-scale training and inference workloads. Develop telemetry and tooling to establish service level objectives (SLO) and service level agreements (SLA), and to enable rapid debugging of complex infrastructure issues across the fleet. Collaborate with platform, hardware, networking, and SRE teams to scale and manage accelerator capacity, including new TPU generations, ensure a seamless experience for customers. Design and implement reliable ML infrastructure that enables training and serving cutting edge models at massive scale, troubleshoot complex distributed system issues across the stack (hardware, kernel, network), build the automation, tooling, and telemetry needed to turn operational findings into permanent software fixes and improved SLOs. Minimum qualifications: Bachelor’s degree or equivalent practical experience. 2 years of experience in backend Infrastructure development. Experience in general purpose coding languages like C++, Go, or Python development. Experience with algorithms, data structures, software development, and distributed computing. Preferred qualifications: Experience designing reliable, fault-tolerant and high performance distributed systems. Experience with building cloud based services ideally with GCP. Experience with large-scale distributed systems or Machine Learning (ML) systems (training and serving for computer vision, speech recognition, natural language processing, machine translation models). Experience with reliability, large-scale distributed systems, Go, Google Cloud Platform, tensor processing unit (TPU), and service level objectives.

How to get this job at Google

Don't rely on the portal. Cold applications for a role like Software Engineer, TPU Infrastructure, Google Cloud land in a pile of hundreds. A direct, personalised message to the hiring manager or a referrer is the fastest way in.
Find the right person. ResuMail surfaces the actual recruiters and hiring managers at Google — not a generic careers inbox.
Send tailored outreach. ResuMail drafts an email personalised to your resume and this role, then paces and schedules sends so you stay out of spam.
Follow up. One polite nudge after 5–7 days roughly doubles reply rates — scheduled for you.

Reach Google's hiring managers today.

Free to start. No credit card. Built for Indian job seekers.

Start free with ResuMail ›