Be a trusted advisor to customers, helping them understand and incorporate AI accelerators into their overall cloud strategy by recommending migration paths, integration strategies, and application architecture that incorporate Google Cloud AI optimized infrastructure.
Demonstrate how Google Cloud is differentiated, highlighting the power of accelerators by working with customers on proof-of-concepts, demonstrating features, optimizing model performance, profiling, and bench-marking.
Influence Google Cloud strategy at the intersection of infrastructure and AI/ML by advocating for enterprise customer requirements.
Travel to customer sites and events as needed.
Be responsible for business growth and workload acceleration on AI infrastructure products and solutions for Google Cloud Platform (GCP).
Minimum qualifications:
Bachelor's degree in Computer Science, Mathematics, a related technical field, or equivalent practical experience.
6 years of experience with cloud native architecture in a customer-facing or support role.
5 years of experience with cloud infrastructure.
5 years of experience in a technical role focused on AI infrastructure, GPU programming (e.g., CUDA, OpenCL) and optimization techniques.
Experience building and operationalizing machine learning models.
Preferred qualifications:
Experience training and fine-tuning large models (e.g., image, language, segmentation, recommendation, genomics) with accelerators.
Experience with performance profiling tools (e.g., TensorFlow profiler, PyTorch profiler, Tensorboard).
Experience designing/architecting large-scale infrastructure farms for specialist Artificial Intelligence (AI) use cases.
Experience with running Machine Learning (ML) Perf benchmarks, distributed training and optimizing performance versus costs.
Experience with High Performance Computing (HPC) environments and contributions to open-source projects related to AI or infrastructure.
Excellent communication, presentation, and teamwork skills.