Role Overview
We are seeking an experienced Staff Performance Engineer to lead and scale performance engineering practices for our cloud-native SaaS platform. This role is responsible for driving performance, scalability, reliability, and cost efficiency at an organizational level, with a strong focus on serverless and distributed architectures.
You will define performance engineering strategy, build scalable and AI-driven performance platforms, and influence architectural decisions across teams. The role requires deep expertise in modern cloud environments and a strong focus on embedding performance into the entire software lifecycle, from development to production.
Key Responsibilities
Define and drive organization-wide performance engineering strategy aligned with business KPIs, customer experience, and cost efficiency
Architect and build scalable, self-service performance engineering platforms enabling teams to run performance tests and analysis independently
Design and implement AI-driven performance engineering solutions including anomaly detection, predictive performance insights, adaptive load testing, and automated optimization recommendations
Lead the design and execution of advanced performance testing strategies for serverless, distributed, and event-driven systems
Establish and standardize performance benchmarks, SLAs, SLOs, and KPIs across services
Drive integration of performance testing and validation into CI/CD pipelines to enable continuous performance engineering (shift-left approach)
Analyze system-wide performance bottlenecks including latency, cold starts, concurrency limits, and resource utilization across distributed systems
Collaborate with engineering, SRE, and architecture teams to influence system design for scalability, resilience, and performance optimization
Own performance in production environments by leveraging observability tools, distributed tracing, and real-time monitoring systems
Implement intelligent observability solutions using tools such as CloudWatch, Datadog, New Relic, and AI-based monitoring platforms
Lead capacity planning and scalability initiatives for high-throughput and globally distributed systems
Drive cost-performance optimization strategies in cloud-native environments (FinOps alignment)
Mentor and guide engineers across teams, promoting a performance-first culture and best practices
Stay updated with emerging trends in performance engineering, including AI/ML-driven optimization and cloud-native innovations
Desired Skill and Requirements
Must Have
8+ years of experience in performance engineering within large-scale SaaS or cloud-native environments
Performance testing tools - JMeter, Gatling, Locust, or similar
Serverless architectures - AWS Lambda, API Gateway, event-driven systems
Performance monitoring and observability tools - CloudWatch, Datadog, New Relic, distributed tracing systems
Building performance engineering frameworks or platforms at scale
Performance optimization in distributed and serverless systems - latency, cold starts, concurrency, and scaling behavior
Integration of performance engineering into CI/CD pipelines
Programming/scripting - Python (preferred), Java, or similar
AI/ML-based performance optimization techniques - anomaly detection, predictive analysis, adaptive load modeling
Cloud platforms (AWS preferred) and performance optimization techniques
Ability to identify and resolve complex performance bottlenecks
Large-scale load testing and capacity planning
Cost-performance optimization in cloud environments
Good To Have
Kubernetes, containerized, and serverless architectures
Chaos engineering and resilience testing
Internal developer platforms and self-service tooling
FinOps and cloud cost optimization strategies
Globally distributed and multi-region architectures
API performance optimization
Modern distributed data stores - DynamoDB, Aurora Serverless, NoSQL systems
AIOps platforms and intelligent observability systems
Soft Skills
Strong problem-solving and analytical thinking
Ability to influence architectural and technical decisions across teams
Excellent communication and stakeholder management skills
Ownership mindset with the ability to drive cross-functional initiatives
Mentorship and leadership capabilities
Ability to operate in a fast-paced, high-growth SaaS environment
Experience
8+ years of experience in performance engineering in large-scale SaaS or cloud-native environments
3+ years of experience in Senior, Lead, or Staff-level performance engineering roles
4+ years of experience performance testing large-scale SaaS or distributed systems
5+ years of hands-on experience with performance testing tools such as JMeter, Gatling, k6, or Locust
Experience designing and executing large-scale performance tests in production-like environments
Experience identifying and resolving performance bottlenecks across application, database, network, and infrastructure layers
Experience tuning databases for performance at scale
Experience defining and implementing performance benchmarks, KPIs, and capacity planning strategies
Experience working with observability and monitoring platforms for performance analysis
Experience optimizing event-driven and serverless architectures
Experience influencing architecture and engineering decisions across teams and domains
Experience operating in fast-paced, high-growth SaaS environments
Education
Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field
Equivalent practical experience in performance engineering or cloud-native systems