resu·mail

AI SW Stack Deployment Architect

at SanDisk

Bengaluru, India Senior Posted 2026-05-18

Don't apply into the void — reach the hiring manager

ResuMail finds the recruiters and hiring managers behind this AI SW Stack Deployment Architect role at SanDisk, drafts a personalised outreach email, and schedules the send — so your application actually gets seen.

Reach the hiring manager ›

About this role

Role Overview We are looking for a Software Architect (12+ years experience) to lead the application/framework layer and deployment stack for the Next Generation Accelerator AI platform. This role owns how models run on Next Generation Accelerator—from vLLM / PyTorch / TensotFlow/XLA to production deployment—ensuring correctness, performance, and scalability. Key Responsibilities Architect integration of vLLM, PyTorch, and TensorFlow, JAX/XLA into Next Generation Accelerator stack Define framework → compiler → runtime APIs and contracts Own LLM execution behavior (batching, KV cache, streaming inference) Design and implement end-to-end deployment workflows (packaging, versioning, reproducibility) Drive performance optimization across model → framework → runtime Work cross-functionally with compiler, runtime, and low-level SW teams Support customer workloads, model onboarding, and debugging Impact Own customer-visible AI execution and deployment on Next Generation Accelerator , closing the gap between models and system performance , and enabling enterprise-grade AI solutions Required Qualifications 10+ years in AI/ML systems or software architecture Strong experience with PyTorch / Transformers / LLMs Hands-on experience with LLM deployment and scalable inference engine systems e.g. vLLM, Triton, SGLang etc. Experience building scalable AI platforms (cloud/edge) Expertise in system design, APIs, and cross-layer integration Preferred Qualifications Experience with vLLM or similar LLM serving systems Familiarity with XLA / MLIR / compiler frameworks Exposure to AI accelerators (GPU/NPU) and runtime systems Experience in distributed or multi-agent AI systems Sandisk thrives on the power and potential of diversity. As a global company, we believe the most effective way to embrace the diversity of our customers and communities is to mirror it from within. We believe the fusion of various perspectives results in the best outcomes for our employees, our company, our customers, and the world around us. We are committed to an inclusive environment where every individual can thrive through a sense of belonging, respect and contribution. Sandisk is committed to offering opportunities to applicants with disabilities and ensuring all candidates can successfully navigate our careers website and our hiring process. Please contact us at  jobs.accommodations@sandisk.com  to advise us of your accommodation request. In your email, please include a description of the specific accommodation you are requesting as well as the job title and requisition number of the position for which you are applying.

How to get this job at SanDisk

  1. Don't rely on the portal. Cold applications for a role like AI SW Stack Deployment Architect land in a pile of hundreds. A direct, personalised message to the hiring manager or a referrer is the fastest way in.
  2. Find the right person. ResuMail surfaces the actual recruiters and hiring managers at SanDisk — not a generic careers inbox.
  3. Send tailored outreach. ResuMail drafts an email personalised to your resume and this role, then paces and schedules sends so you stay out of spam.
  4. Follow up. One polite nudge after 5–7 days roughly doubles reply rates — scheduled for you.

Reach SanDisk's hiring managers today.

Free to start. No credit card. Built for Indian job seekers.

Start free with ResuMail ›