resu·mail

ML / AI Data Engineer (Contract)

at Tech Holding

Remote Senior Posted 2026-05-08

Don't apply into the void — reach the hiring manager

ResuMail finds the recruiters and hiring managers behind this ML / AI Data Engineer (Contract) role at Tech Holding, drafts a personalised outreach email, and schedules the send — so your application actually gets seen.

Reach the hiring manager ›

About this role

<div class="content-intro"><p><strong>About us:</strong></p> <p>Working at Tech Holding isn't just a job, it's an opportunity to be a part of something bigger. We are a full-service consulting firm that was founded on the premise of delivering predictable outcomes and high-quality solutions to our clients.&nbsp; Our founders and team members have industry experience and have held senior positions in a wide variety of companies – from emerging startups to large Fortune 50 firms – and we have taken our combined experiences and developed a unique approach that is supported by the principles of deep expertise, integrity, transparency, and dependability.</p></div><div>We are looking for a highly skilled <strong>Senior ML / Data Pipeline Engineer</strong> who can translate complex machine learning and multimodal concepts into <strong>scalable, production-ready pipelines and workflows</strong>.<br>This role focuses on building and optimising <strong>large-scale video and multimodal data systems</strong>, enabling high-throughput ingestion, processing, and model training across distributed cloud environments.<br><br><strong>Key Responsibilities</strong></div> <ul> <li>Design, deploy, and scale <strong>large-scale ML and data processing pipelines</strong> across cloud infrastructure.</li> <li>Build systems to ingest, process, and serve <strong>250,000+ hours of multimodal data</strong> (video, audio, metadata).</li> <li>Architect and optimize <strong>GPU-based compute environments</strong> (e.g., NVIDIA Tesla clusters) for distributed training and inference.</li> <li>Develop <strong>high-throughput backend systems</strong> for video ingestion from desktop and mobile platforms.</li> <li>Implement <strong>distributed processing workflows</strong>, including job scheduling, fault tolerance, and resource allocation.</li> <li>Design and build <strong>human-in-the-loop and automated annotation systems</strong> to ensure data quality and scalability.</li> <li>Translate <strong>ML and multimodal research</strong> into scalable, production-grade cloud architectures.</li> <li>Optimize pipelines for <strong>performance, reliability, and cost efficiency</strong> across compute, storage, and networking layers.</li> <li>Collaborate with ML, data, and engineering teams to deliver <strong>end-to-end data workflows</strong>.</li> </ul> <div><strong>Requirements</strong></div> <ul> <li><strong>5+ years</strong> of experience in <strong>data engineering, ML pipelines, or distributed systems</strong>.</li> <li>Strong experience building <strong>scalable data pipelines</strong> for large datasets (video/audio preferred).</li> <li>Hands-on experience with <strong>cloud platforms</strong> (AWS, Azure, or GCP).</li> <li>Experience working with <strong>GPU-based environments</strong> and distributed computing.</li> <li>Strong programming skills in <strong>Python, Scala, or similar languages</strong>.</li> <li>Experience with <strong>data processing frameworks</strong> (Spark, Ray, Kafka, Airflow, or similar).</li> <li>Understanding of <strong>ML workflows, training pipelines, and inference systems</strong>.</li> <li>Experience designing <strong>fault-tolerant, high-availability systems</strong>.</li> <li>Strong knowledge of <strong>data storage systems</strong> (data lakes, object storage, distributed file systems).</li> <li>Ability to handle <strong>high-throughput, large-scale data ingestion and processing</strong>.</li> </ul> <div><strong>Good to Have</strong></div> <ul> <li>Experience with <strong>multimodal AI (video, audio, NLP)</strong> systems.</li> <li>Familiarity with <strong>annotation tools and data labeling workflows</strong>.</li> <li>Experience with <strong>containerization and orchestration</strong> (Docker, Kubernetes).</li> <li>Knowledge of <strong>cost optimization strategies</strong> for large-scale cloud workloads.</li> </ul><div class="content-conclusion"><p>Tech Holding is proud to be an Equal Opportunity Employer and is committed to fostering a diverse and inclusive workplace. We welcome applicants from all backgrounds and experiences, and we consider qualified applicants without regard to race, color, religion, gender, sexual orientation, gender identity, national origin, disability, veteran status, or any other legally protected characteristic. If you require accommodation in the application process, please contact our HR&nbsp;</p></div>

How to get this job at Tech Holding

  1. Don't rely on the portal. Cold applications for a role like ML / AI Data Engineer (Contract) land in a pile of hundreds. A direct, personalised message to the hiring manager or a referrer is the fastest way in.
  2. Find the right person. ResuMail surfaces the actual recruiters and hiring managers at Tech Holding — not a generic careers inbox.
  3. Send tailored outreach. ResuMail drafts an email personalised to your resume and this role, then paces and schedules sends so you stay out of spam.
  4. Follow up. One polite nudge after 5–7 days roughly doubles reply rates — scheduled for you.

Reach Tech Holding's hiring managers today.

Free to start. No credit card. Built for Indian job seekers.

Start free with ResuMail ›