Data Engineer

Hyderabad, India Mid Posted 2025-09-10

Don't apply into the void — reach the hiring manager

ResuMail finds the recruiters and hiring managers behind this Data Engineer role at Blend360, drafts a personalised outreach email, and schedules the send — so your application actually gets seen.

Reach the hiring manager ›

About this role

We are implementing a Media Mix Optimization (MMO) platform designed to analyze and optimize marketing investments across multiple channels. This initiative requires a robust on-premises data infrastructure to support distributed computing, large-scale data ingestion, and advanced analytics. The Data Engineer will be responsible for building and maintaining resilient pipelines and data systems that feed into MMO models, ensuring data quality, governance, and availability for Data Science and BI teams. The environment integrates HDFS for distributed storage, Apache NiFi for orchestration, Hive and PySpark for distributed processing, and Postgres for structured data management. This role is central to enabling seamless integration of massive datasets from disparate sources (media, campaign, transaction, customer interaction, etc.), standardizing data, and providing reliable foundations for advanced econometric modeling and insights. Responsibilities: Data Pipeline Development & Orchestration o Design, build, and optimize scalable data pipelines in Apache NiFi to automate ingestion, cleansing, and enrichment from structured, semi-structured, and unstructured sources. Ensure pipelines meet low-latency and high-throughput requirements for distributed processing. Data Storage & Processing o Architect and manage datasets on HDFS to support high-volume, fault-tolerant storage. o Develop distributed processing workflows in PySpark and Hive to handle large-scale transformations, aggregations, and joins across petabyte-level datasets. o Implement partitioning, bucketing, and indexing strategies to optimize query performance. Database Engineering & Management o Maintain and tune Postgres databases for high availability, integrity, and performance. o Write advanced SQL queries for ETL, analysis, and integration with downstream BI/analytics systems. Collaboration & Integration o Partner with Data Scientists to deliver clean, reliable datasets for model training and MMO analysis. o Work with BI engineers to ensure data pipelines align with reporting and visualization requirements. Monitoring & Reliability Engineering o Implement monitoring, logging, and alerting frameworks to track data pipeline health. o Troubleshoot and resolve issues in ingestion, transformations, and distributed jobs. Data Governance & Compliance o Enforce standards for data quality, lineage, and security across systems. o Ensure compliance with internal governance and external regulations. Documentation & Knowledge Transfer o Develop and maintain comprehensive technical documentation for pipelines, data models, and workflows. o Provide knowledge sharing and onboarding support for cross- functional teams. Bachelor’s degree in Computer Science, Information Technology, or related field (Master’s preferred). Proven experience as a Data Engineer with expertise in HDFS, Apache NiFi, Hive, PySpark, Postgres, Python, and SQL. Strong background in ETL/ELT design, distributed processing, and relational database management. Experience with on-premises big data ecosystems supporting distributed computing. Solid debugging, optimization, and performance tuning skills. Ability to work in agile environments, collaborating with multi-disciplinary teams. Strong communication skills for cross-functional technical discussions. Preferred Qualifications: Familiarity with data governance frameworks, lineage tracking, and data cataloging tools. Knowledge of security standards, encryption, and access control in on- premises environments. Prior experience with Media Mix Modeling (MMM/MMO) or marketing analytics projects. Exposure to workflow schedulers (Airflow, Oozie, or similar). Proficiency in developing automation scripts and frameworks in Python for CI/CD of data pipelines.

How to get this job at Blend360

Don't rely on the portal. Cold applications for a role like Data Engineer land in a pile of hundreds. A direct, personalised message to the hiring manager or a referrer is the fastest way in.
Find the right person. ResuMail surfaces the actual recruiters and hiring managers at Blend360 — not a generic careers inbox.
Send tailored outreach. ResuMail drafts an email personalised to your resume and this role, then paces and schedules sends so you stay out of spam.
Follow up. One polite nudge after 5–7 days roughly doubles reply rates — scheduled for you.

Reach Blend360's hiring managers today.

Free to start. No credit card. Built for Indian job seekers.

Start free with ResuMail ›