GCP Data Engineer | Java & PySpark

Gurugram, India Mid Posted 2026-05-11

Don't apply into the void — reach the hiring manager

ResuMail finds the recruiters and hiring managers behind this GCP Data Engineer | Java & PySpark role at Synechron, drafts a personalised outreach email, and schedules the send — so your application actually gets seen.

Reach the hiring manager ›

About this role

Job Summary Synechron is seeking a highly skilled GCP Data Engineer with expertise in Java and PySpark to lead the development of scalable data pipelines and microservices on Google Cloud Platform. In this role, you will collaborate with cross-functional teams to design, implement, and optimize data-driven solutions that support our global marketing and personalization initiatives. Your technical leadership will enable innovative cloud-native architectures that enhance business capabilities and drive operational efficiency. Software Requirements Required Software Proficiency: Java (preferably version 8 or newer) — extensive experience in developing backend services using Java and Spring Boot PySpark — proven experience in large-scale data processing within GCP environments SQL — hands-on experience with relational databases and query optimization techniques Google Cloud Platform (GCP): DataProc, BigQuery, GKE, Cloud Storage — solid working knowledge of deploying and managing data systems on GCP UNIX/Linux shell scripting, Python, Perl — practical knowledge in scripting for automation and data manipulation RESTful web services — experience in designing and implementing APIs for data exchange Version control: Git — strong familiarity with code management and collaboration workflows Preferred Software Skills: GCP-native tools: Dataflow, Cloud Composer, Cloud Storage — experience with fully managed cloud data workflows Hadoop, Hive, or additional big data processing tools — for data migration and legacy system integration Machine learning libraries: TensorFlow, Scikit-learn — knowledge for implementing AI/ML features Data streaming tools: Kafka — experience with real-time data pipelines and event processing Overall Responsibilities Design, develop, and optimize large-scale data pipelines leveraging PySpark, DataProc, and BigQuery, ensuring robustness and scalability Build and maintain microservices and APIs using Java (Spring Boot), deployed on GKE (Google Kubernetes Engine) Collaborate with product teams, data scientists, and stakeholders to analyze data requirements and translate into technical solutions Modernize architecture by migrating workloads from Hadoop, Spark, and Hive to GCP cloud infrastructure Conduct performance tuning, troubleshooting, and system optimization for data pipelines and services Write detailed technical documentation, including architecture diagrams, APIs, and deployment procedures Stay informed on emerging trends in cloud data engineering, AI/ML, and big data to apply innovations effectively Lead code reviews, enforce coding standards, and foster best practices across the team Technical Skills (By Category) Programming Languages: Essential: Java (Spring Boot), PySpark, SQL — proven ability to develop scalable backend and data processing solutions Preferred: Shell scripting, Perl, Python — for automation and scripting tasks Databases/Data Management: BigQuery, DataProc, Hadoop, Hive — experience in designing, managing, and optimizing large datasets and query performance Cloud Technologies: GCP (DataProc, BigQuery, GKE, Cloud Storage) — thorough knowledge of cloud deployment, data integration, and management Frameworks & Libraries: PySpark, Dataflow for data processing; REST API development with Spring Boot Development Tools & Methods: Git, CI/CD pipelines (Jenkins, Cloud Build), Agile/scrum practices for collaboration and continuous deployment Security & Data Governance: Understanding of data security, privacy standards, and best practices for cloud-based data platforms Experience Requirements 4+ years of professional experience in data engineering, software development, or related roles, preferably on cloud platforms Demonstrated experience developing and deploying scalable data pipelines and microservices in GCP environment Proven expertise in Java and PySpark for large data processing and system integration Familiarity with cloud migration projects, big data frameworks, and AI/ML libraries is a plus Effective in collaborative, agile team environments with a focus on delivering results Day-to-Day Activities Develop, test, and refine data pipelines using PySpark, DataProc, and BigQuery Design and implement scalable back-end microservices and APIs with Java Spring Boot, deployed on GKE Participate in Agile ceremonies, including sprint planning, stand-ups, and reviews Monitor system performance, troubleshoot issues, and optimize data workflows for efficiency and reliability Collaborate with data scientists, product managers, and analysts to deliver data-driven features Document system architecture, data models, and deployment instructions to ensure maintainability Continuously explore new cloud, big data, and AI/ML tools to improve infrastructure and solutions Qualifications Bachelor’s or Master’s degree in Computer Science, Data Science, Engineering, or a related field 4+ years of experience in data engineering, big data processing, or software development, with strong cloud expertise Certifications such as GCP Professional Data Engineer are advantageous Proven ability to design scalable solutions and lead data migration projects on cloud platforms Professional Competencies Strong analytical and problem-solving skills for designing optimized data pipelines Leadership capabilities to manage project goals and mentor junior team members Excellent communication skills for translating technical solutions to stakeholders Adaptability and eagerness to explore cutting-edge technologies and methodologies Results-driven, with a focus on delivering reliable, scalable, and innovative data solutions Effective time management to prioritize workload and meet project deadlines S YNECHRON’S DIVERSITY & INCLUSION STATEMENT Diversity & Inclusion are fundamental to our culture, and Synechron is proud to be an equal opportunity workplace and is an affirmative action employer. Our Diversity, Equity, and Inclusion (DEI) initiative ‘Same Difference’ is committed to fostering an inclusive culture – promoting equality, diversity and an environment that is respectful to all. We strongly believe that a diverse workforce helps build stronger, successful businesses as a global company. We encourage applicants from across diverse backgrounds, race, ethnicities, religion, age, marital status, gender, sexual orientations, or disabilities to apply. We empower our global workforce by offering flexible workplace arrangements, mentoring, internal mobility, learning and development programs, and more. All employment decisions at Synechron are based on business needs, job requirements and individual qualifications, without regard to the applicant’s gender, gender identity, sexual orientation, race, ethnicity, disabled or veteran status, or any other characteristic protected by law . Candidate Application Notice

How to get this job at Synechron

Don't rely on the portal. Cold applications for a role like GCP Data Engineer | Java & PySpark land in a pile of hundreds. A direct, personalised message to the hiring manager or a referrer is the fastest way in.
Find the right person. ResuMail surfaces the actual recruiters and hiring managers at Synechron — not a generic careers inbox.
Send tailored outreach. ResuMail drafts an email personalised to your resume and this role, then paces and schedules sends so you stay out of spam.
Follow up. One polite nudge after 5–7 days roughly doubles reply rates — scheduled for you.

Reach Synechron's hiring managers today.

Free to start. No credit card. Built for Indian job seekers.

Start free with ResuMail ›