Role : Sr Cloudera Developer (Data Engineer)
Exp : 6 to 10 Years
Location : Pune
Job Description :
Proficiency in working with Spark
Understanding of Spark s architecture and fault tolerance mechanisms
Proficiency in using Spark DataFrames and Spark SQL for querying structured data
Experience in optimizing Spark execution plan is a plus
Skills in performing Extract Transform and Load ETL processes using Spark
Experience with integrating Spark Streaming with other technologies like Kafka is an advantage
Familiarity with the Hadoop ecosystem including tools such as HDFS Hive Cloudera stack can be of advantage
Experience with deploying and managing Spark applications on a Hadoop cluster or on GCP Dataproc
Strong knowledge of Python experience with Java is beneficial as well
DevOps tools and practices CI CD Docker
Hands on experience in GCP services Dataproc Cloud Function Cloud Run Pub Sub BigQuery
Responsibilities and Duties :
• Design, develop, and implement data solutions using Cloudera technologies such as Hadoop, Spark, and Hive
• Collaborate with data engineers to optimize data pipelines and data processing workflows.
• Work closely with data analysts and data scientists to ensure data quality and integrity.
• Troubleshoot and resolve issues with data processing and data storage systems.
• Stay up-to-date on the latest trends and best practices in Cloudera development
• Participate in code reviews and provide feedback to team members.
Qualifications and Skills:
• Bachelor’s degree in computer science, Information Technology, or a related field
• Proven experience as a Cloudera Developer or similar role
• Solid understanding of Cloudera technologies such as Hadoop, Spark, and Hive
• Experience with data modeling, data warehousing, and data integration.
• Strong programming skills in Java, Scala, or Python
• Excellent problem-solving and communication skills
• Ability to work independently and as part of a team.