Data Scientist
Position Overview
We are seekinga strong Data Scientist to support a strategic data transformation initiative focused on building scalable, AI-enabled data quality and data management capabilities.
The role combines Generative AI, traditional machine learning, and data engineering to develop intelligent solutions for metadata generation, DQ rule recommendation, anomaly detection, entity resolution, profiling automation, and data quality monitoring.
The ideal candidateshould be hands-on,technically strong, and able to translate advanced AI/ML methods into practical, production-ready solutions within an enterprise environment.
Key Responsibilities
Develop AI-powered solutions usingLLMs for metadataenrichment, semantic classification, summarization, and data quality automation
Build and optimize RAG pipelines groundedin enterprise metadata,profiling outputs, rule libraries, and technical documentation
Develop ML models for anomaly detection, entity resolution, clustering, predictive analytics, and pattern recognition
Build scalable Python and SQL pipelinesto automate profiling, data onboarding, quality monitoring, and AI-assisted recommendations
Design Human-in-the-Loop workflows to ensure AI outputs are validated, auditable, and aligned with business requirements
Collaborate with business, governance, and technical teams to operationalize AI/ML solutions in enterprise environments
Technical Expertise& Skills
Strong hands-on experience with LLMs, promptengineering, and production-grade RAG architectures
Strong experience in Python,SQL, and ML frameworks such as Scikit-learn, LangChain, LlamaIndex, or equivalent
Experience with supervised and unsupervised ML techniques, including XGBoost, Random Forest, clustering, PCA, Isolation Forest, and anomaly detection
Good understanding of metadatamanagement, data qualitydimensions, data governance, and enterprise data platforms
Ability to evaluate modelperformance, improve outputquality, and explaintechnical results to non-technical stakeholders
Professional Qualifications
Proven experience as a Data Scientistdelivering AI/ML solutionsin enterprise or large- scale data environments
Strong analytical, problem-solving, and communication skills
Understanding of responsible AI, data security, privacy, and governance practices
Bachelor’s or Master’s degreein Computer Science,Data Science, Engineering, Statistics, Mathematics, or a related field