The Ideal candidate will be responsible for Designing, Developing and maintaining machine learning development pipelines.
Adopt emerging standards and review processes while promoting best practices and consistent framework usage and review processes with our AIML Goverance team
Serving as a core member of an agile team that drives user story analysis and elaboration, designs and develops responsive web applications using the best engineering practices.
You will closely work with data analysts and other partners to ensure development of robust models and machine learning applications.
You will be Building and optimize reports for analytical and business purposes.
Monitor and solve data engineering issues to ensure smooth operation.
Implementing data quality checks and validation process to ensure the accuracy, completeness, and consistency of data.
Implementing data governance policies , access controls , and security measures to protect critical data and ensure compliance.
Developing deep understanding of integrations with other systems and platforms within the supported domains.
Bring a culture of innovation, ideas, and continuous improvement.
Challenging the status quo, demonstrating risk taking, and implementing creative ideas
Manage your own time, and work well both independently and as part of a team.
Work with Product Owners to define requirements for new features and plan increments of work.
Preferred Qualifications
Machine Learning and Data Analytics skills - Good knowledge of machine learning methods. Understanding of data analysis tools e.g. PowerBI and Tableau.
Demonstrated experience of developing and defending Machine Learning, AI, or statistical models
BS or MS degree in computer science, computer engineering, data science or other technical subject area or equivalent 5+ years of work experience
Hands-on experience with SQL, including schema design, query optimization and performance tuning.
Experience with distributed computing frameworks like Hadoop, Hive, Spark for processing large scale data sets.
Proficiency in any of the programming language python, pyspark for building data pipeline and automation scripts.
Understanding of cloud computing and exposure to any cloud GCP,AWS or Azure.
knowledge of CICD, GIT commands and deployment process.
Understanding of AI tools.
Strong analytical and problem-solving skills, with the ability to troubleshoot complex data issues and optimize data processing workflows