Strong programming skills in Python, Java, and JavaScript
Strong in statistics and probability; well versed in:
Probability distributions
Over and under sampling
Bayesian and frequentist statistics
Dimension reduction
Linear regression
Clustering
Decision trees
Strong in data wrangling and database management: cleaning and organizing complex data sets so they are easier to access and analyze. This includes categorizing data by patterns and trends and correcting or filling in data values, work that is time-consuming but necessary for data-driven decisions.
Develop custom ML models and algorithms for data sets; hands-on experience building ML models such as:
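As an illustration of the wrangling steps described above, the following is a minimal sketch using only the Python standard library. The field names (`region`, `amount`), the sample records, and the choice of median imputation are assumptions made for this example, not details of any particular project.

```python
from statistics import median


def clean_records(records):
    """Normalize category labels and fill missing amounts with the median."""
    # Median of the values that are present; used to fill in missing ones.
    observed = [r["amount"] for r in records if r["amount"] is not None]
    fill = median(observed)
    cleaned = []
    for r in records:
        cleaned.append({
            # Strip stray whitespace and unify case so labels group correctly.
            "region": r["region"].strip().lower(),
            "amount": fill if r["amount"] is None else r["amount"],
        })
    return cleaned


def total_by_region(records):
    """Aggregate cleaned records into a per-region total."""
    totals = {}
    for r in records:
        totals[r["region"]] = totals.get(r["region"], 0) + r["amount"]
    return totals


# Hypothetical raw data with inconsistent labels and one missing value.
raw = [
    {"region": " North", "amount": 10.0},
    {"region": "north ", "amount": None},
    {"region": "South", "amount": 30.0},
]
cleaned = clean_records(raw)
totals = total_by_region(cleaned)
```

In practice a library such as pandas would usually handle these steps; the plain-Python version is used here only to keep the sketch self-contained.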
Linear regression
Logistic regression
Naive Bayes
Decision tree
Random forest
K-nearest neighbors (KNN)
K-means clustering
Ensemble models
Simulation
Scenario Analysis
Knowledge of and experience with statistical and data mining techniques: GLM/regression, random forests, boosting, decision trees, text mining, etc.
Mine and analyze data from company databases to drive optimization and improvement of product development, marketing techniques, and business strategies.
Develop the company's A/B testing framework and test model quality.
Coordinate with different functional teams to implement models and monitor outcomes.
Develop processes and tools to monitor and analyze model performance and data accuracy.
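One common building block of an A/B testing framework like the one mentioned above is a two-proportion z-test on conversion rates. The sketch below is one possible way to write it with the Python standard library; the function name, the pooled-variance formulation, and the sample counts are assumptions for illustration, not a description of any specific company framework.

```python
from math import sqrt
from statistics import NormalDist


def two_proportion_z_test(conv_a, n_a, conv_b, n_b):
    """Return (z, two-sided p-value) for the difference in conversion rates."""
    p_a = conv_a / n_a
    p_b = conv_b / n_b
    # Pooled conversion rate under the null hypothesis of no difference.
    pooled = (conv_a + conv_b) / (n_a + n_b)
    se = sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se
    # Two-sided p-value from the standard normal distribution.
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value


# Hypothetical counts: variant A converts 100 of 1000, variant B 150 of 1000.
z_stat, p_val = two_proportion_z_test(100, 1000, 150, 1000)
```

A small p-value suggests the observed difference is unlikely under the null hypothesis; a full framework would also need to cover sample-size planning and corrections for multiple comparisons.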