AI & Machine Learning
Certifications
Post Graduate Program in AI and Machine Learning
Purdue University
Intermediate Python
Codecademy
Deployment of Machine Learning Models in Production
Udemy
Python for Data Science
IBM
Visualize Data with Python Skill Path
Codecademy
Statistics Essential for Data Science
Simplilearn
SQL Course
Codecademy
Machine Learning
Simplilearn
Deep Learning with TensorFlow and Keras
Simplilearn
Taming Big Data with Apache Spark and Python
Udemy
Advanced Deep Learning: Computer Vision
Simplilearn
Projects
NLP for Opinion Mining: Twitter Disaster Relief
Detecting the words in a tweet that indicate a real report of a natural disaster.
- Text extraction and feature engineering using KGP Talkie.
- Classification models with TF-IDF, Word2Vec, and SVM with 78% accuracy.
- Deep learning models with word embeddings, BERT, and DistilBERT with ~90% accuracy.
- Deployed the DistilBERT model to production using Flask, uWSGI, and NGINX on AWS EC2.
Retail: Walmart
Exploring sales demand at the store-day level over a three-year time span.
- Predictive analysis comparing regularized regression models (Ridge, Lasso) with non-linear regression models (XGBoost regressor, RF), both as a single model for all stores and as a separate model for each store, with 85% accuracy.
- Time series analysis to identify yearly trends and seasonal months.
- Deep learning (LSTM) vs. traditional time series analysis, compared on accuracy.
Health Care: Cancer Detection
Identifying various cancer types based on genes to reduce the fatality rate.
- Dimensionality reduction using PCA, LDA, and t-SNE.
- Classification using scikit-learn (SVM, RF, KNN, NB), parametric methods, and deep learning methods (ANN, MLP) with 90%+ prediction accuracy.
- Data validation using statistical tests (t-test, F-test).
Cyber Security: Malware
Identifying URLs that host malware.
- Classification pipeline with binary classifiers (LR, SVC, KNN) and ensemble techniques (XGBoost, RF) with 90%+ accuracy.
- Illustration of diagnostic ability using ROC curves and AUC.
- Validation accuracy using K-Fold cross-validation and hyperparameter tuning using GridSearchCV.
Finance & Computer Vision
Lending Club loan data analysis, emotion recognition, and face recognition.
- Deep learning models using Keras, PyTorch, and torchvision.
- Customized CNN and transfer learning models (ResNet, VGG-16, MobileNet).
- Data augmentation and image representation learning using computer vision.
Skills & expertise
Machine Learning
Supervised
Unsupervised
Clustering
Dimensionality Reduction
Predictive Modeling
Statistics
Hypothesis Testing
Exploratory Data Analysis
Artificial Intelligence
Data Cleansing
Data Visualization
Model Deployment
Neural Networks
Deep Learning
Image Classification
Object Detection
Transfer Learning
Technical stack
Python
Pandas
NumPy
Scikit-learn
XGBoost
Matplotlib
Seaborn
Keras
TensorFlow
OpenCV
SQL
AWS
Optuna
PyTorch
GitHub
Apache Spark
Docker
uWSGI
NGINX
Flask