Sohrab Alex Mofid

AI & Machine Learning

Certifications

Post Graduate Program in AI and Machine Learning

Purdue University

Intermediate Python

Codecademy

Deployment of Machine Learning Models in Production

Udemy

Python for Data Science

IBM

Visualize Data with Python Skill Path

Codecademy

Statistics Essential for Data Science

Simplilearn

SQL Course

Codecademy

Machine Learning

Simplilearn

Deep Learning with TensorFlow and Keras

Simplilearn

Taming Big Data with Apache Spark and Python

Udemy

Advanced Deep Learning: Computer Vision

Simplilearn

Projects

NLP for Opinion Mining: Twitter Disaster Relief

Detecting the words in a tweet that indicate whether it reports a real natural disaster.

  • Text extraction and feature engineering using KGP Talkie.
  • Classification models using TF-IDF and Word2Vec features with an SVM, reaching 78% accuracy.
  • Deep learning models with word embeddings, BERT and DistilBERT, reaching ~90% accuracy.
  • Deployed the DistilBERT model to production using Flask, uWSGI and NGINX on AWS EC2.

Retail: Walmart

Exploring sales demand at the store-day level over a three-year time span.

  • Predictive analysis comparing regularized regression models (Ridge, Lasso) with non-linear regression models (XGBoost regressor, RF), both as a single model for all stores and as a separate model for each store, with 85% accuracy.
  • Time series analysis to identify yearly trends and seasonal months.
  • Deep learning (LSTM) vs. traditional time series analysis, compared on accuracy.

Health Care: Cancer Detection

Identifying various cancer types based on genes to reduce the fatality rate.

  • Dimensionality reduction using PCA, LDA and t-SNE.
  • Classification using scikit-learn models (SVM, RF, KNN, NB), parametric methods and deep learning methods (ANN, MLP), with 90%+ prediction accuracy.
  • Data validation using statistical tests (t-test, F-test).

Cyber Security Malware

Identifying URLs for malware.

  • Classification pipeline with binary classifiers (LR, SVC, KNN) and ensemble techniques (XGBoost, RF), reaching 90%+ accuracy.
  • Illustration of diagnostic ability using ROC curves and AUC.
  • Validation accuracy estimated with K-Fold cross-validation, and hyperparameter tuning with GridSearchCV.

Finance & Computer Vision

Lending Club loan data analysis, emotion recognition and face recognition.

  • Deep learning models using Keras, PyTorch and torchvision.
  • Customized CNN and transfer learning models (ResNet, VGG-16, MobileNet).
  • Data augmentation and image representation learning using computer vision.

Skills & expertise

Machine Learning

Supervised

Unsupervised

Clustering

Dimensionality Reduction

Predictive Modeling

Statistics

Hypothesis Testing

Exploratory Data Analysis

Artificial Intelligence

Data Cleansing

Data Visualization

Model Deployment

Neural Networks

Deep Learning

Image Classification

Object Detection

Transfer Learning

Technical stack

Python

Pandas

NumPy

Scikit-learn

XGBoost

Matplotlib

Seaborn

Keras

TensorFlow

OpenCV

SQL

AWS

Optuna

PyTorch

GitHub

Apache Spark

Docker

uWSGI

NGINX

Flask