๐Ÿงช

Machine Learning

10 projects
Fall 2021 Classification

Traffic Signs Image Classification

Multi-class classification of traffic sign images using various ML algorithms aimed at categorization of high impact classes with an accuracy of 93%.

Toolkit

SVM Naive Bayes Random Forest KNN MLP PCA scikit-learn
Fall 2021 Regression

Predicting Game Success

Applying machine learning models to predict new games' success rate in the current market, along with sentiment analysis of user reviews represented in an interactive visualization.

Toolkit

Linear Regression Poisson Sentiment Analysis Python HTML
Spring 2021 Computer Vision

Face Recognition using PCA

Face recognition of subjects by computing top eigen faces from images shot with different expressions.

Toolkit

Python scikit-learn PCA Classification
Spring 2021 Clustering

Image Compression

Performing image compression of multiple image sizes using K-means clustering algorithm implemented from scratch.

Toolkit

Python K-Means From Scratch
Spring 2021 Clustering

Spectral Clustering

Experimenting Spectral clustering algorithm with a synthetic dataset to identify non-convex clusters.

Toolkit

Python Spectral Clustering scikit-learn
Fall 2020 Classification

Comparing Classifiers

A comparison of several classifiers in scikit-learn on synthetic datasets to illustrate the nature of decision boundaries of different classifiers.

Toolkit

Naive Bayes Decision Trees Random Forest AdaBoost Neural Nets SVM QDA
Fall 2020 Dimensionality Reduction

ISOMAP for Dimensionality Reduction

Implementing the ISOMAP algorithm to obtain a two-dimensional embedding for images corresponding to different poses of the same person and comparing its results with PCA.

Toolkit

Python ISOMAP Non-linear DR
Fall 2020 Clustering

Clustering Digits

Applying clustering techniques on hand-written images of digits for unsupervised pattern recognition.

Toolkit

Python K-Means MNIST
Fall 2020 Dimensionality Reduction

Principal Component Analysis

Applying PCA on multiple datasets to identify trends, directions, and reduce dimensionality while preserving variance.

Toolkit

Python PCA EDA
Fall 2020 Regression

Regularization

Applying Regularization techniques on a Housing dataset to prevent overfitting and improve generalization.

Toolkit

Python Ridge Lasso Regression
๐Ÿ“ˆ

Data Visualization

7 projects
Spring 2022 Big Data

NFT Analytics Dashboard

NFT Big data Analytics & dashboard built on large volumes of Opensea trading data and Twitter sentiment scraping.

Toolkit

D3.js Django Python R Twitter API
Fall 2021 Big Data

COVID-19 Big Data Pipeline

Implementing an end-to-end data pipeline on AWS using real-time streaming, PySpark and HDFS to analyse trends and impacts with COVID-19 dataset.

Toolkit

HDFS PySpark AWS Kafka NiFi Tableau
Spring 2021 Interactive

Choropleth Visualization

Interactive Map visualization of ratings of board games specific to each region.

Toolkit

HTML D3.js GeoJSON
Spring 2021 Network

Co-Actor Network Visualization

Building a network graph of co-actors and visualizing them as an interactive graph.

Toolkit

Argo-lite Python Network Analysis
Fall 2020 Network

Force Directed Graph

A network visualization identifying similarities between board games using force-directed simulations.

Toolkit

D3.js Force Simulation
Fall 2020 Interactive

Interactive Frequency Polygon

An Interactive frequency polygon chart to visualize a dataset containing list of games, their ratings, and supporting information.

Toolkit

D3.js Interactive Charts
Fall 2020 Charts

D3.js Line Chart

Experimenting different features in D3 line charts for visualizing board game ratings over time.

Toolkit

D3.js SVG
๐Ÿ’ป

Programming

3 projects
Spring 2021 Game Dev

Tic-Tac-Toe

A classic game of tic-tac-toe implementation using Python with AI opponent options.

Toolkit

Python Game Logic
Fall 2020 Simulation

Gamble Game Simulation

Implementing a gamble game of die rolls and performing Monte-Carlo simulation and distribution tests.

Toolkit

Python Monte-Carlo Probability
Fall 2020 Statistics

Fitting Distributions

Implementing routines to fit random variables to a set of continuous/discrete probability distributions.

Toolkit

Python Probability Goodness of Fit