Experienced Deep Learning and Computer Vision Engineer with 4+ years of industrial experience. Skilled in Deep Learning, Computer Vision, NLP, and Data Science. Strong R&D professional currently working as a Sr. Machine Learning Engineer at Octopus Digital. I am a passionate developer who thrives to design and develop efficient A.I solutons to solve industrial problems.
I got my bachelors from the department of CS & IT, University of Sargodha with a gold medal. My BS was financially supported by University of Sargodha's merit scholarship & Fauji Foundation scholarhip. I obtained my Masters degree in Computer Science (Deep Learning and Medical Image Analysis) from one of the oldest and prestigious engineering universities in Pakistan — University of Engineering and Technology, Lahore.
I have worked in diverse fields of Artificial Intelligence including deep learnig, computer vision, natural language processing and data science. My main area of expertise is in industrial problem solving using state of the art AI tools and technologies. I like to design and develop AI solutions for various business problems involving textual, visual and time series data.
I am learning and practicing MLOps and Cloud technologies for Machine Learning development. I also love to do researach and read about the latest technologies in the field. I regularly follow top notch researchers and labs in the field so I can use thier algorithms in my problems. I am a medium pacer and love to bowl. I am also a casual photographer and a universal foodie.
Summary: Acral melanoma is life threatning cancer. Due to acral melanoma infrequent occurrences,
limited data is available so its early diagnosis is hard. To overcome this problem, we applied data centric
techniques to develop a large(comparatively) dataset to train a deep learning models. Our proposed
convolutional neural network achieved an accuracy of 91% on test set.
Summary: I worked on IOT based smart solution for smart home security. We utilized a RaspberryPi
and a Pi Camera and
PIR motion sensors to develop a prototype of security system. Final system was able to capture suspicious
movements and notify the home owner via push and email notifications.
This is a small CLI tool to download YouTube frames without downloading and manually extracting frames
from YouTube videos. Just provide a YouTube video link and the CLI tool will automatically download the
frames in your target directory.
Tech Stack & Libraries : Python, OpenCV, VidGear, ArgrParse
GitHub Link
A python program for generating video subtitles. The program utilizes OpenAI's Whisper model for
converting speech to text and then this data is used to generate subtitle files. The program currently
can generate .srt and .vtt foramts for subtitles.
Tech Stack & Libraries : Python, whisper
GitHub Link
This is a useful cli tool currently under development. It accepts a video (local video file or
YouTube
link) and utilized optimized deep learning model to generate datasets for high level computer vision
tasks. It reads video, runs deep learning model on individual frames and classified and sorts output
frames into seprate directory based on predicted class.
Tech Stack & Libraries : Python, OpenVino, Inception V3, VidGear, ArgreParse
Github Link
This repo has multiple automation scripts using Python. The scripts can be used in any computer
vision projects. It has scripts to read, write, resize and perform multiple operations on images.
Tech Stack & Libraries : Python, OpenCV, Augmentor, PyTorch, NumPy, Pandas
GitHub Link
In this project a dataset for steel defect detection by NEU was used classification of steel surface
defects. Dataset has total six classes. A ResNet-18 was finetuned to classify the input images into
one
of the six classes. The model achieved 97% accuracy (approx.)
Tech Stack & Libraries : Python, FastAI, PyTorch, OpenCV, Pandas, Scikit-learn, Matplotlib
GitHub Link
This was a term project for my Machine Learning course @UET during my MS. We collected and anotated a
large datasets of firearms divided into five categories. Data was collected from YouTube and other video
files from web. A YOLOv3 model was trained for detection and classification while a Mask-RCNN model was
trained for instance segmentation of firearms. We achieved a 97% accuracy for detection and
classification.
Tech Stack & Libraries : Python, PyTorch, FastAI, OpenCV, yt_downloader
GitHub
Link | Report
Check out all my contributions @ github/qaixerabbas
We worked on Adversarial Attacks on AIOT Operations Technolgies with a focus on theme parks. In this
project we developed a robust video classification model for human related crimes identification using
efficient and compute efficient convolutional architectures. To show the impact of adversarial attacks
on such life threatning scenarios we used various sparse black box attacks on video classification
models. A detailed paper is being written and its in progress.
Tech Stack & Libraries : Python, OpenCV, PyTorch, Keras, TensorFlow, ResNets,
MobileNets, EfficienNets, Huggingface, TorchAttacks
GitHub Link | Adv Attack Paper
In this project we extensively experimented with three remote sensing image classification datasets
using recent convolutional and attention based architectures. The focus was to identify best models
keeping compute performance, prediction accuracy and number of parameters in mind. This project helps
the newbies and researchers to select the best models when working with aerial scene understanding. This
project is under active development and we are experimenting with attention based architectures for
now.
Tech Stack & Libraries : Python, OpenCV, PyTorch, ResNets,
EfficienNets, ConvNext, DenseNet, SqueezeNet
GitHub Link | Aerial Scene Understanding Paper
This was my MS thesis project. I worked on skin cancer (Acral Melanoma) detection and classification
from Dermoscopy images. A data centric approach was used to enhance limited training data. Various
deep
learning models were trained to select the best model. Our AMNet model was a modified ResNet-18
architecture that performed better to diagnose Acral Melanoma. In addition, a NASNet model was also
used
for binary classification of Melanoma skin cancer.
Tech Stack & Libraries : Python, OpenCV, Augmentor, PyTorch, Keras, TensorFlow, ResNets,
MobileNets,
NASNet
GitHub Link | Acral Melanoma
Detection |
Melanoma Detection
This project tackles the problem of NER for low resource languages. We worked on data generation for
Punjabi (Shahmukhi Script). A multilingual BERT model was trained for NER task. We developed a large
training corpora using a simple and novel PoW augmentation technique. The publication is accepted in
the prestigious ACM TALLIP.
Tech Stack & Libraries : Python, BERT, NLTK, SpaCy
GitHub Link | Paper
We experimented with various machine learning models for recent intrusion detection datasets.
NSL-KDD, CSE-CIC-IDS2018 and UNSW-NB-15 datasets were used in the study. Five major ML algorithms were
trained on preprocessed datasets. We proposed a hybrid, cost and compute efficient ensemble model for
development of secure IDS systems.
Tech Stack & Libraries : Python, scikit-learn, Pandas, NumPy, Keras
GitHub Link | Paper
We used a recent dataset named as LC-25000 for lungs cancer classification. The dataset
contains 15000 images of lungs histopathology. We trained two ResNets (18 and 50), MobileNet and
AlexNet model for classification. A maximum accuracy of 98% was achived using ResNet-18 algorithm.
Tech Stack & Libraries : Python, PyTorch, FastAI, OpenCV
GitHub
Link | Report