Written by divyakamatAugust 14, 2021August 21, 2021

DETR : End-to-End Object Detection with Transformers

So lets first understand what is object detection? Object Detection models are one of the most widely used models among other computer vision tasks. Object detection is a task where we want our model to distinguish the foreground objects from the background and predict the locations and the categories for the objects present in the […]

Written by divyakamatAugust 5, 2021August 6, 2021

Vision Transformers (ViT)

Transformers have been the de-facto for NLP tasks, various pretrained models are available for translation, text generation, summarization and more. The models can be downloaded and fine tuned in your deep learning framework of choice as it plays nicely with Tensorflow, Pytorch and Jax. Transformers aren’t just for text any more- they can handle a […]

Written by divyakamatAugust 5, 2021August 6, 2021

Spatial Transformers

Spatial Transformer The spatial transformer module consists of layers of neural networks that can spatially transform an image. These spatial transformations include cropping, scaling, rotations, and translations etc CNNs perform poorly when the input data contains so much variation. One of the solutions to this is the max-pooling layer. But then again, max-pooling layers do […]

Written by divyakamatAugust 1, 2021June 17, 2022

YoloV3 – Training Custom Dataset

Recently, while exploring computer vision got a chance to train YoloV3 on custom dataset for object detection. We custom trained the YOLO V3 to detect following classes:– hardhat– vest– mask– boots Below is a short video demonstrating how amazingly the model is able to detect these objects. Code for this can be found here.

Written by divyakamatDecember 27, 2020August 6, 2021

Cats Vs Dog — Image Classification using PyTorch

In this post we will use a standard computer vision dataset – Dogs vs. Cats dataset that involves classifying photos as either containing a dog or cat. Although, the dataset seems to be pretty simple, the goal would be to outline the steps required to solve image processing and classification using pytorch and the same […]

Divya's Blog

Penning My AI/ML Understandings

Category: Computer Vision