SemTorch

Last update: Dec 07, 2022

Related tags

Overview

SemTorch

This repository contains different deep learning architectures definitions that can be applied to image segmentation.

All the architectures are implemented in PyTorch and can been trained easily with FastAI 2.

In Deep-Tumour-Spheroid repository can be found and example of how to apply it with a custom dataset, in that case brain tumours images are used.

These architectures are classified as:

Semantic Segmentation: each pixel of an image is linked to a class label.
Instance Segmentation: is similar to semantic segmentation, but goes a bit deeper, it identifies , for each pixel, the object instance it belongs to.
Salient Object Detection (Binary clases only): detection of the most noticeable/important object in an image.

🚀 Getting Started

To start using this package, install it using pip:

For example, for installing it in Ubuntu use:

pip3 install SemTorch

👩‍💻 Usage

This package creates an abstract API to access a segmentation model of different architectures. This method returns a FastAI 2 learner that can be combined with all the fastai's functionalities.

# SemTorch
from semtorch import get_segmentation_learner

learn = get_segmentation_learner(dls=dls, number_classes=2, segmentation_type="Semantic Segmentation",
                                 architecture_name="deeplabv3+", backbone_name="resnet50", 
                                 metrics=[tumour, Dice(), JaccardCoeff()],wd=1e-2,
                                 splitter=segmentron_splitter).to_fp16()

You can find a deeper example in Deep-Tumour-Spheroid repository, in this repo the package is used for the segmentation of brain tumours.

def get_segmentation_learner(dls, number_classes, segmentation_type, architecture_name, backbone_name,
                             loss_func=None, opt_func=Adam, lr=defaults.lr, splitter=trainable_params, 
                             cbs=None, pretrained=True, normalize=True, image_size=None, metrics=None, 
                             path=None, model_dir='models', wd=None, wd_bn_bias=False, train_bn=True,
                             moms=(0.95,0.85,0.95)):

This function return a learner for the provided architecture and backbone

Parameters:

dls (DataLoader): the dataloader to use with the learner
number_classes (int): the number of clases in the project. It should be >=2
segmentation_type (str): just Semantic Segmentation accepted for now
architecture_name (str): name of the architecture. The following ones are supported: unet, deeplabv3+, hrnet, maskrcnn and u2^net
backbone_name (str): name of the backbone
loss_func (): loss function.
opt_func (): opt function.
lr (): learning rates
splitter (): splitter function for freazing the learner
cbs (List[cb]): list of callbacks
pretrained (bool): it defines if a trained backbone is needed
normalize (bool): if normalization is applied
image_size (int): REQUIRED for MaskRCNN. It indicates the desired size of the image.
metrics (List[metric]): list of metrics
path (): path parameter
model_dir (str): the path in which save models
wd (float): wieght decay
wd_bn_bias (bool):
train_bn (bool):
moms (Tuple(float)): tuple of different momentuns

Returns:

learner: value containing the learner object

Supported configs

Architecture	supported config	backbones
unet	`Semantic Segmentation`,`binary` `Semantic Segmentation`,`multiple`	`resnet18`, `resnet34`, `resnet50`, `resnet101`, `resnet152`, `xresnet18`, `xresnet34`, `xresnet50`, `xresnet101`, `xresnet152`, `squeezenet1_0`, `squeezenet1_1`, `densenet121`, `densenet169`, `densenet201`, `densenet161`, `vgg11_bn`, `vgg13_bn`, `vgg16_bn`, `vgg19_bn`, `alexnet`
deeplabv3+	`Semantic Segmentation`,`binary` `Semantic Segmentation`,`multiple`	`resnet18`, `resnet34`, `resnet50`, `resnet101`, `resnet152`, `resnet50c`, `resnet101c`, `resnet152c`, `xception65`, `mobilenet_v2`
hrnet	`Semantic Segmentation`,`binary` `Semantic Segmentation`,`multiple`	`hrnet_w18_small_model_v1`, `hrnet_w18_small_model_v2`, `hrnet_w18`, `hrnet_w30`, `hrnet_w32`, `hrnet_w48`
maskrcnn	`Semantic Segmentation`,`binary`	`resnet50`
u2^net	`Semantic Segmentation`,`binary`	`small`, `normal`

📩 Contact

📧 [email protected]

💼 Linkedin David Lacalle Castillo

SemTorch

Related tags

Overview

SemTorch

🚀 Getting Started

👩‍💻 Usage

Parameters:

Returns:

Supported configs

📩 Contact

Owner

David Lacalle Castillo

This is a c++ project deploying a deep scene text reading pipeline with tensorflow. It reads text from natural scene images. It uses frozen tensorflow graphs. The detector detect scene text locations. The recognizer reads word from each detected bounding box.

Rotational region detection based on Faster-RCNN.

OCR of Chicago 1909 Renumbering Plan

Code for the AAAI 2018 publication "SEE: Towards Semi-Supervised End-to-End Scene Text Recognition"

Histogram specification using openCV in python .

An organized collection of tutorials and projects created for aspriring computer vision students.

computer vision, image processing and machine learning on the web browser or node.

OCR-D-compliant page segmentation

Some codes from PyImageSearch course's and external projects.

Automatically resolve RidderMaster based on TensorFlow & OpenCV

This Repository contain Opencv Projects in python

Code for the paper "DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks" (ICCV '19)

Image Recognition Model Generator

(CVPR 2021) ST3D: Self-training for Unsupervised Domain Adaptation on 3D Object Detection

SRA's seminar on Introduction to Computer Vision Fundamentals

Pure Javascript OCR for more than 100 Languages 📖🎉🖥

PianoVisuals - Create background videos synced with piano music using opencv

Hand gesture detection project with aweome UI implementation.

Drowsiness Detection and Alert System

This is a project to detect gestures to zoom in or out, using the real-time distance between the index finger and the thumb. It's based on OpenCV and Mediapipe.