Deep Learning for Computer Vision final project

Last update: Nov 30, 2021

Overview

Deep Learning for Computer Vision final project

Team: DLCV1

Member & Contribution:

林彥廷 (R06943184): 主程式撰寫、模型訓練 (50%)
王擎天 (R06945055): 副程式撰寫、模型訓練、海報設計 (50%)

Overview:

This project contains code to predict image's type from different domain using moment matching.

Description:

Folders:

script: folder contains scripts
src: folder contains source code
model: folder contains saved models which automatically download from network

Files:

script/get_dataset.sh: script which downloads training and testing dataset
script/download_from_gdrive.sh: script which downloads googledrive data
script/parse_data.sh: script which loads training dataset and converts to torch dataset
script/predict.sh: script which predicts images
script/evaluate.sh: script which evaluates the model
script/predict_for_verify.sh script which generates mini-batch average validation accuracy and loss plot
src/models/classifier.py: classifier model
src/models/loss.py: loss function
src/models/pretrained.py: pretrained model
src/models/model.py: Model and function for prediction and evaluation
src/parse_data.py: load data in folder and convert them to torch dataset
src/predict.py: prediction main function
src/evaluate.py: evaluation main function
src/train.py: training function
src/utils.py: code for parsing and saving
src/util/dataset.py: customized dataloader
src/util/visual.py: code for visualization
src/create_path_csv.py:main function to create image path csv file for image folder

Dataset:

Download training and testing dataset to folder named "dataset_public":

bash ./script/get_dataset.sh

WARNING:

You MUST use src/create_path_csv.py to create image-path csv file for image folder which hasn't contain image-path csv file, the usage will teach you how to use it!!!

Usage:

Create image-path csv file for image folder:

User can use this script to create image-path csv file

python3 src/create_path_csv.py $1

$1 is the folder containing the images

Example: (path: /home/final-dlcv1)

python3 src/create_path_csv.py dataset_public/test

The result will look like following text: image_name,label test/018764.jpg,-1 test/034458.jpg,-1 test/050001.jpg,-1 test/027193.jpg,-1 test/002637.jpg,-1 test/017265.jpg,-1 test/048396.jpg,-1 test/013178.jpg,-1 test/036777.jpg,-1 ......

Predict labels of images:

User can use this script to predict labels of images

bash ./script/predict.sh $1 $2 $3 $4 $5

$1 is the domain of images (Option: infograph, quickdraw, real, sketch)
$2 is the folder containing the images
$3 is the csv file contains image paths
$4 is the folder to saved the result file
$5 is the batch size

Example 1: Predict images from real domain (path: /home/final-dlcv1)

bash script/predict.sh real dataset_public dataset_public/test/image_path.csv predict 256

Example 2: Predict images from sketch domain (path: /home/final-dlcv1)

bash script/predict.sh sketch dataset_public dataset_public/sketch/sketch_test.csv predict 256

Example 3: Predict images from infograph domain (path: /home/final-dlcv1)

bash script/predict.sh infograph dataset_public dataset_public/infograph/infograph_test.csv predict 256

Example 4: Predict images from quickdraw domain (path: /home/final-dlcv1)

bash script/predict.sh quickdraw dataset_public dataset_public/quickdraw/quickdraw_test.csv predict 256

Evaluate the result file:

User can use this script to evaluate the reuslt file with answer file, it will print result on the screen

bash ./script/evaluate.sh $1 $2

$1 is the predicted file csv
$2 is the answer file csv

Example (path:/home/final-dlcv1)

bash ./script/evaluate.sh predict/real_predict.csv test/test_answer.csv

Reference

https://gntuedutw-my.sharepoint.com/:p:/g/personal/b03901093_g_ntu_edu_tw/EUuiRqkxp69GjGDOzVcJeHoBIY61MIbEEbBVsMgNBfcEBg?rtime=uLaDbmDq1kg
Peng, X., Bai, Q., Xia, X., Huang, Z., Saenko, K. and Wang, B., 2018. Moment Matching for Multi-Source Domain Adaptation. arXiv preprint arXiv:1812.01754.

Deep Learning for Computer Vision final project

Related tags

Overview

Deep Learning for Computer Vision final project

Team: DLCV1

Member & Contribution:

Overview:

Description:

Dataset:

WARNING:

Usage:

Create image-path csv file for image folder:

Predict labels of images:

Evaluate the result file:

Reference

Owner

grassking100

AtlasNet: A Papier-Mâché Approach to Learning 3D Surface Generation

Deep learning based hand gesture recognition using LSTM and MediaPipie.

Combine Tacotron2 and Hifi GAN to generate speech from text

This repository provides some of the code implemented and the data used for the work proposed in "A Cluster-Based Trip Prediction Graph Neural Network Model for Bike Sharing Systems".

The challenge for Quantum Coalition Hackathon 2021

Reduce end to end training time from days to hours (or hours to minutes), and energy requirements/costs by an order of magnitude using coresets and data selection.

SymmetryNet: Learning to Predict Reflectional and Rotational Symmetries of 3D Shapes from Single-View RGB-D Images

CityLearn Challenge Multi-Agent Reinforcement Learning for Intelligent Energy Management, 2020, PikaPika team

Computer Vision application in the web

A super lightweight Lagrangian model for calculating millions of trajectories using ERA5 data

Constrained Language Models Yield Few-Shot Semantic Parsers

Modular Probabilistic Programming on MXNet

A python code to convert Keras pre-trained weights to Pytorch version

Array Camera Ptychography

This project provides an unsupervised framework for mining and tagging quality phrases on text corpora with pretrained language models (KDD'21).

Spatial Action Maps for Mobile Manipulation (RSS 2020)

Source code for the paper "SEPP: Similarity Estimation of Predicted Probabilities for Defending and Detecting Adversarial Text" PACLIC 2021

object recognition with machine learning on Respberry pi

Code for Contrastive-Geometry Networks for Generalized 3D Pose Transfer

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.