PyTorch implementation of "Optimization Planning for 3D ConvNets"

Last update: Jan 12, 2022

Overview

Optimization-Planning-for-3D-ConvNets

Code for the ICML 2021 paper: Optimization Planning for 3D ConvNets.

Authors: Zhaofan Qiu, Ting Yao, Chong-Wah Ngo, Tao Mei

1. Requirement

The provided codes have been tested with Python-3.9.5 & Pytorch-1.9.0 on four Tesla-V100s.

2. Project structure

├─ base_config             # Pre-set config file for each dataset
├─ dataset                 # Video lists (NOT provided) and code to load video data
├─ jpgs                    # Images for README
├─ layers                  # Custom network layers
├─ model                   # Network architectures
├─ record                  # Config file for each run
├─ utils                   # Basic functions
├─ extract_score_3d.py     # Main script to extract predicted score
├─ helpers.py              # Helper functions for main scripts
├─ merge_score.py          # Main script to merge scores from different clips
├─ train_3d.py             # Main script to launch a training using given strategy
├─ train_3d_op.py          # Main script to launch a searching of best strategy
└─ run.sh                  # Shell script for training-extracting-merging pipeline

3. Run the code

Pre-process the target dataset and put the lists in to the dataset folder. Codes in dataset/video_dataset.py can load three video formats (raw video, jpeg frames and video LMDB) and can be simply modified to support the custom format.
Make config file in the record folder. The config examples include op-*.yml for pre-searched strategy, kinetics-*.yml for simple strategy on Kinetics-400,
Run run.sh for the training-extracting-merging pipeline or replace train_3d.py with train_3d_op.py for searching the optimal strategy.

4. TO DO

Add more explainations and examples.

5. Contact

Please feel free to email to Zhaofan Qiu if you have any question regarding the paper or any suggestions for further improvements.

6. Citation

If you find this code helpful, thanks for citing our work as

@inproceedings{qiu2021optimization,
title={Optimization Planning for 3D ConvNets},
author={Qiu, Zhaofan and Yao, Ting and Ngo, Chong-Wah and Mei, Tao},
booktitle={Proceedings of the 38th International Conference on Machine Learning (ICML)},
publisher={PMLR},
year={2021}
}

Please also pay attention to the citations of the included networks/algorithms.

PyTorch implementation of "Optimization Planning for 3D ConvNets"

Related tags

Overview

Optimization-Planning-for-3D-ConvNets

Code for the ICML 2021 paper: Optimization Planning for 3D ConvNets.

Authors: Zhaofan Qiu, Ting Yao, Chong-Wah Ngo, Tao Mei

1. Requirement

2. Project structure

3. Run the code

4. TO DO

5. Contact

6. Citation

Owner

Zhaofan Qiu

OCR Streamlit App is used to extract text from images using python's easyocr, pytorch and streamlit packages

ruptures: change point detection in Python

Attention-guided gan for synthesizing IR images

Unofficial implementation of Alias-Free Generative Adversarial Networks. (https://arxiv.org/abs/2106.12423) in PyTorch

PyTorch implementation of Off-policy Learning in Two-stage Recommender Systems

Dynamic Multi-scale Filters for Semantic Segmentation (DMNet ICCV'2019)

TagLab: an image segmentation tool oriented to marine data analysis

Serverless proxy for Spark cluster

Iris prediction model is used to classify iris species created julia's DecisionTree, DataFrames, JLD2, PlotlyJS and Statistics packages.

Official PyTorch implementation of "Contrastive Learning from Extremely Augmented Skeleton Sequences for Self-supervised Action Recognition" in AAAI2022.

Space Invaders For Python

Pytorch Implementation of paper "Noisy Natural Gradient as Variational Inference"

CVPR2022 paper "Dense Learning based Semi-Supervised Object Detection"

A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation

Task-based end-to-end model learning in stochastic optimization

Official Implementation of LARGE: Latent-Based Regression through GAN Semantics

Code for our paper Aspect Sentiment Quad Prediction as Paraphrase Generation in EMNLP 2021.

Implementation of ProteinBERT in Pytorch

This is the code for ACL2021 paper A Unified Generative Framework for Aspect-Based Sentiment Analysis

object detection; robust detection; ACM MM21 grand challenge; Security AI Challenger Phase VII