CVPR2020 Counterfactual Samples Synthesizing for Robust VQA

Last update: Dec 22, 2022

Overview

This repo contains code for our paper "Counterfactual Samples Synthesizing for Robust Visual Question Answering". The code is modified from here; many thanks!

Prerequisites

Make sure you are on a machine with an NVIDIA GPU, Python 2.7, and about 100 GB of disk space. The following packages are required:

h5py==2.10.0
pytorch==1.1.0
Click==7.0
numpy==1.16.5
tqdm==4.35.0
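
A quick way to confirm that an environment matches the pinned versions above is to compare each module's `__version__` against the list. The following is a minimal sketch, not part of this repo: the function names are hypothetical, and the import names are assumptions (for example, the `pytorch` package is imported as `torch` and `Click` as `click`).

```python
import importlib

# Pinned versions copied from the list above. The keys are import names,
# which is an assumption: "pytorch" is imported as "torch", "Click" as "click".
REQUIRED = {
    "h5py": "2.10.0",
    "torch": "1.1.0",
    "click": "7.0",
    "numpy": "1.16.5",
    "tqdm": "4.35.0",
}


def installed_version(module_name):
    """Return the module's __version__ string, or None if it cannot be imported."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return None
    return getattr(module, "__version__", None)


def find_mismatches(required, lookup=installed_version):
    """Return {module: (wanted, found)} for every module whose version differs."""
    mismatches = {}
    for name, wanted in sorted(required.items()):
        found = lookup(name)
        # Ignore any local build suffix after "+" when comparing.
        if found is None or found.split("+")[0] != wanted:
            mismatches[name] = (wanted, found)
    return mismatches
```

Calling `find_mismatches(REQUIRED)` inside the target environment returns an empty dict when every package matches; otherwise each entry shows the wanted and the found version.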

Data Setup

You can use

bash tools/download.sh

to download the data.

The rest of the data and the trained model can be obtained from BaiduYun (passwd: 3jot) or GoogleDrive. Unzip feature1.zip and feature2.zip and merge them into data/rcnn_feature/.

Then use

bash tools/process.sh

to process the data.
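
The unzip-and-merge step for feature1.zip and feature2.zip can also be scripted. The sketch below is only an illustration under assumptions: the internal layout of the archives is not documented here, so the hypothetical helper `merge_archives` flattens any folders inside them and writes every file directly into the target directory. Files with the same name would overwrite each other.

```python
import os
import shutil
import zipfile


def merge_archives(zip_paths, target_dir):
    """Extract every file from each archive directly into target_dir.

    Folder structure inside the archives is discarded (an assumption about
    the desired layout). Returns the number of files written.
    """
    if not os.path.isdir(target_dir):
        os.makedirs(target_dir)
    count = 0
    for zip_path in zip_paths:
        with zipfile.ZipFile(zip_path) as archive:
            for member in archive.namelist():
                name = os.path.basename(member)
                if not name:
                    # Directory entries end with "/" and have no base name.
                    continue
                out_path = os.path.join(target_dir, name)
                with archive.open(member) as src, open(out_path, "wb") as dst:
                    shutil.copyfileobj(src, dst)
                count += 1
    return count
```

Assuming both archives sit in the current directory, `merge_archives(["feature1.zip", "feature2.zip"], "data/rcnn_feature")` would place all extracted files in data/rcnn_feature/.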

Training

Run

CUDA_VISIBLE_DEVICES=0 python main.py --dataset cpv2 --mode q_v_debias --debias learned_mixin --topq 1 --topv -1 --qvp 5 --output [] --seed 0

to train a model.
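
To launch the same training command from a script, for example to repeat it with several seeds, the shell line can be assembled as an argument list. This is a sketch under assumptions: `build_train_command` and `run_on_gpu` are hypothetical helpers, not part of this repo, and the output name is whatever you would put in place of `[]`. Only flags that appear in the command above are used.

```python
import os
import subprocess


def build_train_command(output_name, seed=0):
    """Return the training command from this section as an argument list."""
    return [
        "python", "main.py",
        "--dataset", "cpv2",
        "--mode", "q_v_debias",
        "--debias", "learned_mixin",
        "--topq", "1",
        "--topv", "-1",
        "--qvp", "5",
        "--output", output_name,
        "--seed", str(seed),
    ]


def run_on_gpu(command, gpu_id=0):
    """Run a command with CUDA_VISIBLE_DEVICES set, as in the shell line."""
    env = dict(os.environ, CUDA_VISIBLE_DEVICES=str(gpu_id))
    # subprocess.call returns the exit code of the launched process.
    return subprocess.call(command, env=env)
```

For instance, `run_on_gpu(build_train_command("my_run", seed=0))` would start one run on GPU 0, where `my_run` is a placeholder output name.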

Testing

Run

CUDA_VISIBLE_DEVICES=0 python eval.py --dataset cpv2 --debias learned_mixin --model_state []

to evaluate a model.

Citation

If you find this code useful, please cite the following paper:

@inproceedings{chen2020counterfactual,
title={Counterfactual Samples Synthesizing for Robust Visual Question Answering},
author={Chen, Long and Yan, Xin and Xiao, Jun and Zhang, Hanwang and Pu, Shiliang and Zhuang, Yueting},
booktitle={CVPR},
year={2020}
}
