Code for "Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search"

Last update: Dec 03, 2022

Overview

Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search

This is the implementation of our paper "Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search". The code is adapted from the GitHub repository "pytorch implementation for ECCV2018 paper Deep Cross-Modal Projection Learning for Image-Text Matching".

Requirements

Python 3.7
PyTorch 1.0.0 and torchvision 0.2.1
numpy
matplotlib (optional; only needed to plot the result figures)
scipy 1.2.1
pytorch_transformers
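
A quick way to compare an existing environment against the list above is sketched below. This helper is not part of the repository; the names `REQUIRED`, `version_matches` and `check_environment` are hypothetical, and the check assumes each package exposes a `__version__` attribute (it falls back to "unknown" otherwise).

```python
import importlib

# Pins copied from the requirement list above; None means no version is stated.
REQUIRED = {
    "torch": "1.0.0",
    "torchvision": "0.2.1",
    "scipy": "1.2.1",
    "numpy": None,
    "pytorch_transformers": None,
}


def version_matches(installed, pinned):
    """Return True if `installed` is the pinned release.

    Local build tags such as '+cu100' and suffixes such as '.post2' are tolerated.
    """
    if pinned is None:
        return True
    release = installed.split("+")[0]
    return release == pinned or release.startswith(pinned + ".")


def check_environment(required=REQUIRED):
    """Return a list of human-readable problems; an empty list means all checks passed."""
    problems = []
    for name, pinned in required.items():
        try:
            module = importlib.import_module(name)
        except ImportError:
            problems.append(f"{name}: not installed")
            continue
        installed = getattr(module, "__version__", "unknown")
        if not version_matches(installed, pinned):
            problems.append(f"{name}: found {installed}, README lists {pinned}")
    return problems
```

Calling `check_environment()` in the target environment and printing the returned list shows which packages are missing or differ from the listed versions.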

Usage

Data Preparation

Download the CUHK-PEDES dataset.
Put reid_raw.json under project_directory/data/.
Run data.sh.
Copy the files test_reid.json, train_reid.json and val_reid.json from CUHK-PEDES/data/ to project_directory/data/processed_data/.
Download the pretrained ResNet-50 model, the bert-base-uncased model and its vocabulary to project_directory/pretrained/.
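
Once the steps above are done, the resulting layout can be verified with a small script like the following. It is a sketch rather than part of the repository: `EXPECTED_FILES` and `missing_inputs` are hypothetical names, and because the file names inside pretrained/ are not specified here, the check only confirms that the directory exists and is not empty.

```python
from pathlib import Path

# Paths named in the data preparation steps, relative to the project directory.
EXPECTED_FILES = [
    "data/reid_raw.json",
    "data/processed_data/train_reid.json",
    "data/processed_data/val_reid.json",
    "data/processed_data/test_reid.json",
]


def missing_inputs(project_dir):
    """Return the expected inputs that are not present under `project_dir`."""
    root = Path(project_dir)
    missing = [rel for rel in EXPECTED_FILES if not (root / rel).is_file()]
    pretrained = root / "pretrained"
    # Assumption: any content in pretrained/ counts; specific file names are not checked.
    if not pretrained.is_dir() or not any(pretrained.iterdir()):
        missing.append("pretrained/ (empty or absent)")
    return missing
```

An empty return value from `missing_inputs("project_directory")` means every file listed above was found.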

Training & Testing

First set the parameter BASE_ROOT to your current directory and IMAGE_DIR to the directory of the CUHK-PEDES dataset. Then run sh scripts/train.sh to train the model and sh scripts/test.sh to evaluate it.
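
For readers who prefer to launch both stages from Python, a minimal wrapper might look like the sketch below. The function `run_stage` is hypothetical and not part of the repository. It only checks that the script and the image directory exist before invoking the script; it does not set BASE_ROOT or IMAGE_DIR, which still have to be edited as described above.

```python
import subprocess
from pathlib import Path


def run_stage(base_root, image_dir, stage="train", dry_run=False):
    """Run scripts/train.sh or scripts/test.sh from `base_root` after basic path checks.

    Returns the command that was (or, with dry_run=True, would be) executed.
    """
    if stage not in ("train", "test"):
        raise ValueError(f"stage must be 'train' or 'test', got {stage!r}")
    base = Path(base_root)
    script = base / "scripts" / f"{stage}.sh"
    if not script.is_file():
        raise FileNotFoundError(f"missing script: {script}")
    if not Path(image_dir).is_dir():
        raise FileNotFoundError(f"missing image directory: {image_dir}")
    command = ["sh", script.relative_to(base).as_posix()]
    if not dry_run:
        # check=True raises CalledProcessError if the script exits with a non-zero status.
        subprocess.run(command, cwd=str(base), check=True)
    return command
```

With `dry_run=True` the function returns the command without running it, which is a cheap way to confirm the paths before a long training run.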

Owner

Tencent YouTu Research
