Pathdreamer: A World Model for Indoor Navigation

Last update: Jan 04, 2023

This repository hosts the open-source code for Pathdreamer, to be presented at ICCV 2021.

Paper | Project Webpage | Colab Demo

Setup instructions

Environment

Set up a virtualenv and install the required libraries:

virtualenv venv
source venv/bin/activate
pip install -r requirements.txt

Add the Pathdreamer library to PYTHONPATH:

export PYTHONPATH=$PYTHONPATH:/home/path/to/pathdreamer_root/
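
The export above only affects shells in which it has been run. As a minimal sketch of the same idea from inside Python (for example in a notebook), the helper below puts a directory on sys.path for the current process. The function name add_to_sys_path is hypothetical and not part of Pathdreamer, and the path you pass in is whatever your own checkout location is.

```python
import sys
from pathlib import Path


def add_to_sys_path(root: str) -> str:
    """Put `root` on sys.path for this process if it is not already there.

    Hypothetical helper, not part of Pathdreamer. It mirrors the effect of
    the PYTHONPATH export, but only for the running interpreter.
    """
    # Normalise to an absolute path so the same directory is never added twice.
    resolved = str(Path(root).expanduser().resolve())
    if resolved not in sys.path:
        sys.path.insert(0, resolved)
    return resolved
```

Calling add_to_sys_path("/home/path/to/pathdreamer_root/") with your real checkout path in place of the placeholder has the same effect as the export for that one Python session.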

Downloading Pretrained Checkpoints

We provide pretrained checkpoints, which can be downloaded and unpacked by running:

wget https://storage.googleapis.com/gresearch/pathdreamer/ckpt.tar -P data/
tar -xf data/ckpt.tar --directory data/

The archive is extracted to the data/ckpt directory. It contains two checkpoints: one for the Stage 1 model (Structure Generator) and one for the Stage 2 model (Image Generator).
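
To confirm that the download and extraction succeeded, it can help to list what ended up under data/ckpt. The sketch below is a hypothetical helper (list_checkpoint_files is not part of Pathdreamer) that makes no assumption about the file names inside the archive. It only reports which files are present.

```python
from pathlib import Path
from typing import List


def list_checkpoint_files(ckpt_dir: str = "data/ckpt") -> List[str]:
    """Return the sorted relative paths of all files under `ckpt_dir`.

    Hypothetical helper for a quick sanity check after running the wget
    and tar commands. It does not load or validate the checkpoints.
    """
    root = Path(ckpt_dir)
    if not root.is_dir():
        raise FileNotFoundError(
            f"{root} not found; run the wget and tar commands first"
        )
    # Walk the directory tree and keep regular files only.
    return sorted(
        p.relative_to(root).as_posix() for p in root.rglob("*") if p.is_file()
    )
```

Running list_checkpoint_files() from the repository root after extraction should print a non-empty list. An empty list or a FileNotFoundError suggests the archive was not unpacked where expected.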

Colab Demo

Pathdreamer_Example_Colab.ipynb shows how to set up and run the pretrained Pathdreamer model for inference in Google Colab. It includes examples of synthesizing image sequences and continuous video sequences for arbitrary navigation trajectories.

Citation

If you find this work useful, please consider citing:

@inproceedings{koh2021pathdreamer,
  title={Pathdreamer: A World Model for Indoor Navigation},
  author={Koh, Jing Yu and Lee, Honglak and Yang, Yinfei and Baldridge, Jason and Anderson, Peter},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
  year={2021}
}

License

Pathdreamer is released under the Apache 2.0 license. The Matterport3D dataset is governed by the Matterport3D Terms of Use.

Disclaimer

Not an official Google product.

Owner

Google Research
