A selection of state-of-the-art research papers (and code) on human locomotion (pose + trajectory) prediction (forecasting)

Overview

Awesome-Human-Pose-Prediction

A selection of state-of-the-art research papers (and code) on human trajectory prediction (forecasting). Papers marked with [W] are workshop papers.

Maintainers: Karttikeya Mangalam

Contributing: Please feel free to open pull requests to add new resources or to suggest additions or changes to the list. When proposing a new addition, please keep the following principles in mind:

  • The work has been accepted at a reputable peer-reviewed publication venue.
  • An open-access link to the paper PDF is attached (as far as possible).
  • Code for the paper is linked (if open-sourced by the authors).

Email: [email protected].{berkeley,stanford}.edu

Datasets

  • Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments [Paper]
  • Stanford Drone Dataset (SDD): Learning Social Etiquette: Human Trajectory Understanding in Crowded Scenes [Paper] [Leaderboard]

Papers

As End in Itself

  • From Goals, Waypoints & Paths To Long Term Human Trajectory Forecasting [Paper]

  • It Is Not the Journey but the Destination: Endpoint Conditioned Trajectory Prediction [Paper]

  • Trajectron++: Dynamically-Feasible Trajectory Forecasting With Heterogeneous Data [Paper]

  • Interaction-Based Trajectory Prediction Over a Hybrid Traffic Graph [Paper]

  • Map-Adaptive Goal-Based Trajectory Prediction [Paper]

  • Interaction-Aware Trajectory Prediction based on a 3D Spatio-Temporal Tensor Representation using Convolutional–Recurrent Neural Networks [Paper]

  • DROGON: A Trajectory Prediction Model based on Intention-Conditioned Behavior Reasoning [Paper]

  • Discrete Residual Flow for Probabilistic Pedestrian Behavior Prediction [Paper]

  • Social-VRNN: One-Shot Multi-modal Trajectory Prediction for Interacting Pedestrians [Paper]

  • Leveraging Neural Network Gradients within Trajectory Optimization for Proactive Human-Robot Interactions [Paper]

  • Social NCE: Contrastive Learning of Socially-aware Motion Representations [Paper]

  • Multimodal Deep Generative Models for Trajectory Prediction: A Conditional Variational Autoencoder Approach [Paper]

  • Risk-Sensitive Sequential Action Control with Multi-Modal Human Trajectory Forecasting for Safe Crowd-Robot Interaction [Paper]

  • Deep Learning for Vision-based Prediction: A Survey [Paper]

  • Probabilistic Crowd GAN: Multimodal Pedestrian Trajectory Prediction Using a Graph Vehicle-Pedestrian Attention Network [Paper]

  • Semantics for Robotic Mapping, Perception and Interaction: A Survey [Paper]

  • Benchmark for Evaluating Pedestrian Action Prediction [Paper]

  • Probabilistic Tracklet Scoring and Inpainting for Multiple Object Tracking [Paper]

  • Pedestrian Behavior Prediction via Multitask Learning and Categorical Interaction Modeling [Paper]

  • Graph-SIM: A Graph-based Spatiotemporal Interaction Modelling for Pedestrian Action Prediction [Paper]

  • Haar Wavelet based Block Autoregressive Flows for Trajectories [Paper]

  • Imitative Planning using Conditional Normalizing Flow [Paper]

  • TNT: Target-driveN Trajectory Prediction [Paper]

  • SimAug: Learning Robust Representations from Simulation for Trajectory Prediction [Paper]

  • SoPhie: An Attentive GAN for Predicting Paths Compliant to Social and Physical Constraints [Paper]

  • Social GAN: Socially Acceptable Trajectories With Generative Adversarial Networks [Paper]

  • DESIRE: Distant Future Prediction in Dynamic Scenes With Interacting Agents [Paper]

  • Predicting Whole Body Motion Trajectories using Conditional Neural Movement Primitives [Paper] [W]

  • Anticipating Human Intention for Full-Body Motion Prediction [Paper] [W]

  • Human Motion Prediction With Graph Neural Networks [Paper] [W]

  • Action-Agnostic Human Pose Forecasting [Paper]

  • Human Torso Pose Forecasting in the Real World [Paper]

  • Imitation Learning for Human Pose Prediction [Paper]

  • Disentangling Human Dynamics for Pedestrian Locomotion Forecasting with Noisy Supervision [Paper]

  • Predicting 3D Human Dynamics from Video [Paper]

  • Recurrent Network Models for Human Dynamics [Paper]

  • Structural-RNN: Deep Learning on Spatio-Temporal Graphs [Paper]

  • Learning Trajectory Dependencies for Human Motion Prediction [Paper]

  • Anticipating many futures: Online human motion prediction and generation for human-robot interaction [Paper]

  • Teaching Robots to Predict Human Motion [Paper]

  • Deep representation learning for human motion prediction and classification [Paper]

  • On human motion prediction using recurrent neural networks [Paper]

  • Few-Shot Human Motion Prediction via Meta-learning [Paper]

  • Efficient convolutional hierarchical autoencoder for human motion prediction [Paper]

  • Learning Human Motion Models for Long-term Predictions [Paper]

  • Long-Term Human Motion Prediction by Modeling Motion Context and Enhancing Motion Dynamic [Paper]

  • Context-aware Human Motion Prediction [Paper]

  • Adversarial Geometry-Aware Human Motion Prediction [Paper]

  • Convolutional Sequence to Sequence Model for Human Dynamics [Paper]

  • QuaterNet: A Quaternion-based Recurrent Model for Human Motion [Paper]

  • BiHMP-GAN: Bidirectional 3D Human Motion Prediction GAN [Paper]

  • Human Motion Modeling using DVGANs [Paper]

  • Human Motion Prediction using Semi-adaptable Neural Networks [Paper]

  • A Neural Temporal Model for Human Motion Prediction [Paper]

  • Modeling Human Motion with Quaternion-based Neural Networks [Paper]

  • Human Motion Prediction via Learning Local Structure Representations and Temporal Dependencies [Paper]

  • VRED: A Position-Velocity Recurrent Encoder-Decoder for Human Motion Prediction [Paper]

  • EAN: Error Attenuation Network for Long-term Human Motion Prediction [Paper]

  • Structured Prediction Helps 3D Human Motion Modelling [Paper]

  • Forecasting Human Dynamics from Static Images [Paper]

  • HP-GAN: Probabilistic 3D human motion prediction via GAN [Paper]

  • Learning Latent Representations of 3D Human Pose with Deep Neural Networks [Paper]

  • A Recurrent Variational Autoencoder for Human Motion Synthesis [Paper]

  • Spatio-temporal Manifold Learning for Human Motions via Long-horizon Modeling [Paper]

  • Combining Recurrent Neural Networks and Adversarial Training for Human Motion Synthesis and Control [Paper]

  • PISEP2: Pseudo Image Sequence Evolution based 3D Pose Prediction [Paper]

  • Human Motion Prediction via Spatio-Temporal Inpainting [Paper]

  • Spatiotemporal Co-attention Recurrent Neural Networks for Human-Skeleton Motion Prediction [Paper]

  • Human Pose Forecasting via Deep Markov Models [Paper]

  • Auto-Conditioned Recurrent Networks For Extended Complex Human Motion Synthesis [Paper]

  • Predicting Long-Term Skeletal Motions by a Spatio-Temporal Hierarchical Recurrent Network [Paper]

As a Subtask

  • The Pose Knows: Video Forecasting by Generating Pose Futures [Paper]
  • I-Planner: Intention-Aware Motion Planning Using Learning Based Human Motion Prediction [Paper]
  • Language2Pose: Natural Language Grounded Pose Forecasting [Paper]
  • Long-Term Video Generation of Multiple Futures Using Human Poses [Paper]
  • Predicting body movements for person identification under different walking conditions [Paper]
Owner
Karttikeya Mangalam
PhD Student in Computer Vision @ BAIR, UC Berkeley.