HUMAN4D: A Human-Centric Multimodal Dataset for Motions & Immersive Media

HUMAN4D constitutes a large and multimodal 4D dataset that contains a variety of human activities simultaneously captured by a professional marker-based MoCap, a volumetric capture and an audio recording system.

The related paper can be found here in PDF.

You can download the dataset from Zenodo (in various parts):

For data that are not publicly available but are included in the HUMAN4D dataset, contact us @ tofis3d [at] central.ntua.gr.

Pictures taken during the preparation and capturing of the HUMAN4D dataset. The room was equipped with 24 Vicon MXT40S cameras rigidly placed on the walls, while a portable volumetric capturing system (https://github.com/VCL3D/VolumetricCapture) with 4 Intel RealSense D415 depth sensors was temporarily set up to capture the RGBD data cues.

HW-SYNCed multi-view RGBD samples (4 RGBD frames each) from "stretching_n_talking"(top) and "basket-ball_dribbling"(bottom) activities.

3D Scanning using a custom photogrammetry rig with 96 cameras, photos were taken of the actor (left) and reconstructed into a 3D textured mesh using Agisoft Metashape (right).

Reconstructed mesh-based volumetric data with (Left) color per vertex visualization in 3 voxel-grid resolutions, i.e. r= 5, r= 6 andr= 7 and (Right) textured 3D mesh sample in voxel-grid resolution for r= 6.

Merged reconstructed point-cloud from one single mRGBD frame from various views.

If you used the dataset or found this work useful, please cite:

@article{chatzitofis2020human4d,
  title={HUMAN4D: A Human-Centric Multimodal Dataset for Motions and Immersive Media},
  author={Chatzitofis, Anargyros and Saroglou, Leonidas and Boutis, Prodromos and Drakoulis, Petros and Zioulis, Nikolaos and Subramanyam, Shishir and Kevelham, Bart and Charbonnier, Caecilia and Cesar, Pablo and Zarpalas, Dimitrios and others},
  journal={IEEE Access},
  volume={8},
  pages={176241--176262},
  year={2020},
  publisher={IEEE}
}

Human4D Dataset tools for processing and visualization

Related tags

Overview

HUMAN4D: A Human-Centric Multimodal Dataset for Motions & Immersive Media

Owner

tofis

[CVPR'21] Learning to Recommend Frame for Interactive Video Object Segmentation in the Wild

Implementation of paper "Graph Condensation for Graph Neural Networks"

Groceries ARL: Association Rules (Birliktelik Kuralı)

Bot developed in Python that automates races in pegaxy.

This repo in the implementation of EMNLP'21 paper "SPARQLing Database Queries from Intermediate Question Decompositions" by Irina Saparina, Anton Osokin

An experimentation and research platform to investigate the interaction of automated agents in an abstract simulated network environments.

Xview3 solution - XView3 challenge, 2nd place solution

NLG evaluation via Statistical Measures of Similarity: BaryScore, DepthScore, InfoLM

Create images and texts with the First Order Generative Adversarial Networks

Code & Data for the Paper "Time Masking for Temporal Language Models", WSDM 2022

[ACM MM 2019 Oral] Cycle In Cycle Generative Adversarial Networks for Keypoint-Guided Image Generation

deep_image_prior_extension

Summary Explorer is a tool to visually explore the state-of-the-art in text summarization.

A simple python program that can be used to implement user authentication tokens into your program...

This repository contains the code for EMNLP-2021 paper "Word-Level Coreference Resolution"

BRepNet: A topological message passing system for solid models

PyTorch implementation of MICCAI 2018 paper "Liver Lesion Detection from Weakly-labeled Multi-phase CT Volumes with a Grouped Single Shot MultiBox Detector"

Code release of paper "Deep Multi-View Stereo gone wild"

PyTorch implementations of deep reinforcement learning algorithms and environments

Stock-history-display - something like a easy yearly review for your stock performance