TEDSummary is a speech summary corpus. It includes TED talks subtitle (Document), Title-Detail (Summary), speaker name (Meta info), MP4 URL, and utterance id

Last update: Dec 26, 2022

Related tags

Deep Learning TEDSummary

Overview

TEDSummary

TEDSummary is a speech summary corpus. It includes TED talks subtitle (Document), Title-Detail (Summary), speaker name (Meta info), MP4 URL, and utterance id. This script crawls the TEDTalk website to get the above information. However, this script does not supply audio data. You can use the utterance id to align TED-LIUM3 (https://www.openslr.org/51/) or extract audio from the MP4 file.

References

[1] Takatomo Kano, Atsunori Ogawa, Marc Delcroix, and Shinji Watanabe "Attention-based Multi-hypothesis Fusion for Speech Summarization," Proc. ASRU, pp. –, 2021

Citation
@inproceedings{attention-fusion,
author = {Takatomo Kano and Atsunori Ogawa and Marc Delcroix and Shinji Watanabe},
title = {Attention-based Multi-hypothesis Fusion for Speech Summarization},
booktitle = {{ASRU 2021 - 2021 IEEE Automatic Speech Recoginition and Understanding Workshop (ASRU)}},
pages={-},
year = {2021}
}

Install tools

Python 3. requests unidecode json tqdm unicodedata

How to run

cd TEDSummary/ python TEDListCrawler.py

Outputs

telklist.json: URLs list for tedtalks.
ted_summary.json: Summarization dataset. That includes summary IDs, TEDTalk URL, mp4 URL, document, abstract, title, speaker name, and uttrance id for Tedlium alignment.

TEDSummary is a speech summary corpus. It includes TED talks subtitle (Document), Title-Detail (Summary), speaker name (Meta info), MP4 URL, and utterance id

Related tags

Overview

TEDSummary

References

Install tools

How to run

Outputs

Owner

Traffic4D: Single View Reconstruction of Repetitious Activity Using Longitudinal Self-Supervision

[NeurIPS 2021 Spotlight] Aligning Pretraining for Detection via Object-Level Contrastive Learning

Deep generative models of 3D grids for structure-based drug discovery

CONditionals for Ordinal Regression and classification in PyTorch

Crab is a ﬂexible, fast recommender engine for Python that integrates classic information ﬁltering recommendation algorithms in the world of scientiﬁc Python packages (numpy, scipy, matplotlib).

Covid-19 Test AI (Deep Learning - NNs) Software. Accuracy is the %96.5, loss is the 0.09 :)

Pytorch Implementation of Interaction Networks for Learning about Objects, Relations and Physics

Tandem Mass Spectrum Prediction with Graph Transformers

The devkit of the nuPlan dataset.

TAP: Text-Aware Pre-training for Text-VQA and Text-Caption, CVPR 2021 (Oral)

TransVTSpotter: End-to-end Video Text Spotter with Transformer

A pytorch-based deep learning framework for multi-modal 2D/3D medical image segmentation

Paper: De-rendering Stylized Texts

Trajectory Prediction with Graph-based Dual-scale Context Fusion

This is a yolo3 implemented via tensorflow 2.7

El-Gamal on Elliptic Curve (Python)

Supervised Contrastive Learning for Downstream Optimized Sequence Representations

Neural Geometric Level of Detail: Real-time Rendering with Implicit 3D Shapes (CVPR 2021 Oral)

PyTorch implementation for our AAAI 2022 Paper "Graph-wise Common Latent Factor Extraction for Unsupervised Graph Representation Learning"

WRENCH: Weak supeRvision bENCHmark