A Pythonic library for Nvidia Codec.

The project is still in active development; expect breaking changes.

Why another Python library for Nvidia Codec?

Comparison to Video-Processing-Framework

Methodologies

VPF is written entirely in C++ and uses pybind to expose Python interfaces. PNC is written entirely in Python and uses ctypes to access the Nvidia C interfaces. Our code tends to be more concise, less repetitive, and easier to read and write.
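To illustrate the ctypes approach in general terms, here is a minimal sketch of calling a C function from pure Python. It is not PNC code: libc's `strlen` is used as a stand-in because the Nvidia libraries may not be installed, and the helper name `c_strlen` is hypothetical. It assumes a Linux or macOS system.

```python
import ctypes
import ctypes.util

# Stand-in library: libc is used only because it is available almost everywhere.
# A codec binding would load the vendor's shared library in the same way.
libc = ctypes.CDLL(ctypes.util.find_library("c"))

# Declare the C signature: size_t strlen(const char *s);
libc.strlen.argtypes = [ctypes.c_char_p]
libc.strlen.restype = ctypes.c_size_t


def c_strlen(text: bytes) -> int:
    """Hypothetical helper: call the C function with a Python bytes object."""
    return libc.strlen(text)
```

Declaring `argtypes` and `restype` up front lets ctypes check and convert arguments, which is what keeps a pure-Python binding short.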

Performance

Preliminary tests show little to no difference in performance, because the heavy lifting is done on the GPU anyway. Both libraries can saturate the GPU decoder. PNC uses more CPU than VPF, as expected from Python vs. C++, but the overhead is still negligible (less than 10% of a single Ryzen 3100 core for 8K*4K HEVC).

Resource Management

In VPF, a Surface given to the user is not owned by the user: it will be overwritten by new frames, which is counter-intuitive. Pictures are not exposed to the user at all; they are always mapped (post-processed and copied) to a Surface so that the Picture can be reused for new frames. The latter is inefficient when only a subset of Pictures is needed (e.g. screenshots).
This is because VPF allocates the bare minimum of resources needed for most decoding tasks. PNC lets the user specify the amount of resources to allocate for advanced applications. Users own the resources and decide when and whether to process them.
Managing resources is not painful: as in pycuda, we shift the burden of managing host/device resources to the Python garbage collector. Resources (such as Picture and Surface) are freed automatically when the user drops the last reference.
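The idea of tying a resource's lifetime to a Python object can be sketched with the standard library's `weakref.finalize`. This is an illustration only, not PNC's actual implementation: the `Surface` class below, its `handle` argument, and `_free_surface` are hypothetical stand-ins for a real device allocation and its free call.

```python
import weakref

released = []  # records which hypothetical handles have been freed


def _free_surface(handle):
    # Stand-in for a real device-memory free call.
    released.append(handle)


class Surface:
    """Hypothetical wrapper that owns one device resource."""

    def __init__(self, handle):
        self.handle = handle
        # Run _free_surface(handle) once this object becomes unreachable.
        self._finalizer = weakref.finalize(self, _free_surface, handle)


surface = Surface(handle=1)
del surface  # dropping the last reference allows the finalizer to run
```

In CPython the finalizer runs as soon as the reference count reaches zero; other interpreters may delay it until a garbage collection pass.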

Things to come

TODO Cropping and scaling support in postprocessing
TODO Color space conversion from YUV (BT.601/709, full-range/limited-range) to RGB using pycuda
TODO Encoder
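For reference, the arithmetic behind the planned color space conversion can be sketched on the CPU in plain Python for a single 8-bit pixel. This is not the planned pycuda implementation; the function name `yuv_to_rgb` and its parameters are hypothetical, and only the luma coefficients are taken from the BT.601 and BT.709 standards.

```python
# Luma coefficients (Kr, Kb) defined by each standard.
COEFFS = {"bt601": (0.299, 0.114), "bt709": (0.2126, 0.0722)}


def yuv_to_rgb(y, cb, cr, standard="bt709", full_range=False):
    """Convert one 8-bit Y'CbCr pixel to an 8-bit (R, G, B) tuple."""
    kr, kb = COEFFS[standard]
    kg = 1.0 - kr - kb
    if full_range:
        # Full range: luma and chroma both span 0..255.
        yn, pb, pr = y / 255.0, (cb - 128) / 255.0, (cr - 128) / 255.0
    else:
        # Limited range: luma spans 16..235, chroma spans 16..240.
        yn, pb, pr = (y - 16) / 219.0, (cb - 128) / 224.0, (cr - 128) / 224.0
    r = yn + 2 * (1 - kr) * pr
    b = yn + 2 * (1 - kb) * pb
    g = (yn - kr * r - kb * b) / kg
    # Scale back to 8 bits and clamp out-of-gamut values.
    return tuple(min(255, max(0, round(c * 255))) for c in (r, g, b))
```

A GPU version would apply the same per-pixel formula in a kernel, with the range and standard selected per stream.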

Acknowledgements

Many thanks to @rarzumanyan for all the help and explanations!

Owner

Zesen Qian
