Skip to content

Latest commit

 

History

History
217 lines (200 loc) · 41.8 KB

README.md

File metadata and controls

217 lines (200 loc) · 41.8 KB

Awesome Audio-Visual: Awesome

A curated list of papers and datsets for various audio-visual tasks, inspired by awesome-computer-vision.

Contents

Audio-Visual Localization

Audio-Visual Separation

Audio-Visual Representation/Classification

Audio-Visual Action Recognition

Audio-Visual Spatial/Depth

Audio-Visual Navigation/RL

Audio-Visual Faces/Speech

Cross-modal Generation (Audio-Video / Video-Audio)

Multi-modal Architectures

Uncategorized Papers

Datasets

General Audio-Visual Tasks

Face-Voice Dataset

Licenses

License

CC0

To the extent possible under law, Kranti Kumar Parida has waived all copyright and related or neighboring rights to this work.

Contributing

Please feel free to send me pull requests or email ([email protected]) to add links, correct wrong ones or if you find any broken links.