Skip to content

Latest commit

 

History

History
12 lines (6 loc) · 559 Bytes

Localization, detection and tracking of multiple moving sound sources with a convolutional recurrent neural network阅读笔记.md

File metadata and controls

12 lines (6 loc) · 559 Bytes

Introduction

Sound event localization, detection, and tracking (SELDT) is the combined task of identifying the temporal onset and offset of potentially temporally-overlapping sound events, recognizing their classes, and tracking their respective spatial trajectory when they are active.

Method

first detect and then localize

image-20211114154706334

The SELDnet maps the spectrogram to two outputs – sound event detection, and tracking; together they produce the SELDT output.