From Weak to Strong Sound Event Labels using Adaptive Change-Point Detection and Active Learning
2024 (English)Conference paper, Published paper (Refereed)
Abstract [en]
We propose an adaptive change point detection method (A-CPD) for machine guided weak label annotation of audio recording segments. The goal is to maximize the amount of information gained about the temporal activations of the target sounds. For each unlabeled audio recording, we use a prediction model to derive a probability curve used to guide annotation. The prediction model is initially pre-trained on available annotated sound event data with classes that are disjoint from the classes in the unlabeled dataset. The prediction model then gradually adapts to the annotations provided by the annotator in an active learning loop. We derive query segments to guide the weak label annotator towards strong labels, using change point detection on these probabilities. We show that it is possible to derive strong labels of high quality with a limited annotation budget, and show favorable results for A-CPD when compared to two baseline query segment strategies.
Place, publisher, year, edition, pages
European Signal Processing Conference, EUSIPCO , 2024. p. 902-906
Keywords [en]
Adversarial machine learning; Audio recordings; Budget control; Change detection; Contrastive Learning; Deep learning; Prediction models; Sound recording; Active Learning; Annotation; Change point detection; Deep learning; Detection methods; Prediction modelling; Query segments; Sound event detection; Sound events; Weak labels; Active learning
National Category
Computer and Information Sciences
Identifiers
URN: urn:nbn:se:ri:diva-76157Scopus ID: 2-s2.0-85208422384ISBN: 9789464593617 (electronic)OAI: oai:DiVA.org:ri-76157DiVA, id: diva2:1914551
Conference
32nd European Signal Processing Conference, EUSIPCO 2024. Lyon. 26 August 2024 through 30 August 2024
Note
This work was supported by The Swedish Foundation for Strategic Research (SSF; FID20-0028) and Sweden\u2019s Innovation Agency (2023-01486).
2024-11-192024-11-192024-11-19Bibliographically approved