Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
From Weak to Strong Sound Event Labels using Adaptive Change-Point Detection and Active Learning
RISE Research Institutes of Sweden, Digital Systems, Data Science.ORCID iD: 0000-0002-5032-4367
RISE Research Institutes of Sweden, Digital Systems, Data Science.ORCID iD: 0000-0002-9567-2218
Lund University, Sweden.
Tampere University, Finland.
2024 (English)Conference paper, Published paper (Refereed)
Abstract [en]

We propose an adaptive change point detection method (A-CPD) for machine guided weak label annotation of audio recording segments. The goal is to maximize the amount of information gained about the temporal activations of the target sounds. For each unlabeled audio recording, we use a prediction model to derive a probability curve used to guide annotation. The prediction model is initially pre-trained on available annotated sound event data with classes that are disjoint from the classes in the unlabeled dataset. The prediction model then gradually adapts to the annotations provided by the annotator in an active learning loop. We derive query segments to guide the weak label annotator towards strong labels, using change point detection on these probabilities. We show that it is possible to derive strong labels of high quality with a limited annotation budget, and show favorable results for A-CPD when compared to two baseline query segment strategies. 

Place, publisher, year, edition, pages
European Signal Processing Conference, EUSIPCO , 2024. p. 902-906
Keywords [en]
Adversarial machine learning; Audio recordings; Budget control; Change detection; Contrastive Learning; Deep learning; Prediction models; Sound recording; Active Learning; Annotation; Change point detection; Deep learning; Detection methods; Prediction modelling; Query segments; Sound event detection; Sound events; Weak labels; Active learning
National Category
Computer and Information Sciences
Identifiers
URN: urn:nbn:se:ri:diva-76157Scopus ID: 2-s2.0-85208422384ISBN: 9789464593617 (electronic)OAI: oai:DiVA.org:ri-76157DiVA, id: diva2:1914551
Conference
32nd European Signal Processing Conference, EUSIPCO 2024. Lyon. 26 August 2024 through 30 August 2024
Note

This work was supported by The Swedish Foundation for Strategic Research (SSF; FID20-0028) and Sweden\u2019s Innovation Agency (2023-01486).

Available from: 2024-11-19 Created: 2024-11-19 Last updated: 2024-11-19Bibliographically approved

Open Access in DiVA

No full text in DiVA

Scopus

Authority records

Martinsson, JohnMogren, Olof

Search in DiVA

By author/editor
Martinsson, JohnMogren, Olof
By organisation
Data Science
Computer and Information Sciences

Search outside of DiVA

GoogleGoogle Scholar

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 7 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf