Change search
Link to record
Permanent link

Direct link
Publications (3 of 3) Show all publications
Zec, E. L., Ekblom, E., Willbo, M., Mogren, O. & Girdzijauskas, S. (2022). Decentralized adaptive clustering of deep nets is beneficial for client collaboration. In: : . Paper presented at International Workshop on Trustworthy Federated Learning 2022.
Open this publication in new window or tab >>Decentralized adaptive clustering of deep nets is beneficial for client collaboration
Show others...
2022 (English)Conference paper, Published paper (Refereed)
Abstract [en]

We study the problem of training personalized deep learning models in a decentralized peer-to-peer setting, focusing on the setting where data distributions differ between the clients and where different clients have different local learning tasks. We study both covariate and label shift, and our contribution is an algorithm which for each client finds beneficial collaborations based on a similarity estimate for the local task. Our method does not rely on hyperparameters which are hard to estimate, such as the number of client clusters, but rather continuously adapts to the network topology using soft cluster assignment based on a novel adaptive gossip algorithm. We test the proposed method in various settings where data is not independent and identically distributed among the clients. The experimental evaluation shows that the proposed method performs better than previous state-of-the-art algorithms for this problem setting, and handles situations well where previous methods fail

National Category
Computer Sciences
Identifiers
urn:nbn:se:ri:diva-62529 (URN)
Conference
International Workshop on Trustworthy Federated Learning 2022
Available from: 2023-01-13 Created: 2023-01-13 Last updated: 2024-07-28Bibliographically approved
Martinsson, J., Willbo, M., Pirinen, A., Mogren, O. & Sandsten, M. (2022). Few-shot bioacoustic event detection using a prototypical network ensemble with adaptive embedding functions. In: : . Paper presented at Detection and Classification of Acoustic Scenes and Events 2022, DCASE 2022.
Open this publication in new window or tab >>Few-shot bioacoustic event detection using a prototypical network ensemble with adaptive embedding functions
Show others...
2022 (English)Conference paper, Published paper (Refereed)
Abstract [en]

In this report we present our method for the DCASE 2022 challenge on few-shot bioacoustic event detection. We use an ensemble of prototypical neural networks with adaptive embedding functions and show that both ensemble and adaptive embedding functions can be used to improve results from an average F-score of 41.3% to an average F-score of 60.0% on the validation dataset.

Keywords
Machine listening, bioacoustics, few-shot learning, ensemble
National Category
Language Technology (Computational Linguistics)
Identifiers
urn:nbn:se:ri:diva-62530 (URN)
Conference
Detection and Classification of Acoustic Scenes and Events 2022, DCASE 2022
Available from: 2023-01-13 Created: 2023-01-13 Last updated: 2024-07-28Bibliographically approved
Martinsson, J., Willbo, M., Pirinen, A., Mogren, O. & Sandsten, M. (2022). FEW-SHOT BIOACOUSTIC EVENT DETECTION USING AN EVENT-LENGTH ADAPTED ENSEMBLE OF PROTOTYPICAL NETWORKS. In: : . Paper presented at Detection and Classification of Acoustic Scenes and Events 2022. 3–4 November 2022, Nancy, France.
Open this publication in new window or tab >>FEW-SHOT BIOACOUSTIC EVENT DETECTION USING AN EVENT-LENGTH ADAPTED ENSEMBLE OF PROTOTYPICAL NETWORKS
Show others...
2022 (English)Conference paper, Published paper (Refereed)
Abstract [en]

In this paper we study two major challenges in few-shot bioacoustic event detection: variable event lengths and false-positives. We use prototypical networks where the embedding function is trained using a multi-label sound event detection model instead of using episodic training as the proxy task on the provided training dataset. This is motivated by polyphonic sound events being present in the base training data. We propose a method to choose the embedding function based on the average event length of the few-shot examples and show that this makes the method more robust towards variable event lengths. Further, we show that an ensemble of prototypical neural networks trained on different training and validation splits of time-frequency images with different loudness normalizations reduces false-positives. In addition, we present an analysis on the effect that the studied loudness normalization techniques have on the performance of the prototypical network ensemble. Overall, per-channel energy normalization (PCEN) outperforms the standard log transform for this task. The method uses no data augmentation and no external data. The proposed approach achieves a F-score of 48.0% when evaluated on the hidden test set of the Detection and Classification of Acoustic Scenes and Events (DCASE) task 5

Keywords
Machine listening, bioacoustics, few-shot learning, ensemble
National Category
Computer Sciences
Identifiers
urn:nbn:se:ri:diva-62540 (URN)
Conference
Detection and Classification of Acoustic Scenes and Events 2022. 3–4 November 2022, Nancy, France
Available from: 2023-01-16 Created: 2023-01-16 Last updated: 2024-07-28Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0009-0004-1803-4193

Search in DiVA

Show all publications