Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Towards model-agnostic cooperative perception
Umeå University, Faculty of Science and Technology, Department of Applied Physics and Electronics.
Umeå University, Faculty of Science and Technology, Department of Applied Physics and Electronics. RISE Research Institutes of Sweden.ORCID iD: 0000-0002-0562-2082
2025 (English)In: 2025 International Joint Conference on Neural Networks (IJCNN), IEEE, 2025, p. 1-9Conference paper, Published paper (Refereed)
Abstract [en]

Existing cooperative perception systems often require joint training among participating agents. However, this assumption clashes with practical deployments where agents belong to diverse entities, each employing unique models and performing their own downstream tasks. Sharing model details for joint training is often hard to achieve since the concerns of intellectual property (IP) leakage and the centralized training may conflict with the downstream task of each agent. To address these challenges, we introduce IMCP, a robust Intermediate Model-Agnostic Cooperative Perception framework. IMCP enables universal agent fusion without the need for joint training or model sharing. Each agent undergoes independent initial training, followed by a cooperative fine-tuning stage where the feature encoder of each agent remains frozen. To effectively fuse features from diverse domains, we incorporate parameter-efficient hierarchical feature adaptation layers that map features into a common representation space. Furthermore, deformable attention is employed to selectively aggregate multiple Bird’s-Eye View (BEV) features of varying sizes. Extensive experiments on two real-world cooperative perception datasets demonstrate that IMCP achieves comparable performance to existing joint training methods. Code is available at https://github.com/JesseWong333/IMCP.

Place, publisher, year, edition, pages
IEEE, 2025. p. 1-9
Series
Proceedings of International Joint Conference on Neural Networks, ISSN 2161-4393, E-ISSN 2161-4407
Keywords [en]
Cooperative Perception, Autonomous driving
National Category
Computer and Information Sciences
Identifiers
URN: urn:nbn:se:umu:diva-246863DOI: 10.1109/ijcnn64981.2025.11229412Scopus ID: 2-s2.0-105029265314ISBN: 979-8-3315-1042-8 (electronic)ISBN: 979-8-3315-1043-5 (print)OAI: oai:DiVA.org:umu-246863DiVA, id: diva2:2016870
Conference
International Joint Conference on Neural Networks (IJCNN 2025), Rome, Italy, June 30 - July 5, 2025
Available from: 2025-11-26 Created: 2025-11-26 Last updated: 2026-04-13Bibliographically approved
In thesis
1. Cooperative perception for next-generation autonomous vehicles
Open this publication in new window or tab >>Cooperative perception for next-generation autonomous vehicles
2026 (English)Doctoral thesis, comprehensive summary (Other academic)
Alternative title[sv]
Samverkande perception för nästa generations autonoma fordon
Abstract [en]

Cooperative perception has emerged as a key paradigm for enhancing environmental understanding in multi-agent systems by fusing sensory information from multiple agents to achieve more comprehensive and accurate perception than single-agent approaches.Despite its demonstrated benefits, existing cooperative perception methods face critical limitations in practical deployments, primarily due to model heterogeneity, latency, and limited communication bandwidth.

This Ph.D. thesis addresses the gap between the theoretical promise of cooperative perception and its practical deployment by systematically investigating how to design cooperative perception systems that are robust, efficient, and scalable under realistic constraints. The main objective of this research is to develop unified frameworks that enable effective multi-agent perception.

To this end, the thesis proposes a series of novel methods targeting these challenges.First, as a foundational study, InputMix is proposed to balance the contributions of heterogeneous sensors in joint training scenarios. Second, an intermediate model-agnostic cooperative perception framework is introduced to enable modular training and seamless collaboration among agents with heterogeneous models. Third, the Latency-Robust Cooperative Perception (LRCP) framework is developed to mitigate the adverse effects of temporal misalignment among agents. Fourth, a lightweight, codebook-free feature compression framework is designed to reduce communication overhead while preserving perceptual performance. Finally, these components are integrated into a unified framework.

Extensive experiments on public benchmark datasets demonstrate that the proposed methods achieve perception performance comparable to the ideal scenario under latency constraints, while enabling effective collaboration among heterogeneous agents and substantially reducing communication bandwidth.

The main contributions of this thesis lie in establishing practical cooperative perception frameworks that collectively address multiple fundamental challenges in multi-agent perception. The findings of this research have broader implications for large-scale autonomous systems, including connected autonomous vehicles and distributed robotic platforms, where reliable cooperative perception under communication and system heterogeneity constraints is essential.

Place, publisher, year, edition, pages
Umeå: Umeå University, 2026. p. 65
Keywords
Cooperative Perception, Autonomous Driving
National Category
Electrical Engineering, Electronic Engineering, Information Engineering
Identifiers
urn:nbn:se:umu:diva-251909 (URN)978-91-6850-019-5 (ISBN)978-91-6850-020-1 (ISBN)
Public defence
2026-05-07, NAT.D.440, 09:00 (English)
Opponent
Supervisors
Available from: 2026-04-16 Created: 2026-04-13 Last updated: 2026-04-14Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Wang, JunjieNordström, Tomas

Search in DiVA

By author/editor
Wang, JunjieNordström, Tomas
By organisation
Department of Applied Physics and Electronics
Computer and Information Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 87 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf