Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Semantic Segmentation of Fashion Images Using Feature Pyramid Networks
RISE Research Institutes of Sweden, Digital Systems, Data Science.ORCID iD: 0000-0002-5032-4367
RISE Research Institutes of Sweden, Digital Systems, Data Science.ORCID iD: 0000-0002-9567-2218
2019 (English)In: 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW), 2019, p. 3133-3136Conference paper, Published paper (Refereed)
Abstract [en]

In this work, we approach the problem of semantically segmenting fashion images into different categories of clothing. This problem poses particular challenges because of the importance of both textural information and cues from shapes and context. To this end, we propose a fully convolutional neural network based on feature pyramid networks (FPN), together with a backbone consisting of the ResNeXt architecture. Our experimental evaluation shows that the proposed model achieves state-of-the-art results on two standard fashion benchmark datasets, and a qualitative study verifies its effectiveness when applied to typical fashion images. The approach has a modest memory footprint and can be used without a conditional random field (CRF) without much degradation of quality which makes our model preferable from a computational perspective. When comparing all methods without a CRF, our approach outperforms all state-of-the-art models on both datasets by a clear margin in all evaluated metrics. In fact, our approach achieves a higher accuracy without the CRF than the state-of-the-art models using CRFs.

Place, publisher, year, edition, pages
2019. p. 3133-3136
Keywords [en]
feature extraction, image classification, image segmentation, image texture, learning (artificial intelligence), neural nets, feature pyramid networks, ResNeXt architecture, standard fashion benchmark datasets, fashion images, CRF, semantic segmentation, textural information, fully convolutional neural network, conditional random field, Clothing, Training, Semantics, Shape, Computational modeling, convolutional neural networks
National Category
Natural Sciences
Identifiers
URN: urn:nbn:se:ri:diva-51881DOI: 10.1109/ICCVW.2019.00382OAI: oai:DiVA.org:ri-51881DiVA, id: diva2:1519211
Conference
2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW)
Available from: 2021-01-18 Created: 2021-01-18 Last updated: 2023-06-02Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full text

Authority records

Martinsson, JohnMogren, Olof

Search in DiVA

By author/editor
Martinsson, JohnMogren, Olof
By organisation
Data Science
Natural Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 35 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf