Endre søk
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Labelling of Annotated Condition Monitoring Data Through Technical Language Processing
Luleå University of Technology, Sweden.
SKF, Netherlands.
SKF, Netherlands.
RISE Research Institutes of Sweden, Digitala system, Datavetenskap.ORCID-id: 0000-0002-7873-3971
Vise andre og tillknytning
2023 (engelsk)Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

We propose a novel approach, technical language labelling, to facilitate supervised intelligent fault diagnosis on unlabelled but annotated industry datasets using technical language processing. Condition monitoring (CM) is vital for high safety and resource efficiency in the green transition and digital transformation of the process industry. Computerised maintenance systems are required to facilitate CM scalability, and learning-based Intelligent Fault Diagnosis (IFD) methods are required to automate maintenance decisions and improve support for human analysts. A major challenge is the lack of labelled datasets from industry and the difficulty of transferring features from labelled lab datasets to unlabelled industry datasets. In this study, we investigate how the fault description annotations and maintenance work orders present in many CM datasets can be understood and used for IFD through Technical Language Processing, based on insights from recent advances in Natural Language Supervision joint pre-training of images and captions. We identify two distinct pipelines, one based on pre-training on large datasets, and one based on a human-centric approach and unsupervised clustering methods to transform annotations into labels, aided by insights from dimensionality reduction and visualisation techniques. Finally, we showcase one example of the small-data fault classification implementation on a CM industry dataset with a Sentence BERT model and conventional signal processing methods. Sets of features are used to overcome data imbalance and label misalignment, and we show that our model can separate sets of cable and sensor fault recordings from sets of bearing-related fault recordings with an F1-score of 92.6%. To our knowledge, this is the first system to create labels for CM data through pre-trained language models without requiring pre-defined taxonomies. 

sted, utgiver, år, opplag, sider
Prognostics and Health Management Society , 2023. Vol. 15, nr 1
Emneord [en]
Accident prevention; Classification (of information); Failure analysis; Fault detection; Large dataset; Maintenance; Natural language processing systems; Signal processing; Condition-monitoring data; Fault recording; Green transitions; High safety; Intelligent fault diagnosis; Labelings; Language processing; Pre-training; Resource efficiencies; Technical languages; Condition monitoring
HSV kategori
Identifikatorer
URN: urn:nbn:se:ri:diva-69278Scopus ID: 2-s2.0-85178380051OAI: oai:DiVA.org:ri-69278DiVA, id: diva2:1826264
Konferanse
15th Annual Conference of the Prognostics and Health Management Society, PHM 2023. Salt Lake City, USA. 28 October 2023 through 2 November 2023
Merknad

This work is supported by the Strategic innovation program Process industrial IT and Automation(PiIA), a joint investment of Vinnova, Formas andthe Swedish Energy Agency, reference number 2019-02533. T

Tilgjengelig fra: 2024-01-11 Laget: 2024-01-11 Sist oppdatert: 2025-09-23bibliografisk kontrollert

Open Access i DiVA

fulltext(11246 kB)148 nedlastinger
Filinformasjon
Fil FULLTEXT01.pdfFilstørrelse 11246 kBChecksum SHA-512
84b7e14fae11bc63ef1febe9cfc6ecb56d56a6059b4752dc0bf0790cb1e9d29c7b940c033e6a6cc419da41e4e2b8de315cc194e98399273830be6b3e4934ff69
Type fulltextMimetype application/pdf

Scopus

Person

Nivre, Joakim

Søk i DiVA

Av forfatter/redaktør
Nivre, Joakim
Av organisasjonen

Søk utenfor DiVA

GoogleGoogle Scholar
Totalt: 148 nedlastinger
Antall nedlastinger er summen av alle nedlastinger av alle fulltekster. Det kan for eksempel være tidligere versjoner som er ikke lenger tilgjengelige

urn-nbn

Altmetric

urn-nbn
Totalt: 504 treff
RefereraExporteraLink to record
Permanent link

Direct link
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annet format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annet språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
v. 2.47.0