Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Labelling of Annotated Condition Monitoring Data Through Technical Language Processing
Luleå University of Technology, Sweden.
SKF, Netherlands.
SKF, Netherlands.
RISE Research Institutes of Sweden, Digitala system, Datavetenskap.ORCID-id: 0000-0002-7873-3971
Visa övriga samt affilieringar
2023 (Engelska)Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

We propose a novel approach, technical language labelling, to facilitate supervised intelligent fault diagnosis on unlabelled but annotated industry datasets using technical language processing. Condition monitoring (CM) is vital for high safety and resource efficiency in the green transition and digital transformation of the process industry. Computerised maintenance systems are required to facilitate CM scalability, and learning-based Intelligent Fault Diagnosis (IFD) methods are required to automate maintenance decisions and improve support for human analysts. A major challenge is the lack of labelled datasets from industry and the difficulty of transferring features from labelled lab datasets to unlabelled industry datasets. In this study, we investigate how the fault description annotations and maintenance work orders present in many CM datasets can be understood and used for IFD through Technical Language Processing, based on insights from recent advances in Natural Language Supervision joint pre-training of images and captions. We identify two distinct pipelines, one based on pre-training on large datasets, and one based on a human-centric approach and unsupervised clustering methods to transform annotations into labels, aided by insights from dimensionality reduction and visualisation techniques. Finally, we showcase one example of the small-data fault classification implementation on a CM industry dataset with a Sentence BERT model and conventional signal processing methods. Sets of features are used to overcome data imbalance and label misalignment, and we show that our model can separate sets of cable and sensor fault recordings from sets of bearing-related fault recordings with an F1-score of 92.6%. To our knowledge, this is the first system to create labels for CM data through pre-trained language models without requiring pre-defined taxonomies. 

Ort, förlag, år, upplaga, sidor
Prognostics and Health Management Society , 2023. Vol. 15, nr 1
Nyckelord [en]
Accident prevention; Classification (of information); Failure analysis; Fault detection; Large dataset; Maintenance; Natural language processing systems; Signal processing; Condition-monitoring data; Fault recording; Green transitions; High safety; Intelligent fault diagnosis; Labelings; Language processing; Pre-training; Resource efficiencies; Technical languages; Condition monitoring
Nationell ämneskategori
Naturvetenskap
Identifikatorer
URN: urn:nbn:se:ri:diva-69278Scopus ID: 2-s2.0-85178380051OAI: oai:DiVA.org:ri-69278DiVA, id: diva2:1826264
Konferens
15th Annual Conference of the Prognostics and Health Management Society, PHM 2023. Salt Lake City, USA. 28 October 2023 through 2 November 2023
Anmärkning

This work is supported by the Strategic innovation program Process industrial IT and Automation(PiIA), a joint investment of Vinnova, Formas andthe Swedish Energy Agency, reference number 2019-02533. T

Tillgänglig från: 2024-01-11 Skapad: 2024-01-11 Senast uppdaterad: 2025-09-23Bibliografiskt granskad

Open Access i DiVA

fulltext(11246 kB)148 nedladdningar
Filinformation
Filnamn FULLTEXT01.pdfFilstorlek 11246 kBChecksumma SHA-512
84b7e14fae11bc63ef1febe9cfc6ecb56d56a6059b4752dc0bf0790cb1e9d29c7b940c033e6a6cc419da41e4e2b8de315cc194e98399273830be6b3e4934ff69
Typ fulltextMimetyp application/pdf

Scopus

Person

Nivre, Joakim

Sök vidare i DiVA

Av författaren/redaktören
Nivre, Joakim
Av organisationen
Datavetenskap
Naturvetenskap

Sök vidare utanför DiVA

GoogleGoogle Scholar
Totalt: 148 nedladdningar
Antalet nedladdningar är summan av nedladdningar för alla fulltexter. Det kan inkludera t.ex tidigare versioner som nu inte längre är tillgängliga.

urn-nbn

Altmetricpoäng

urn-nbn
Totalt: 504 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf