Towards Distributed and Adaptive Detection and Localisation of Network Faults
RISE, Swedish ICT, SICS, Decisions, Networks and Analytics lab. ORCID iD: 0000-0002-5893-7774
RISE, Swedish ICT, SICS, Decisions, Networks and Analytics lab. ORCID iD: 0000-0001-8952-3542
2010 (English). Conference paper, Published paper (Refereed)
Abstract [en]

We present a statistical probing approach to distributed fault detection in networked systems, based on autonomous configuration of algorithm parameters. Statistical modelling is used for detection and localisation of network faults, and a detected fault is isolated to a node or link by collaborative fault localisation. From local measurements obtained through probing between nodes, probe response delay and packet drop are modelled via parameter estimation for each link. The estimated model parameters are used for autonomous configuration of algorithm parameters related to probe intervals and detection mechanisms. Expected fault-detection performance is specified as a cost rather than as specific parameter values, which significantly reduces configuration effort in a distributed system. Compared with methods that do not take observed network conditions into account, the algorithm offers fault detection with increased certainty based on local measurements. We investigate the algorithm's performance for varying user parameters and failure conditions. The simulation results indicate that more than 95 % of the generated faults can be detected with few false alarms. At least 80 % of the link faults and 65 % of the node faults are correctly localised. The performance can be improved by parameter adjustments and by using alternative paths for communication of algorithm control messages.

Place, publisher, year, edition, pages
2010, 15. p. 1-6
Keywords [en]
adaptive probing, distributed fault-detection, fault-localisation
National Category
Computer and Information Sciences
Identifiers
URN: urn:nbn:se:ri:diva-23794
DOI: 10.1109/AICT.2010.65
OAI: oai:DiVA.org:ri-23794
DiVA, id: diva2:1042871
Conference
2010 Sixth Advanced International Conference on Telecommunications
Projects
CNS4WARDprobman
Available from: 2016-10-31. Created: 2016-10-31. Last updated: 2020-12-01. Bibliographically approved.

Open Access in DiVA

fulltext (393 kB), 153 downloads
File information
File name: FULLTEXT01.pdf; File size: 393 kB; Checksum (SHA-512):
910403741e6d8f2ea7bda8436a308e7ac9c3dec11fdda340022346360f0549b50234962c1632895d4b03740c6fc9490a465133f35a42a8ea125f64a5f96108e7
Type: fulltext; Mimetype: application/pdf

Authority records

Steinert, Rebecca; Gillblad, Daniel
