Achieving Robust Self-Management for Large-Scale Distributed Applications
RISE, Swedish ICT, SICS, Computer Systems Laboratory. ORCID iD: 0000-0002-9546-4937
RISE, Swedish ICT, SICS, Computer Systems Laboratory. ORCID iD: 0000-0002-9431-5139
2010 (English) Report (Other academic)
Abstract [en]

Autonomic managers are the main architectural building blocks for constructing self-management capabilities in computing systems and applications. One of the major challenges in developing self-managing applications is the robustness of the management elements that form autonomic managers. We believe that transparent handling of the effects of resource churn (joins, leaves, and failures) on management should be an essential feature of a platform for self-managing large-scale dynamic distributed applications, because it facilitates the development of robust autonomic managers and hence improves the robustness of self-managing applications. This feature can be achieved by providing a robust management element abstraction that hides churn from the programmer. In this paper, we present a generic approach to achieving robust services that is based on finite state machine replication with dynamic reconfiguration of replica sets. We contribute a decentralized algorithm that maintains the set of nodes hosting service replicas in the presence of churn. We use this approach to implement robust management elements as robust services that can operate despite churn. The proposed algorithm uses peer-to-peer replica placement schemes to automate the migration of replicated state machines in order to tolerate churn, and it exploits the lookup and failure detection facilities of a structured overlay network to manage the set of active replicas. Using the proposed approach, we can achieve a long-running and highly available service, without human intervention, in the presence of resource churn. To validate and evaluate our approach, we have implemented a prototype that includes the proposed algorithm.
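The idea of maintaining a replica set through overlay lookup can be illustrated with a minimal sketch. It is not the algorithm from the report: the abstract does not specify the placement scheme, so the sketch assumes a ring-shaped identifier space, a successor-style lookup, and replica keys spread evenly around the ring. All names (`ID_SPACE`, `lookup`, `replica_keys`, `replica_set`, `reconfigure`) are hypothetical, and failure detection and state transfer are left out.

```python
from bisect import bisect_left

ID_SPACE = 2 ** 16  # hypothetical size of the overlay identifier space


def lookup(nodes, key):
    """Return the live node responsible for `key`: the first node id
    greater than or equal to the key, wrapping around the ring."""
    ring = sorted(nodes)
    if not ring:
        raise ValueError("no live nodes")
    i = bisect_left(ring, key % ID_SPACE)
    return ring[i % len(ring)]


def replica_keys(service_id, degree):
    """Spread `degree` replica keys evenly around the identifier space
    (an assumed placement scheme, chosen only for illustration)."""
    step = ID_SPACE // degree
    return [(service_id + i * step) % ID_SPACE for i in range(degree)]


def replica_set(nodes, service_id, degree):
    """Map each replica key to the node currently responsible for it."""
    return [lookup(nodes, k) for k in replica_keys(service_id, degree)]


def reconfigure(old_set, nodes, service_id, degree):
    """Recompute the placement after churn and list the replicas that
    must migrate as (replica index, old node, new node) tuples."""
    new_set = replica_set(nodes, service_id, degree)
    migrations = [
        (i, old, new)
        for i, (old, new) in enumerate(zip(old_set, new_set))
        if old != new
    ]
    return new_set, migrations
```

Calling `replica_set` once gives the initial hosts; after a node joins, leaves, or fails, calling `reconfigure` with the updated node set shows which replicas would have to move. With few live nodes, two replica keys can map to the same node, which a real scheme would need to handle.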

Place, publisher, year, edition, pages
Kista, Sweden, 2010, 7.
Series
SICS Technical Report, ISSN 1100-3154 ; 2010:02
Keywords [en]
autonomic computing, distributed systems, self-management, replicated state machines, service migration, peer-to-peer
National Category
Computer and Information Sciences
Identifiers
URN: urn:nbn:se:ri:diva-23666
OAI: oai:DiVA.org:ri-23666
DiVA, id: diva2:1042743
Available from: 2016-10-31. Created: 2016-10-31. Last updated: 2023-05-25. Bibliographically approved.

Open Access in DiVA

fulltext (173 kB), 204 downloads
File information
File name: FULLTEXT01.pdf
File size: 173 kB
Checksum (SHA-512): ac9b491c6463776110ba19a5cd0fdeb9641b31d23142a1e0f4821d50e57fe5d656e3b42865f60b1422f97106d9450e8fe04ae9b128bd5bf040de39f0d88615c6
Type: fulltext
Mimetype: application/pdf

Authority records

Al-Shishtawy, Ahmad; Popov, Konstantin
