Change search
Link to record
Permanent link

Direct link
Girdzijauskas, SarunasORCID iD iconorcid.org/0000-0003-4516-7317
Alternative names
Publications (10 of 15) Show all publications
Komini, V. & Girdzijauskas, S. (2025). Integrating Logit Space Embeddings for Reliable Out-of-Distribution Detection. In: Lect. Notes Comput. Sci.: . Paper presented at Lecture Notes in Computer Science (pp. 255-269). Springer Science and Business Media Deutschland GmbH
Open this publication in new window or tab >>Integrating Logit Space Embeddings for Reliable Out-of-Distribution Detection
2025 (English)In: Lect. Notes Comput. Sci., Springer Science and Business Media Deutschland GmbH , 2025, p. 255-269Conference paper, Published paper (Refereed)
Abstract [en]

Deep learning (DL) models have significantly transformed machine learning (ML), particularly with their prowess in classification tasks. However, these models struggle to differentiate between in-distribution (ID) and out-of-distribution (OOD) data at the testing phase. This challenge has curtailed their deployment in sensitive fields like biotechnology, where misidentifying OOD data, such as unclear or unknown bacterial genomic sequences, as known ID classes could lead to dire consequences. To address this, we propose an approach to make DL models OOD-sensitive by exploiting the configuration of the logit space embeddings, into the model’s decision-making process. Leveraging the effect observed in recent studies that there is minimal overlap between the embeddings of ID and OOD data, we use a density estimator to model the ID logit distribution based on the training data. This allows us to reliably flag data that do not match the ID distribution as OOD. Our methodology is designed to be independent of the specific data or model architecture and can seamlessly augment existing trained models without the need to expose them to OOD data. Testing our method on widely recognized image datasets, we achieve leading-edge results, including a substantial 10% enhancement in the area under the receiver operating characteristic curve (AUCROC) on the Google genome dataset.

Place, publisher, year, edition, pages
Springer Science and Business Media Deutschland GmbH, 2025
Keywords
Adversarial machine learning, Classification tasks, Decision-making process, Density estimator, Embeddings, Genomic sequence, Learning models, Machine-learning, Space embedding, Testing phase, Training data
National Category
Computer Sciences Bioinformatics (Computational Biology) Probability Theory and Statistics
Identifiers
urn:nbn:se:ri:diva-79300 (URN)10.1007/978-3-031-82484-5_19 (DOI)2-s2.0-105000982628 (Scopus ID)
Conference
Lecture Notes in Computer Science
Note

Conference paper; Granskad

Available from: 2025-12-04 Created: 2025-12-04 Last updated: 2025-12-04Bibliographically approved
Isaksson, M., Zec, E. L., Cöster, R., Gillblad, D. & Girdzijauskas, S. (2023). Adaptive Expert Models for Federated Learning. In: Lecture Notes in Computer Science Volume 13448 Pages 1 - 16 2023: . Paper presented at 1st International Workshop on Trustworthy Federated Learning in Conjunction with International Joint Conference on AI, FL-IJCAI 2022. Vienna 23 July 2022 through 23 July 2022 (pp. 1-16). Char
Open this publication in new window or tab >>Adaptive Expert Models for Federated Learning
Show others...
2023 (English)In: Lecture Notes in Computer Science Volume 13448 Pages 1 - 16 2023, Char, 2023, p. 1-16Conference paper, Published paper (Refereed)
Abstract [en]

Federated Learning (FL) is a promising framework for distributed learning when data is private and sensitive. However, the state-of-the-art solutions in this framework are not optimal when data is heterogeneous and non-IID. We propose a practical and robust approach to personalization in FL that adjusts to heterogeneous and non-IID data by balancing exploration and exploitation of several global models. To achieve our aim of personalization, we use a Mixture of Experts (MoE) that learns to group clients that are similar to each other, while using the global models more efficiently. We show that our approach achieves an accuracy up to 29.78% better than the state-of-the-art and up to 4.38% better compared to a local model in a pathological non-IID setting, even though we tune our approach in the IID setting. © 2023, The Author(s)

Place, publisher, year, edition, pages
Char: , 2023
Keywords
Federated learning, Personalization, Privacy preserving, Artificial intelligence, Learning systems, Distributed learning, Expert modeling, Global models, Heterogeneous data, IID data, Personalizations, Robust approaches, State of the art, Privacy-preserving techniques
National Category
Computer Sciences
Identifiers
urn:nbn:se:ri:diva-64398 (URN)10.1007/978-3-031-28996-5_1 (DOI)2-s2.0-85152565856 (Scopus ID)9783031289958 (ISBN)
Conference
1st International Workshop on Trustworthy Federated Learning in Conjunction with International Joint Conference on AI, FL-IJCAI 2022. Vienna 23 July 2022 through 23 July 2022
Note

Funding details: Linköpings Universitet, LiU; Funding details: Knut och Alice Wallenbergs Stiftelse; Funding text 1: This work was partially supported by the Wallenberg AI, Autonomous Systems and Software Program (WASP) funded by the Knut and Alice Wallenberg Foundation. The computations were enabled by the supercomputing resource Berzelius provided by National Supercomputer Centre at Linköping University and the Knut and Alice Wallenberg foundation. Funding text 2: Acknowledgment. This work was partially supported by the Wallenberg AI, Autonomous Systems and Software Program (WASP) funded by the Knut and Alice Wallenberg Foundation.

Available from: 2023-05-08 Created: 2023-05-08 Last updated: 2025-09-23Bibliographically approved
Zec, E. L., Ekblom, E., Willbo, M., Mogren, O. & Girdzijauskas, S. (2022). Decentralized adaptive clustering of deep nets is beneficial for client collaboration. In: : . Paper presented at International Workshop on Trustworthy Federated Learning 2022.
Open this publication in new window or tab >>Decentralized adaptive clustering of deep nets is beneficial for client collaboration
Show others...
2022 (English)Conference paper, Published paper (Refereed)
Abstract [en]

We study the problem of training personalized deep learning models in a decentralized peer-to-peer setting, focusing on the setting where data distributions differ between the clients and where different clients have different local learning tasks. We study both covariate and label shift, and our contribution is an algorithm which for each client finds beneficial collaborations based on a similarity estimate for the local task. Our method does not rely on hyperparameters which are hard to estimate, such as the number of client clusters, but rather continuously adapts to the network topology using soft cluster assignment based on a novel adaptive gossip algorithm. We test the proposed method in various settings where data is not independent and identically distributed among the clients. The experimental evaluation shows that the proposed method performs better than previous state-of-the-art algorithms for this problem setting, and handles situations well where previous methods fail

National Category
Computer Sciences
Identifiers
urn:nbn:se:ri:diva-62529 (URN)
Conference
International Workshop on Trustworthy Federated Learning 2022
Available from: 2023-01-13 Created: 2023-01-13 Last updated: 2025-09-23Bibliographically approved
Soliman, A., Girdzijauskas, S., Bouguelia, M., Pashami, S. & Nowaczyk, S. (2020). Decentralized and Adaptive K-Means Clustering for Non-IID Data Using HyperLogLog Counters. In: 24th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2020(Lecture Notes in Computer Science book series (LNCS, volume 12084)): . Paper presented at 24th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2020 (pp. 343-355). Springer
Open this publication in new window or tab >>Decentralized and Adaptive K-Means Clustering for Non-IID Data Using HyperLogLog Counters
Show others...
2020 (English)In: 24th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2020(Lecture Notes in Computer Science book series (LNCS, volume 12084)), Springer , 2020, p. 343-355Conference paper, Published paper (Refereed)
Abstract [en]

The data shared over the Internet tends to originate from ubiquitous and autonomous sources such as mobile phones, fitness trackers, and IoT devices. Centralized and federated machine learning solutions represent the predominant way of providing smart services for users. However, moving data to central location for analysis causes not only many privacy concerns, but also communication overhead. Therefore, in certain situations machine learning models need to be trained in a collaborative and decentralized manner, similar to the way the data is originally generated without requiring any central authority for data or model aggregation. This paper presents a decentralized and adaptive k-means algorithm that clusters data from multiple sources organized in peer-to-peer networks. Our algorithm allows peers to reach an approximation of the global model without sharing any raw data. Most importantly, we address the challenge of decentralized clustering with skewed non-IID data and asynchronous computations by integrating HyperLogLog counters with k-means algorithm. Furthermore, our clustering algorithm allows nodes to individually determine the number of clusters that fits their local data. Results using synthetic and real-world datasets show that our algorithm outperforms state-of-the-art decentralized k-means algorithms achieving accuracy gain that is up-to 36%. 

Place, publisher, year, edition, pages
Springer, 2020
Keywords
Approximation algorithms, Data mining, Data Sharing, Machine learning, Peer to peer networks, Central locations, Communication overheads, Machine learning models, Model aggregations, Number of clusters, Privacy concerns, Real-world datasets, State of the art, K-means clustering
National Category
Natural Sciences
Identifiers
urn:nbn:se:ri:diva-45104 (URN)10.1007/978-3-030-47426-3_27 (DOI)2-s2.0-85085735657 (Scopus ID)9783030474256 (ISBN)
Conference
24th Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2020
Note

Funding text 1: This research has been conducted within the ?BIDAF: A Big Data Analytics Framework for a Smart Society? (http://bidaf.sics.se/) project funded by the Swedish Knowledge Foundation.

Available from: 2020-07-02 Created: 2020-07-02 Last updated: 2025-09-23Bibliographically approved
Pozzoli, S., Soliman, A., Bahri, L., Branca, R., Girdzijauskas, S. & Brambilla, M. (2020). Domain expertise–agnostic feature selection for the analysis of breast cancer data*. Artificial Intelligence in Medicine, 108, Article ID 101928.
Open this publication in new window or tab >>Domain expertise–agnostic feature selection for the analysis of breast cancer data*
Show others...
2020 (English)In: Artificial Intelligence in Medicine, ISSN 0933-3657, E-ISSN 1873-2860, Vol. 108, article id 101928Article in journal (Refereed) Published
Abstract [en]

Progress in proteomics has enabled biologists to accurately measure the amount of protein in a tumor. This work is based on a breast cancer data set, result of the proteomics analysis of a cohort of tumors carried out at Karolinska Institutet. While evidence suggests that an anomaly in the protein content is related to the cancerous nature of tumors, the proteins that could be markers of cancer types and subtypes and the underlying interactions are not completely known. This work sheds light on the potential of the application of unsupervised learning in the analysis of the aforementioned data sets, namely in the detection of distinctive proteins for the identification of the cancer subtypes, in the absence of domain expertise. In the analyzed data set, the number of samples, or tumors, is significantly lower than the number of features, or proteins; consequently, the input data can be thought of as high-dimensional data. The use of high-dimensional data has already become widespread, and a great deal of effort has been put into high-dimensional data analysis by means of feature selection, but it is still largely based on prior specialist knowledge, which in this case is not complete. There is a growing need for unsupervised feature selection, which raises the issue of how to generate promising subsets of features among all the possible combinations, as well as how to evaluate the quality of these subsets in the absence of specialist knowledge. We hereby propose a new wrapper method for the generation and evaluation of subsets of features via spectral clustering and modularity, respectively. We conduct experiments to test the effectiveness of the new method in the analysis of the breast cancer data, in a domain expertise–agnostic context. Furthermore, we show that we can successfully augment our method by incorporating an external source of data on known protein complexes. Our approach reveals a large number of subsets of features that are better at clustering the samples than the state-of-the-art classification in terms of modularity and shows a potential to be useful for future proteomics research.

Place, publisher, year, edition, pages
Elsevier B.V., 2020
Keywords
Breast cancer, Clustering, Clustering performance evaluation, Dimensionality reduction, Feature selection, Proteomics, Unsupervised learning, Clustering algorithms, Diseases, Feature extraction, Proteins, Set theory, Tumors, Breast cancer data, High dimensional data, High-dimensional data analysis, Number of samples, Protein complexes, Proteomics research, Spectral clustering, Unsupervised feature selection, Quality control
National Category
Natural Sciences
Identifiers
urn:nbn:se:ri:diva-45613 (URN)10.1016/j.artmed.2020.101928 (DOI)2-s2.0-85088878526 (Scopus ID)
Note

Funding text 1: The research project has partially received funding under the Marie Sk?odowska-Curie Actions (MCSA) funded project Real-Time Analytics for Internet of Sports (RAIS) (Grant Agreement No. 813162). The research was also partially funded under ?AI for Proteomics? initiative by RISE Research Institutes of Sweden.

Available from: 2020-08-14 Created: 2020-08-14 Last updated: 2025-09-23Bibliographically approved
Holst, A., Bouguelia, M.-R. -., Görnerup, O., Pashami, S., Al-Shishtawy, A., Falkman, G., . . . Soliman, A. (2019). Eliciting structure in data. In: CEUR Workshop Proceedings: . Paper presented at 2019 Joint ACM IUI Workshops, ACMIUI-WS 2019, 20 March 2019.
Open this publication in new window or tab >>Eliciting structure in data
Show others...
2019 (English)In: CEUR Workshop Proceedings, 2019Conference paper, Published paper (Refereed)
Abstract [en]

This paper demonstrates how to explore and visualize different types of structure in data, including clusters, anomalies, causal relations, and higher order relations. The methods are developed with the goal of being as automatic as possible and applicable to massive, streaming, and distributed data. Finally, a decentralized learning scheme is discussed, enabling finding structure in the data without collecting the data centrally. © 2019 Copyright held for the individual papers by the papers’ authors. Copying permitted for private and academic purposes. This volume is published and copyrighted by its editors.

Keywords
Anomaly detection, Causal inference, Clustering, Distributed analytics, Higher-order structure, Information visualization, Information systems, User interfaces, Causal inferences, Data acquisition
National Category
Natural Sciences
Identifiers
urn:nbn:se:ri:diva-38261 (URN)2-s2.0-85063227224 (Scopus ID)
Conference
2019 Joint ACM IUI Workshops, ACMIUI-WS 2019, 20 March 2019
Available from: 2019-04-02 Created: 2019-04-02 Last updated: 2025-09-23Bibliographically approved
Abbas, Z., Al-Shishtawy, A., Girdzijauskas, S. & Vlassov, V. (2018). Short-Term Traffic Prediction Using Long Short-Term Memory Neural Networks. In: Proceedings - 2018 IEEE International Congress on Big Data, BigData Congress 2018 - Part of the 2018 IEEE World Congress on Services: . Paper presented at 7th IEEE International Congress on Big Data, BigData Congress 2018, 2 July 2018 through 7 July 2018 (pp. 57-65). Institute of Electrical and Electronics Engineers Inc.
Open this publication in new window or tab >>Short-Term Traffic Prediction Using Long Short-Term Memory Neural Networks
2018 (English)In: Proceedings - 2018 IEEE International Congress on Big Data, BigData Congress 2018 - Part of the 2018 IEEE World Congress on Services, Institute of Electrical and Electronics Engineers Inc. , 2018, p. 57-65Conference paper, Published paper (Refereed)
Abstract [en]

Short-term traffic prediction allows Intelligent Transport Systems to proactively respond to events before they happen. With the rapid increase in the amount, quality, and detail of traffic data, new techniques are required that can exploit the information in the data in order to provide better results while being able to scale and cope with increasing amounts of data and growing cities. We propose and compare three models for short-term road traffic density prediction based on Long Short-Term Memory (LSTM) neural networks. We have trained the models using real traffic data collected by Motorway Control System in Stockholm that monitors highways and collects flow and speed data per lane every minute from radar sensors. In order to deal with the challenge of scale and to improve prediction accuracy, we propose to partition the road network into road stretches and junctions, and to model each of the partitions with one or more LSTM neural networks. Our evaluation results show that partitioning of roads improves the prediction accuracy by reducing the root mean square error by the factor of 5. We show that we can reduce the complexity of LSTM network by limiting the number of input sensors, on average to 35% of the original number, without compromising the prediction accuracy. .

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers Inc., 2018
Keywords
LSTM, neural networks, traffic prediction, Big data, Brain, Forecasting, Intelligent systems, Mean square error, Roads and streets, Traffic control, Evaluation results, Input sensor, Intelligent transport systems, Prediction accuracy, Radar sensors, Root mean square errors, Long short-term memory
National Category
Electrical Engineering, Electronic Engineering, Information Engineering
Identifiers
urn:nbn:se:ri:diva-37285 (URN)10.1109/BigDataCongress.2018.00015 (DOI)2-s2.0-85054887478 (Scopus ID)9781538672327 (ISBN)
Conference
7th IEEE International Congress on Big Data, BigData Congress 2018, 2 July 2018 through 7 July 2018
Note

Funding details: Fellowships Fund Incorporated, FFI; Funding details: VINNOVA; Funding details: 20140221; Funding details: European Commission, EC, FPA 2012-0030; Funding details: Education, Audiovisual and Culture Executive Agency, EACEA; Funding details: Directorate-General for Research and Innovation, 2015-00677; Funding text 1: ACKNOWLEDGMENT This work was supported by the project BADA: Big Automotive Data Analytics in the funding program FFI: Strategic Vehicle Research and Innovation (grant 2015-00677) administrated by VINNOVA the Swedish government agency for innovation systems, by the project BIDAF: Big Data Analytics Framework for a Smart Society (grant 20140221) funded by KKS the Swedish Knowledge Foundation, and by the Erasmus Mundus Joint Doctorate in Distributed Computing (EMJD-DC) programme funded by the Education, Audiovisual and Culture Executive Agency (EACEA) of the European Commission under FPA 2012-0030.

Available from: 2019-01-18 Created: 2019-01-18 Last updated: 2025-09-23Bibliographically approved
Ghoorchian, K., Girdzijauskas, S. & Rahimian, F. (2017). DeGPar: Large Scale Topic Detection Using Node-Cut Partitioning on Dense Weighted Graphs. In: Proceedings - International Conference on Distributed Computing Systems: . Paper presented at 37th IEEE International Conference on Distributed Computing Systems, ICDCS 2017, 5 June 2017 through 8 June 2017 (pp. 775-785).
Open this publication in new window or tab >>DeGPar: Large Scale Topic Detection Using Node-Cut Partitioning on Dense Weighted Graphs
2017 (English)In: Proceedings - International Conference on Distributed Computing Systems, 2017, p. 775-785Conference paper, Published paper (Refereed)
Abstract [en]

Topic Detection (TD) refers to automatic techniques for locating topically related material in web documents. Nowadays, massive amounts of documents are generated by users of Online Social Networks (OSNs), in form of very short text, tweets and snippets of news. While topic detection, in its traditional form, is applied to a few documents containing a lot of information, the problem has now changed to dealing with massive number of documents with very little information. The traditional solutions, thus, fall short either in scalability (due to huge number of input items) or sparsity (due to insufficient information per input item). In this paper we address the scalability problem by introducing an efficient and scalable graph based algorithm for TD on short texts, leveraging dimensionality reduction and clustering techniques. We first, compress the input set of documents into a dense graph, such that frequent cooccurrence patterns in the documents create multiple dense topological areas in the graph. Then, we partition the graph into multiple dense sub-graphs, each representing a topic. We compare the accuracy and scalability of our solution with two state-of-the-art solutions (including the standard LDA, and BiTerm). The results on two widely used benchmark datasets show that our algorithm not only maintains a similar or better accuracy, but also performs by an order of magnitude faster than the state-of-the-art approaches.

Keywords
Dense Weighted Graph Partitioning, Dimensionality Reduction, Distributed Algorithms, Node-cut Graph Partitioning, Online Social Networks, Random Indexing, Topic Detection, Clustering algorithms, Distributed computer systems, Graphic methods, Parallel algorithms, Scalability, Scales (weighing instruments), Social networking (online), Topology, Graph Partitioning, On-line social networks, Weighted graph, Graph theory
National Category
Natural Sciences
Identifiers
urn:nbn:se:ri:diva-30836 (URN)10.1109/ICDCS.2017.19 (DOI)2-s2.0-85027258993 (Scopus ID)9781538617915 (ISBN)
Conference
37th IEEE International Conference on Distributed Computing Systems, ICDCS 2017, 5 June 2017 through 8 June 2017
Available from: 2017-09-07 Created: 2017-09-07 Last updated: 2025-09-23Bibliographically approved
Rahimian, F., Girdzijauskas, S. & Haridi, S. (2014). Parallel Community Detection For Cross-Document Coreference (6ed.). Kista, Sweden: Swedish Institute of Computer Science
Open this publication in new window or tab >>Parallel Community Detection For Cross-Document Coreference
2014 (English)Report (Other academic)
Abstract [en]

This document presents a highly parallel solution for cross-document coreference resolution, which can deal with billions of documents that exist in the current web. At the core of our solution lies a novel algorithm for community detection in large scale graphs. We operate on graphs which we construct by representing documents' keywords as nodes and the co-location of those keywords in a document as edges. We then exploit the particular nature of such graphs where coreferent words are topologically clustered and can be efficiently discovered by our community detection algorithm. The accuracy of our technique is considerably higher than that of the state of the art, while the convergence time is by far shorter. In particular, we increase the accuracy for a baseline dataset by more than 15\% compared to the best reported result so far. Moreover, we outperform the best reported result for a dataset provided for the Word Sense Induction task in SemEval 2010.

Place, publisher, year, edition, pages
Kista, Sweden: Swedish Institute of Computer Science, 2014 Edition: 6
Series
SICS Technical Report, ISSN 1100-3154 ; 2014:01
National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:ri:diva-24302 (URN)
Available from: 2016-10-31 Created: 2016-10-31 Last updated: 2025-09-23Bibliographically approved
Rahimian, F., Payberah, A., Girdzijauskas, S., Jelasity, M. & Haridi, S. (2013). Ja-be-Ja: A Distributed Algorithm for Balanced Graph Partitioning (7ed.). Kista, Sweden: Swedish Institute of Computer Science
Open this publication in new window or tab >>Ja-be-Ja: A Distributed Algorithm for Balanced Graph Partitioning
Show others...
2013 (English)Report (Other academic)
Abstract [en]

Balanced graph partitioning is a well known NP-complete problem with a wide range of applications. These applications include many large-scale distributed problems such as the optimal storage of large sets of graph-structured data over several hosts, or identifying clusters in on-line social networks. In such very large-scale distributed scenarios, state-of-the-art algorithms are not directly applicable, because they typically involve frequent global operations over the entire graph. In this paper, we propose a distributed graph partitioning algorithm, called Ja-be-Ja1. The algorithm is massively parallel: each graph node is processed independently, and only the direct neighbors of the node, and a small subset of random nodes in the graph need to be known. Strict synchronization is not required. These features allow Ja-be-Ja to be easily adapted to any distributed graph-processing system from data centers to fully distributed networks. We perform a thorough experimental analysis, which shows that the minimal edge-cut value achieved by Ja-be-Ja is comparable to state-of-the-art centralized algorithms such as Metis. In particular, on large social networks Ja-be-Ja outperforms Metis.

Place, publisher, year, edition, pages
Kista, Sweden: Swedish Institute of Computer Science, 2013 Edition: 7
Series
SICS Technical Report, ISSN 1100-3154 ; 2013:03
Keywords
graph partitioning, distributed algorithm, load balancing
National Category
Computer and Information Sciences
Identifiers
urn:nbn:se:ri:diva-24165 (URN)
Available from: 2016-10-31 Created: 2016-10-31 Last updated: 2025-09-23Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0003-4516-7317

Search in DiVA

Show all publications