Endre søk
Link to record
Permanent link

Direct link
Publikasjoner (10 av 46) Visa alla publikasjoner
McAllister, T. & Gambäck, B. (2022). Music Style Transfer Using Constant-Q Transform Spectrograms. In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)Volume 13221 LNCS, Pages 195 - 2112022: . Paper presented at 11th International Conference on Artificial Intelligence in Music, Sound, Art and Design, EvoMUSART 2022, held as Part of EvoStar 2022Madrid20 April 2022 through 22 April 2022 (pp. 195-211). Springer Science and Business Media Deutschland GmbH
Åpne denne publikasjonen i ny fane eller vindu >>Music Style Transfer Using Constant-Q Transform Spectrograms
2022 (engelsk)Inngår i: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)Volume 13221 LNCS, Pages 195 - 2112022, Springer Science and Business Media Deutschland GmbH , 2022, s. 195-211Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

Previous work on music generation and transformation has commonly targeted single instrument or single melody music. Here, in contrast, five music genres are used with the goal to achieve selective remixing by using domain transfer methods on spectrogram images of music. A pipeline architecture comprised of two independent generative adversarial network models was created. The first applies features from one of the genres to constant-Q transform spectrogram images to perform style transfer. The second network turns a spectrogram into a real-value tensor representation which is approximately reconstructed back into audio. The system was evaluated experimentally and through a survey. Due to the increased complexity involved in processing high sample rate music with homophonic or polyphonic audio textures, the system’s audio output was considered to be low quality, but the style transfer produced noticeable selective remixing on most of the music tracks evaluated. © 2022, The Author(s),

sted, utgiver, år, opplag, sider
Springer Science and Business Media Deutschland GmbH, 2022
Emneord
Audio acoustics, Generative adversarial networks, Music, Quality control, Textures, Audio textures, Domain transfers, Music genre, Network models, Pipeline architecture, Real values, Sample rate, Spectrograms, Tensor representation, Transfer method, Spectrographs
HSV kategori
Identifikatorer
urn:nbn:se:ri:diva-59247 (URN)10.1007/978-3-031-03789-4_13 (DOI)2-s2.0-85128973918 (Scopus ID)9783031037887 (ISBN)
Konferanse
11th International Conference on Artificial Intelligence in Music, Sound, Art and Design, EvoMUSART 2022, held as Part of EvoStar 2022Madrid20 April 2022 through 22 April 2022
Tilgjengelig fra: 2022-06-13 Laget: 2022-06-13 Sist oppdatert: 2022-06-13bibliografisk kontrollert
Ekern, E. & Gambäck, B. (2021). Interactive, Efficient and Creative Image Generation Using Compositional Pattern-Producing Networks. In: Artificial Intelligence in Music, Sound, Art and Design. EvoMUSART 2021. Lecture Notes in Computer Science, vol 12693.: . Paper presented at Artificial Intelligence in Music, Sound, Art and Design. EvoMUSART 2021. (pp. 131-146). Springer Science and Business Media Deutschland GmbH, 12693
Åpne denne publikasjonen i ny fane eller vindu >>Interactive, Efficient and Creative Image Generation Using Compositional Pattern-Producing Networks
2021 (engelsk)Inngår i: Artificial Intelligence in Music, Sound, Art and Design. EvoMUSART 2021. Lecture Notes in Computer Science, vol 12693., Springer Science and Business Media Deutschland GmbH , 2021, Vol. 12693, s. 131-146Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

In contrast to most recent models that generate an entire image at once, the paper introduces a new architecture for generating images one pixel at a time using a Compositional Pattern-Producing Network (CPPN) as the generator part in a Generative Adversarial Network (GAN), allowing for effective generation of visually interesting images with artistic value, at arbitrary resolutions independent of the dimensions of the training data. The architecture, as well as accompanying (hyper-) parameters, for training CPPNs using recent GAN stabilisation techniques is shown to generalise well across many standard datasets. Rather than relying on just a latent noise vector (entangling various features with each other), mutual information maximisation is utilised to get disentangled representations, removing the requirement to use labelled data and giving the user control over the generated images. A web application for interacting with pre-trained models was also created, unique in the offered level of interactivity with an image-generating GAN.

sted, utgiver, år, opplag, sider
Springer Science and Business Media Deutschland GmbH, 2021
Emneord
Compositional pattern-producing networks, Generative adversarial networks, Image generation, Data visualization, Network architecture, Adversarial networks, Artistic value, Image generations, Mutual informations, Noise vectors, Training data, WEB application, Artificial intelligence
HSV kategori
Identifikatorer
urn:nbn:se:ri:diva-53516 (URN)10.1007/978-3-030-72914-1_9 (DOI)2-s2.0-85107436289 (Scopus ID)9783030729134 (ISBN)
Konferanse
Artificial Intelligence in Music, Sound, Art and Design. EvoMUSART 2021.
Tilgjengelig fra: 2021-06-17 Laget: 2021-06-17 Sist oppdatert: 2021-06-17bibliografisk kontrollert
Jamatia, A., Swamy, S., Gambäck, B., Das, A. & Debbarma, S. (2020). Deep Learning Based Sentiment Analysis in a Code-Mixed English-Hindi and English-Bengali Social Media Corpus. International journal on artificial intelligence tools, 29(5), Article ID 2050014.
Åpne denne publikasjonen i ny fane eller vindu >>Deep Learning Based Sentiment Analysis in a Code-Mixed English-Hindi and English-Bengali Social Media Corpus
Vise andre…
2020 (engelsk)Inngår i: International journal on artificial intelligence tools, ISSN 0218-2130, Vol. 29, nr 5, artikkel-id 2050014Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

Sentiment analysis is a circumstantial analysis of text, identifying the social sentiment to better understand the source material. The article addresses sentiment analysis of an English-Hindi and English-Bengali code-mixed textual corpus collected from social media. Code-mixing is an amalgamation of multiple languages, which previously mainly was associated with spoken language. However, social media users also deploy it to communicate in ways that tend to be somewhat casual. The coarse nature of social media text poses challenges for many language processing applications. Here, the focus is on the low predictive nature of traditional machine learners when compared to Deep Learning counterparts, including the contextual language representation model BERT (Bidirectional Encoder Representations from Transformers), on the task of extracting user sentiment from code-mixed texts. Three deep learners (a BiLSTM CNN, a Double BiLSTM and an Attention-based model) attained accuracy 20-60% greater than traditional approaches on code-mixed data, and were for comparison also tested on monolingual English data.

sted, utgiver, år, opplag, sider
World Scientific, 2020
Emneord
Code-switching, convolutional neural networks, recurrent neural networks, Codes (symbols), Learning systems, Metals, Sentiment analysis, Social networking (online), Language processing, Machine learners, Multiple languages, Representation model, Social media, Source material, Spoken languages, Traditional approaches, Deep learning
HSV kategori
Identifikatorer
urn:nbn:se:ri:diva-50436 (URN)10.1142/S0218213020500141 (DOI)2-s2.0-85092430312 (Scopus ID)
Tilgjengelig fra: 2020-11-09 Laget: 2020-11-09 Sist oppdatert: 2020-12-01bibliografisk kontrollert
Swamy, S. D., Jamatia, A. & Gambäck, B. (2019). Studying generalisability across abusive language detection datasets. In: CoNLL 2019 - 23rd Conference on Computational Natural Language Learning, Proceedings of the Conference: . Paper presented at 23rd Conference on Computational Natural Language Learning, CoNLL 2019, 3 November 2019 through 4 November 2019 (pp. 940-950). Association for Computational Linguistics
Åpne denne publikasjonen i ny fane eller vindu >>Studying generalisability across abusive language detection datasets
2019 (engelsk)Inngår i: CoNLL 2019 - 23rd Conference on Computational Natural Language Learning, Proceedings of the Conference, Association for Computational Linguistics , 2019, s. 940-950Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

Work on Abusive Language Detection has tackled a wide range of subtasks and domains. As a result of this, there exists a great deal of redundancy and non-generalisability between datasets. Through experiments on cross-dataset training and testing, the paper reveals that the preconceived notion of including more non-abusive samples in a dataset (to emulate reality) may have a detrimental effect on the generalisability of a model trained on that data. Hence a hierarchical annotation model is utilised here to reveal redundancies in existing datasets and to help reduce redundancy in future efforts.

sted, utgiver, år, opplag, sider
Association for Computational Linguistics, 2019
Emneord
Statistical tests, Language detection, Subtasks, Training and testing, Redundancy
HSV kategori
Identifikatorer
urn:nbn:se:ri:diva-45025 (URN)2-s2.0-85084333327 (Scopus ID)9781950737727 (ISBN)
Konferanse
23rd Conference on Computational Natural Language Learning, CoNLL 2019, 3 November 2019 through 4 November 2019
Tilgjengelig fra: 2020-05-25 Laget: 2020-05-25 Sist oppdatert: 2020-05-28bibliografisk kontrollert
Kumar, U., Reganti, A. N., Maheshwari, T., Chakroborty, T., Gambäck, B. & Das, A. (2018). Inducing Personalities and Values from Language Use in Social Network Communities. Information Systems Frontiers, 20(6), 1219-1240
Åpne denne publikasjonen i ny fane eller vindu >>Inducing Personalities and Values from Language Use in Social Network Communities
Vise andre…
2018 (engelsk)Inngår i: Information Systems Frontiers, ISSN 1387-3326, E-ISSN 1572-9419, Vol. 20, nr 6, s. 1219-1240Artikkel i tidsskrift (Fagfellevurdert) Published
Abstract [en]

A community in social networks is generally assumed to be composed of a group of individuals with similar characteristics. Although there has been a plethora of work on understanding network topologies (edge density, clustering coefficient, etc.) within an online community, the psycho-sociological compositions of social network communities have hardly been studied. The present paper aims to analyse the communities as composition of induced psycholinguistic and sociolinguistic variables (Personalities, Values and Ethics) across individuals in social media networks. The motivation behind this analysis is to understand the behavioural characteristics at individual as well as societal level in social networks. To this end, three studies were carried out on six different datasets: three Twitter corpora, two Facebook corpora, and an Essay corpus, annotated with Values and Ethics of the users. First, experiments on creating automatic models to determine the Personality and Values of individuals by analysing their language usage and social media behaviour. Second, experiments on understanding the characteristics or blend of characteristics of individuals within an online community. Finally, generation of a map of values and ethics for India, a multi-lingual and multi-cultural country. Striking similarities to general intuitive perception could be observed, i.e., the results obtained in the study resemble our general perception about the cities/towns of India. 

sted, utgiver, år, opplag, sider
Springer New York LLC, 2018
Emneord
Community, Ethics, Personality, Social network, Values, Linguistics, Online systems, Philosophical aspects, Clustering coefficient, Network communities, On-line communities, Social media networks, Social networking (online)
HSV kategori
Identifikatorer
urn:nbn:se:ri:diva-37955 (URN)10.1007/s10796-017-9793-8 (DOI)2-s2.0-85029010113 (Scopus ID)
Tilgjengelig fra: 2019-04-23 Laget: 2019-04-23 Sist oppdatert: 2019-05-03bibliografisk kontrollert
Maheshwari, T., Reganti, A. N., Gupta, S., Jamatia, A., Kumar, U., Gambäck, B. & Das, A. (2017). A societal sentiment analysis: Predicting the values and ethics of individuals by analysing social media content. In: 15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017 - Proceedings of Conference: . Paper presented at 15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017, 3 April 2017 through 7 April 2017 (pp. 731-741).
Åpne denne publikasjonen i ny fane eller vindu >>A societal sentiment analysis: Predicting the values and ethics of individuals by analysing social media content
Vise andre…
2017 (engelsk)Inngår i: 15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017 - Proceedings of Conference, 2017, s. 731-741Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

To find out how users' social media behaviour and language are related to their ethical practices, the paper investigates applying Schwartz' psycholinguistic model of societal sentiment to social media text. The analysis is based on corpora collected from user essays as well as social media (Facebook and Twitter). Several experiments were carried out on the corpora to classify the ethical values of users, incorporating Linguistic Inquiry Word Count analysis, n-grams, topic models, psycholinguistic lexica, speech-acts, and nonlinguistic information, while applying a range of machine learners (Support Vector Machines, Logistic Regression, and Random Forests) to identify the best linguistic and non-linguistic features for automatic classification of values and ethics.

Emneord
Classification (of information), Computational linguistics, Decision trees, Linguistics, Philosophical aspects, Automatic classification, Ethical practices, Ethical values, Linguistic features, Logistic regressions, Machine learners, Psycholinguistic models, Sentiment analysis, Social networking (online)
HSV kategori
Identifikatorer
urn:nbn:se:ri:diva-31136 (URN)10.18653/v1/e17-1069 (DOI)2-s2.0-85021625321 (Scopus ID)9781510838604 (ISBN)
Konferanse
15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017, 3 April 2017 through 7 April 2017
Tilgjengelig fra: 2017-08-28 Laget: 2017-08-28 Sist oppdatert: 2019-08-14bibliografisk kontrollert
Gambäck, B., Olsson, F. & Täckström, O. (2011). Active Learning for Dialogue Act Classification (9ed.). In: : . Paper presented at INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association.
Åpne denne publikasjonen i ny fane eller vindu >>Active Learning for Dialogue Act Classification
2011 (engelsk)Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

Active learning techniques were employed for classification of dialogue acts over two dialogue corpora, the English human-human Switchboard corpus and the Spanish human-machine Dihana corpus. It is shown clearly that active learning improves on a baseline obtained through a passive learning approach to tagging the same data sets. An error reduction of 7% was obtained on Switchboard, while a factor 5 reduction in the amount of labeled data needed for classification was achieved on Dihana. The passive Support Vector Machine learner used as baseline in itself significantly improves the state of the art in dialogue act classification on both corpora. On Switchboard it gives a 31% error reduction compared to the previously best reported result.

HSV kategori
Identifikatorer
urn:nbn:se:ri:diva-23877 (URN)
Konferanse
INTERSPEECH 2011, 12th Annual Conference of the International Speech Communication Association
Prosjekter
COMPANIONS
Tilgjengelig fra: 2016-10-31 Laget: 2016-10-31 Sist oppdatert: 2020-12-01bibliografisk kontrollert
Wilks, Y., Gambäck, B. & Danieli, M. (Eds.). (2010). Workshop on Companionable Dialogue Systems (6ed.). Uppsala, Sweden: ACL
Åpne denne publikasjonen i ny fane eller vindu >>Workshop on Companionable Dialogue Systems
2010 (engelsk)Collection/Antologi (Fagfellevurdert)
sted, utgiver, år, opplag, sider
Uppsala, Sweden: ACL, 2010. s. 59 Opplag: 6
HSV kategori
Identifikatorer
urn:nbn:se:ri:diva-23896 (URN)978-1-932432-81-7 (ISBN)
Tilgjengelig fra: 2016-10-31 Laget: 2016-10-31 Sist oppdatert: 2020-12-01bibliografisk kontrollert
Ståhl, O., Gambäck, B., Turunen, M. & Hakulinen, J. (2009). A Mobile Health and Fitness Companion Demonstrator (11ed.). In: Proceedings of the Demonstrations Session at EACL 2009: . Paper presented at 12th Conference of the European Chapter of the Association for Computational Linguistics, ACL (pp. 65-68).
Åpne denne publikasjonen i ny fane eller vindu >>A Mobile Health and Fitness Companion Demonstrator
2009 (engelsk)Inngår i: Proceedings of the Demonstrations Session at EACL 2009, 2009, 11, s. 65-68Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

Multimodal conversational spoken dialogues using physical and virtual agents provide a potential interface to motivate and support users in the domain of health and fitness. The paper presents a multimodal conversational Companion system focused on health and fitness, which has both a stationary and a mobile component.

HSV kategori
Identifikatorer
urn:nbn:se:ri:diva-23676 (URN)
Konferanse
12th Conference of the European Chapter of the Association for Computational Linguistics, ACL
Prosjekter
COMPANIONS
Tilgjengelig fra: 2016-10-31 Laget: 2016-10-31 Sist oppdatert: 2023-12-04bibliografisk kontrollert
Gambäck, B., Olsson, F., Argaw, A. A. & Asker, L. (2009). Methods for Amharic part-of-speech tagging (1ed.). In: Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics: . Paper presented at First Workshop on Language Technologies for African Languages, March 2009, Athens, Greece.
Åpne denne publikasjonen i ny fane eller vindu >>Methods for Amharic part-of-speech tagging
2009 (engelsk)Inngår i: Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics, 2009, 1, , s. 8Konferansepaper, Publicerat paper (Fagfellevurdert)
Abstract [en]

The paper describes a set of experiments involving the application of three state-of- the-art part-of-speech taggers to Ethiopian Amharic, using three different tagsets. The taggers showed worse performance than previously reported results for Eng- lish, in particular having problems with unknown words. The best results were obtained using a Maximum Entropy ap- proach, while HMM-based and SVM- based taggers got comparable results.

Publisher
s. 8
Emneord
part-of-speech tagging, amharic, machine learning
HSV kategori
Identifikatorer
urn:nbn:se:ri:diva-23519 (URN)
Konferanse
First Workshop on Language Technologies for African Languages, March 2009, Athens, Greece
Tilgjengelig fra: 2016-10-31 Laget: 2016-10-31 Sist oppdatert: 2020-12-01bibliografisk kontrollert
Organisasjoner
Identifikatorer
ORCID-id: ORCID iD iconorcid.org/0000-0002-5252-707x
v. 2.45.0