Active learning and visual analytics for stance classification with ALVA
2017 (English)In: ACM Transactions on Interactive Intelligent Systems, ISSN 2160-6455, E-ISSN 2160-6463, Vol. 7, no 3, article id 14Article in journal (Refereed) Published
Abstract [en]
The automatic detection and classification of stance (e.g., certainty or agreement) in text data using natural language processing and machine-learning methods creates an opportunity to gain insight into the speakers' attitudes toward their own and other people's utterances. However, identifying stance in text presents many challenges related to training data collection and classifier training. To facilitate the entire process of training a stance classifier, we propose a visual analytics approach, called ALVA, for text data annotation and visualization. ALVA's interplay with the stance classifier follows an active learning strategy to select suitable candidate utterances for manual annotaion. Our approach supports annotation process management and provides the annotators with a clean user interface for labeling utterances with multiple stance categories. ALVA also contains a visualization method to help analysts of the annotation and training process gain a better understanding of the categories used by the annotators. The visualization uses a novel visual representation, called CatCombos, which groups individual annotation items by the combination of stance categories. Additionally, our system makes a visualization of a vector space model available that is itself based on utterances. ALVA is already being used by our domain experts in linguistics and computational linguistics to improve the understanding of stance phenomena and to build a st ance classifier for applications such as social media monitoring.
Place, publisher, year, edition, pages
Association for Computing Machinery , 2017. Vol. 7, no 3, article id 14
Keywords [en]
Artificial intelligence, Classifiers, Data visualization, Learning algorithms, Learning systems, Linguistics, Natural language processing systems, Text processing, User interfaces, Vector spaces, Visualization, Active learning strategies, Automatic Detection, Classifier training, Machine learning methods, Social media monitoring, Vector space models, Visual representations, Visualization method, Classification (of information)
National Category
Computer Systems
Identifiers
URN: urn:nbn:se:ri:diva-38055DOI: 10.1145/3132169Scopus ID: 2-s2.0-85032958347OAI: oai:DiVA.org:ri-38055DiVA, id: diva2:1296902
2019-03-182019-03-182023-09-20Bibliographically approved