Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Studying generalisability across abusive language detection datasets
National Institute of Technology, India.
National Institute of Technology, India.
RISE - Research Institutes of Sweden (2017-2019), ICT, SICS. NTNU Norwegian University of Science and Technology, Norway.ORCID iD: 0000-0002-5252-707x
2019 (English)In: CoNLL 2019 - 23rd Conference on Computational Natural Language Learning, Proceedings of the Conference, Association for Computational Linguistics , 2019, p. 940-950Conference paper, Published paper (Refereed)
Abstract [en]

Work on Abusive Language Detection has tackled a wide range of subtasks and domains. As a result of this, there exists a great deal of redundancy and non-generalisability between datasets. Through experiments on cross-dataset training and testing, the paper reveals that the preconceived notion of including more non-abusive samples in a dataset (to emulate reality) may have a detrimental effect on the generalisability of a model trained on that data. Hence a hierarchical annotation model is utilised here to reveal redundancies in existing datasets and to help reduce redundancy in future efforts.

Place, publisher, year, edition, pages
Association for Computational Linguistics , 2019. p. 940-950
Keywords [en]
Statistical tests, Language detection, Subtasks, Training and testing, Redundancy
National Category
Engineering and Technology
Identifiers
URN: urn:nbn:se:ri:diva-45025Scopus ID: 2-s2.0-85084333327ISBN: 9781950737727 (print)OAI: oai:DiVA.org:ri-45025DiVA, id: diva2:1431794
Conference
23rd Conference on Computational Natural Language Learning, CoNLL 2019, 3 November 2019 through 4 November 2019
Available from: 2020-05-25 Created: 2020-05-25 Last updated: 2020-05-28Bibliographically approved

Open Access in DiVA

No full text in DiVA

Scopus

Authority records

Gambäck, Björn

Search in DiVA

By author/editor
Gambäck, Björn
By organisation
SICS
Engineering and Technology

Search outside of DiVA

GoogleGoogle Scholar

isbn
urn-nbn

Altmetric score

isbn
urn-nbn
Total: 28 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf