Driftmeddelande
För närvarande är det driftstörningar. Felsökning pågår.
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Low-Resource Techniques for Analysing the Rhetorical Structure of Swedish Historical Petitions
Uppsala University, Sweden.
Uppsala University, Sweden.
RISE Research Institutes of Sweden, Digitala system, Datavetenskap. Uppsala University, Sweden.ORCID-id: 0000-0002-7873-3971
2023 (Engelska)Ingår i: RESOURCEFUL 2023 - Workshop on Resources and Representations for Under-Resourced Languages and Domains, Proceedings of the 2nd, Association for Computational Linguistics , 2023, s. 132-139Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

Natural language processing techniques can be valuable for improving and facilitating historical research. This is also true for the analysis of petitions, a source which has been relatively little used in historical research. However, limited data resources pose challenges for mainstream natural language processing approaches based on machine learning. In this paper, we explore methods for automatically segmenting petitions according to their rhetorical structure. We find that the use of rules, word embeddings, and especially keywords can give promising results for this task.

Ort, förlag, år, upplaga, sidor
Association for Computational Linguistics , 2023. s. 132-139
Nyckelord [en]
Computational linguistics; Image segmentation; Natural language processing systems; Text processing; Data resources; Historical research; Language processing; Language processing techniques; Limited data; Natural languages; On-machines; Processing approach; Rhetorical structure; Swedishs; Learning algorithms
Nationell ämneskategori
Data- och informationsvetenskap
Identifikatorer
URN: urn:nbn:se:ri:diva-67997Scopus ID: 2-s2.0-85175852005OAI: oai:DiVA.org:ri-67997DiVA, id: diva2:1814311
Konferens
2nd Workshop on Resources and Representations for Under-Resourced Languages and Domains, RESOURCEFUL 2023. Torshavn, Denmark. 22 May 2023
Anmärkning

The research reported in this paper was supported by a grant from the Swedish Research Council (grant number 2018-06159).

Tillgänglig från: 2023-11-24 Skapad: 2023-11-24 Senast uppdaterad: 2025-09-23Bibliografiskt granskad

Open Access i DiVA

Fulltext saknas i DiVA

Scopus

Person

Nivre, Joakim

Sök vidare i DiVA

Av författaren/redaktören
Nivre, Joakim
Av organisationen
Datavetenskap
Data- och informationsvetenskap

Sök vidare utanför DiVA

GoogleGoogle Scholar

urn-nbn

Altmetricpoäng

urn-nbn
Totalt: 175 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf