Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Creating Bilingual Lexica Using Reference Wordlists for Alignment of Monolingual Semantic Vector Spaces
RISE - Research Institutes of Sweden, ICT, SICS.ORCID iD: 0000-0001-5100-0535
RISE, Swedish ICT, SICS.ORCID iD: 0000-0003-4042-4919
2005 (English)Conference paper, Published paper (Refereed)
Abstract [en]

This paper proposes a novel method for automatically acquiring multi-lingual lexica from non-parallel data and reports some initial experiments to prove the viability of the approach. Using established techniques for building mono-lingual vector spaces two independent semantic vector spaces are built from textual data. These vector spaces are related to each other using a small {\em reference word list} of manually chosen reference points taken from available bi-lingual dictionaries. Other words can then be related to these reference points first in the one language and then in the other. In the present experiments, we apply the proposed method to comparable but non-parallel English-German data. The resulting bi-lingual lexicon is evaluated using an online English-German lexicon as gold standard. The results clearly demonstrate the viability of the proposed methodology.

Place, publisher, year, edition, pages
2005, 1.
National Category
Computer and Information Sciences
Identifiers
URN: urn:nbn:se:ri:diva-20962OAI: oai:DiVA.org:ri-20962DiVA, id: diva2:1040996
Conference
15th Nordic Conference of Computational Linguistics
Available from: 2016-10-31 Created: 2016-10-31 Last updated: 2018-08-21Bibliographically approved

Open Access in DiVA

No full text in DiVA

Authority records BETA

Sahlgren, Magnus

Search in DiVA

By author/editor
Sahlgren, MagnusKarlgren, Jussi
By organisation
SICSSICS
Computer and Information Sciences

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 8 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
v. 2.35.4