Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Weighting Query Terms Based on Distributional Statistics
RISE, Swedish ICT, SICS.ORCID iD: 0000-0003-4042-4919
RISE - Research Institutes of Sweden, ICT, SICS.ORCID iD: 0000-0001-5100-0535
RISE, Swedish ICT, SICS.
2006 (English)In: Accessing Multilingual Information Repositories, 6th Workshop of the Cross-Language Evalution Forum, CLEF 2005, Vienna, Austria, 21-23 September, 2005: Revised Papers, 2006, 1, , p. 5Conference paper, Published paper (Refereed)
Abstract [en]

This year, the SICS team has concentrated on query processing and on the internal topical structure of the query, specifically compound translation. Compound translation is non-trivial due to dependencies between compound elements. This year, we have investigated topical dependencies between query terms: if a query term happens to be non-topical or noise, it should be discarded or given a low weight when ranking retrieved documents; if a query term shows high topicality its weight should be boosted. The two experiments described here are based on the analysis of the distributional character of query terms: one using similarity of occurrence context between query terms globally across the entire collection; the other using the likelihood of individual terms to appear topically in individual texts. Both -- complementary -- boosting schemes tested delivered improved results.

Place, publisher, year, edition, pages
2006, 1. , p. 5
National Category
Computer and Information Sciences
Identifiers
URN: urn:nbn:se:ri:diva-21049OAI: oai:DiVA.org:ri-21049DiVA, id: diva2:1041083
Conference
6th Workshop of the Cross-Language Evalution Forum, CLEF 2005, Vienna, Austria, 21-23 September, 2005
Available from: 2016-10-31 Created: 2016-10-31 Last updated: 2018-08-21Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

http

Authority records BETA

Sahlgren, Magnus

Search in DiVA

By author/editor
Karlgren, JussiSahlgren, Magnus
By organisation
SICSSICS
Computer and Information Sciences

Search outside of DiVA

GoogleGoogle Scholar

urn-nbn

Altmetric score

urn-nbn
Total: 1 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • harvard1
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
v. 2.35.7