Automatic Keyword Extraction Using Domain Knowledge
RISE, Swedish ICT, SICS. ORCID iD: 0000-0003-4042-4919
Number of Authors: 5
2008 (English). In: Computational Linguistics and Intelligent Text Processing, Berlin / Heidelberg: Springer, 2008, 1, 10 p. Chapter in book (Refereed)
Abstract [en]

Documents can be assigned keywords by frequency analysis of the terms found in the document text, which arguably is the primary source of knowledge about the document itself. By including a hierarchically organised domain-specific thesaurus as a second knowledge source the quality of such keywords was improved considerably, as measured by match to previously manually assigned keywords. In the presented experiment, the combination of the evidence from frequency analysis and the hierarchically organised thesaurus was done using inductive logic programming.

Place, publisher, year, edition, pages
Berlin / Heidelberg: Springer, 2008, 1, 10 p.
National Category
Computer and Information Science
Identifiers
URN: urn:nbn:se:ri:diva-22263
ISBN: 978-3-540-41687-6 (print)
OAI: oai:DiVA.org:ri-22263
DiVA: diva2:1041808
Available from: 2016-10-31 Created: 2016-10-31 Last updated: 2016-12-28 Bibliographically approved

Open Access in DiVA

No full text

Search in DiVA

By author/editor
Karlgren, Jussi
By organisation
SICS
Computer and Information Science
