  • 101.
    Karlgren, Jussi
    et al.
    RISE, Swedish ICT, SICS.
    Kauber, Markus
    Boujemaa, Nozha
    Compañó, Ramón
    Dosch, Christoph
    Geurts, Joost
    Gouraud, Henri
    King, Paul
    Köhler, Joachim
    van der Linden, Pieter
    Ortgies, Robert
    Rudström, Åsa
    RISE, Swedish ICT, SICS, Decisions, Networks and Analytics lab.
    Sebe, Nicu
    CHORUS Deliverable 3.4: Vision Document. 2009. Other (Other academic)
    Abstract [en]

    The goal of the CHORUS Vision Document is to create a high level vision on audio-visual search engines in order to give guidance to the future R&D work in this area and to highlight trends and challenges in this domain. The vision of CHORUS is strongly connected to the CHORUS Roadmap Document (D2.3). A concise document integrating the outcomes of the two deliverables will be prepared for the end of the project (NEM Summit).

  • 102.
    Karlgren, Jussi
    et al.
    RISE, Swedish ICT, SICS.
    Kauber, Markus
    Geurts, Joost
    Boujemaa, Nozha
    Sebe, Nicu
    Ortgies, Robert
    Dosch, Christoph
    Compañó, Ramón
    Köhler, Joachim
    CHORUS Deliverable 4.5: Report of the 3rd CHORUS Conference. 2009. Other (Other academic)
    Abstract [en]

    The third and last CHORUS conference on Multimedia Search Engines took place from the 26th to the 27th of May 2009 in Brussels, Belgium. About 100 participants from 15 European countries, the US, Japan and Australia learned about the latest developments in the domain. An exhibition of 13 stands presented 16 research projects currently ongoing around the world.

  • 103.
    Karlgren, Jussi
    et al.
    RISE, Swedish ICT, SICS.
    Olsson, Fredrik
    RISE, Swedish ICT, SICS.
    Trusting the results in crosslingual keyword-based image retrieval. 2007. In: Evaluation of Multilingual and Multi-modal Information Retrieval, 7th Workshop of the Cross-Language Evaluation Forum, CLEF 2006: Revised Selected Papers, 2007, 1, p. 3. Conference paper (Refereed)
    Abstract [en]

    This paper gives a brief description of the starting points for the experiments the SICS team has performed in the 2006 interactive CLEF campaign.

  • 104.
    Karlgren, Jussi
    et al.
    RISE, Swedish ICT, SICS.
    Rudström, Åsa
    RISE, Swedish ICT, SICS, Decisions, Networks and Analytics lab.
    Dosch, Christoph
    Ortgies, Robert
    CHORUS Deliverable 4.3: Report from CHORUS workshops on national initiatives and metadata. 2007. Other (Other academic)
    Abstract [en]

    Minutes of the following workshops: National Initiatives on Multimedia Content Description and Retrieval (Geneva, October 10th, 2007) and Metadata in Audio-Visual/Multimedia Production and Archiving (Munich, IRT, 21st – 22nd November 2007).

    Workshop in Geneva, 10 October 2007. This highly successful workshop was organised in cooperation with the European Commission. The event brought together the technical, administrative and financial representatives of the various national initiatives, which have been established recently in some European countries to support research and technical development in the area of audio-visual content processing, indexing and searching for the next generation Internet using semantic technologies, and which may lead to an internet-based knowledge infrastructure. The objective of this workshop was to provide a platform for mutual information and exchange between these initiatives, the European Commission and the participants. Top speakers were present from each of the national initiatives. There was time for discussions with the audience and amongst the European National Initiatives. The challenges, commonalities, difficulties, targeted/expected impact, success criteria, etc. were tackled. This workshop addressed how these national initiatives could work together and benefit from each other.

    Workshop in Munich, 21 – 22 November 2007. Numerous EU and national research projects are working on the automatic or semi-automatic generation of descriptive and functional metadata derived from analysing audio-visual content. The owners of AV archives and production facilities are eagerly awaiting such methods, which would help them to better exploit their assets. Hand in hand with the digitization of analogue archives and the archiving of digital AV material, metadata should be generated at as high a semantic level as possible, preferably fully automatically. All users of metadata rely on a certain metadata model. All AV/multimedia search engines, developed or under current development, would have to respect some compatibility or compliance with the metadata models in use. The purpose of this workshop is to draw attention to the specific problem of metadata models in the context of (semi)-automatic multimedia search.

  • 105.
    Karlgren, Jussi
    et al.
    RISE, Swedish ICT, SICS.
    Sahlgren, Magnus
    RISE - Research Institutes of Sweden, ICT, SICS.
    From Words to Understanding. 2001. In: Foundations of Real-World Intelligence, Stanford, California: CSLI Publications, 2001, 1, p. 294-308. Chapter in book (Refereed)
  • 106.
    Karlgren, Jussi
    et al.
    RISE, Swedish ICT, SICS.
    Sahlgren, Magnus
    RISE - Research Institutes of Sweden, ICT, SICS.
    Vector-based semantic analysis using random indexing and morphological analysis for cross-lingual information retrieval. 2002. In: Revised Papers from the Second Workshop of the Cross-Language Evaluation Forum on Evaluation of Cross-Language Information Retrieval Systems, Darmstadt, Germany, September 3 - 4, 2001, Springer-Verlag, 2002, 1, p. 169-176. Chapter in book (Refereed)
    Abstract [en]

    Meaning, the main object of study in information access, is most decidedly situation-dependent. While much of meaning appears to achieve consistency across usage situations -- a term will seem to mean much the same thing in many of its contexts -- most everything can be negotiated on the go. Human processing appears to be flexible in this respect, and oriented towards learning from prototypes rather than learning by definition: learning new words, and adding new meanings or shades of meaning to an existing word does not need a formal re-training process. We have built a query expansion and translation tool for information retrieval systems. When used in one single language it will expand the terms of a query using a thesaurus built for that purpose; when used across languages it will provide numerous translations and near translations for the source language terms. The underlying technology we are testing is that of vector-based semantic analysis, an analysis method related to latent semantic indexing based on stochastic pattern computing. This paper will briefly describe how we acquired training data, aligned it, analyzed it using morphological analysis tools, and finally built a thesaurus using the data, but will concentrate on an overview of vector-based semantic analysis and how stochastic pattern computing differs from latent semantic indexing in its current form.

  • 107.
    Karlgren, Jussi
    et al.
    RISE, Swedish ICT, SICS.
    Sahlgren, Magnus
    RISE - Research Institutes of Sweden, ICT, SICS.
    Cöster, Rickard
    RISE, Swedish ICT, SICS.
    Weighting Query Terms Based on Distributional Statistics. 2006. In: Accessing Multilingual Information Repositories, 6th Workshop of the Cross-Language Evaluation Forum, CLEF 2005, Vienna, Austria, 21-23 September, 2005: Revised Papers, 2006, 1, p. 5. Conference paper (Refereed)
    Abstract [en]

    This year, the SICS team has concentrated on query processing and on the internal topical structure of the query, specifically compound translation. Compound translation is non-trivial due to dependencies between compound elements. This year, we have investigated topical dependencies between query terms: if a query term happens to be non-topical or noise, it should be discarded or given a low weight when ranking retrieved documents; if a query term shows high topicality its weight should be boosted. The two experiments described here are based on the analysis of the distributional character of query terms: one using similarity of occurrence context between query terms globally across the entire collection; the other using the likelihood of individual terms to appear topically in individual texts. Both -- complementary -- boosting schemes tested delivered improved results.

  • 108.
    Karlgren, Jussi
    et al.
    RISE, Swedish ICT, SICS.
    Sahlgren, Magnus
    RISE - Research Institutes of Sweden, ICT, SICS.
    Järvinen, Timo
    Cöster, Rickard
    RISE, Swedish ICT, SICS.
    Dynamic lexica for query translation. 2005. In: Multilingual Information Access for Text, Speech and Images, Third Workshop of the Cross-Language Evaluation Forum (CLEF), 2005, 1. Conference paper (Refereed)
    Abstract [en]

    This experiment tests a simple, scalable, and effective approach to building a domain-specific translation lexicon using distributional statistics over parallelized bilingual corpora. A bilingual lexicon is extracted from aligned Swedish-French data and used to translate CLEF topics from Swedish to French; the resulting French queries are then used to retrieve documents from the French language CLEF collection. The results give 34 of 50 queries on or above median for the recall-oriented "precision at 1000 documents" score, and many of the errors could be handled by string matching and cognate search. We conclude that the approach presented here is a simple and efficient component in an automatic query translation system.

  • 109.
    Karlgren, Jussi
    et al.
    RISE, Swedish ICT, SICS.
    Straszheim, Troy
    Visualizing Stylistic Variation. 1997. In: Proceedings of the 30th Hawaii International Conference on Systems Sciences, 1997, 1. Conference paper (Refereed)
    Abstract [en]

    Texts vary not only by topic, but by style; indeed, often the variation between texts 'about the same thing' can be just as noticeable as the variation between texts 'about different things'. Some facets of this variation are quite easy to detect, and quite predictable when applied to categorization of texts by genre, functional style, or - tentatively - quality. Making use of such variation in a retrieval context is quite straightforward in principle; our work consists of an implementation of a visualization tool for document databases. The issues addressed include 1) choice of stylistic items to investigate, 2) composition of dimensions of variation, and 3) judicious naming of dimensions for presentation. We use principal components analysis to combine our quite large number of stylistic items into the two most significant dimensions of variation and plot the document space under consideration onto a plane. This space can be used as a first or last filter in an information retrieval task. The composition of the most significant dimensions is naturally corpus dependent, as is the naming of them: our work is tested on Internet and TREC data.

  • 110.
    Karlgren, Jussi
    et al.
    RISE, Swedish ICT, SICS.
    Tarhio, Jorma
    Hyyrö, Heikki
    String processing and information retrieval. 2009 (ed. 1). Book (Refereed)
    Abstract [en]

    Proceedings of the 16th International Symposium on String Processing and Information Retrieval (SPIRE 2009), Saariselkä, Finland, 25-27 August 2009.

  • 111.
    Klamberg, Mark
    et al.
    Nilsson, Mikael
    Petersson, Anna
    Seipel, Peter
    Flyghed, Janne
    Magnusson Sjöberg, Cecilia
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Bylund, Markus
    RISE - Research Institutes of Sweden, ICT, SICS.
    Palmås, Karl
    Kullenberg, Christopher
    Ström, Pär
    Thorburn, Daniel
    Westerholm, Johan
    FRA-lagen medför massiv kartläggning av oskyldiga [The FRA law entails mass mapping of innocent people]. 2008. Other (Other (popular science, discussion, etc.))
    Abstract [sv, translated to English]

    Mark Klamberg, doctoral student in law, Stockholm University; Mikael Nilsson, doctoral student in computer science, KTH Royal Institute of Technology; Anna Petersson, doctoral student in mathematics, Uppsala University; Peter Seipel, professor emeritus of law and informatics, Stockholm University; Janne Flyghed, professor of criminology, Stockholm University; Cecilia Magnusson Sjöberg, professor of law and informatics, Stockholm University; Jussi Karlgren, docent in language technology, Swedish Institute of Computer Science; Markus Bylund, PhD in computer and systems sciences specializing in personal privacy, Swedish Institute of Computer Science; Karl Palmås, MSc in engineering and PhD in sociology, School of Business, Economics and Law at the University of Gothenburg; Christopher Kullenberg, doctoral student in theory of science, University of Gothenburg; Pär Ström, MSc in engineering and author, privacy ombudsman at the think tank Den nya välfärden; Daniel Thorburn, professor of statistics, Stockholm University; Johan Westerholm, captain, reserve officer in the intelligence service, adviser at Greycat.

  • 112.
    Nylander, Stina
    et al.
    RISE, Swedish ICT, SICS, Software and Systems Engineering Laboratory.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Statistics and phonotactical rules in finding OCR errors. 1999. In: Proceedings of NODALIDA'99 (Nordiska Datorlingvistikdagarna), 1999, 1. Conference paper (Refereed)
  • 113.
    Olsson, Fredrik
    et al.
    RISE, Swedish ICT, SICS. Userware.
    Hansen, Preben
    RISE, Swedish ICT, SICS. Userware.
    Franzén, Kristofer
    RISE, Swedish ICT, SICS. Userware.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS. Userware.
    Information Access and Refinement - a Research Theme. 2001. In: ERCIM News. Article in journal (Refereed)
  • 114.
    Olsson, Fredrik
    et al.
    RISE, Swedish ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Hansen, Preben
    RISE, Swedish ICT, SICS.
    Svensson, Martin
    Cöster, Rickard
    RISE, Swedish ICT, SICS.
    Sahlgren, Magnus
    RISE - Research Institutes of Sweden, ICT, SICS.
    Consensus and opinions; quality and churn. 2006. Conference paper (Refereed)
    Abstract [en]

    The role of the web user is under transformation from merely being an information consumer to also being a content provider, "from information age to participation age", in the words of Sun CEO Scott McNealy. This increase in participation is most obviously manifested by the growth of online communities, weblogs (blogs), and various forms of cooperative and participatory publication of information. One main factor in the shift towards participation is the advent of authoring tools for wikipedias and blogs. Such tools have considerably lowered the threshold for publishing material online: it is no longer necessary to have knowledge about the technical workings of the web to be able to use it for making information available to a massive number of potential readers. (Although the lion's share of information produced will probably remain in text form in the foreseeable future, it should be noted that other modalities, such as podcasts, screencasts, films and images, are increasingly attracting interest.) The dynamic nature of blogs and wikipedias poses new challenges to the field of information access and refinement; new theories, methods, and tools for alleviating the burden of digesting information on behalf of the readers are clearly needed. This paper presents some issues on readership and participation we are currently considering.

  • 115.
    Ortgies, Robert
    et al.
    Dosch, Christoph
    Nesvadba, Jan
    Proidl, Adolf
    Gouraud, Henri
    van der Linden, Pieter
    Boujemaa, Nozha
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Compañó, Ramón
    Köhler, Joachim
    King, Paul
    Lowen, David
    CHORUS Deliverable 3.3: Vision Document - Intermediate version. 2008. Other (Other academic)
    Abstract [en]

    The goal of the CHORUS vision document is to create a high-level vision on audio-visual search engines in order to give guidance to the future R&D work in this area (in line with the mandate of CHORUS as a Coordination Action). This current intermediate draft of the CHORUS vision document (D3.3) is based on the previous CHORUS vision documents D3.1 to D3.2 and on the results of the six CHORUS Think-Tank meetings held in March, September and November 2007 as well as in April, July and October 2008, and on the feedback from other CHORUS events. The outcome of the six Think-Tank meetings will not just be to the benefit of the participants, who are stakeholders and experts from academia and industry – CHORUS, as a coordination action of the EC, will feed back the findings (see Summary) to the projects under its purview and, via its website, to the whole community working in the domain of AV content search. A few subsections of this deliverable are to be completed after the eighth (and presumably last) Think-Tank meeting in spring 2009.

  • 116.
    Palme, Jacob
    et al.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Pargman, Daniel
    Issues when designing filters in messaging systems. 1996. In: Computer Communications, ISSN 0140-3664, E-ISSN 1873-703X, Vol. 19, p. 95-101. Article in journal (Refereed)
    Abstract [en]

    The increasing size of messaging communities increases the risk of information overload, especially when group communication tools like mailing lists or asynchronous conferencing systems (like Usenet News) are used. Future messaging systems will require more capable filters to aid users in the selection of what to read. The increasing use of networks by non-computer professionals requires filters that are easier to use and manage than most filtering software today. Filters might use evaluations of messages made by certain users as an aid to filtering these messages for other users.

  • 117.
    Persson, Per
    et al.
    RISE, Swedish ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Korhonen, Panu
    Galore, Janet
    Tierney, Mark
    RISE, Swedish ICT, SICS.
    Redmon, Chad
    Hemanus, Juha
    Lönnqvist, Peter
    Laaksolahti, Jarmo
    RISE, Swedish ICT, SICS, Computer Systems Laboratory.
    Expressive messaging on mobile platforms. 2001. Conference paper (Refereed)
    Abstract [en]

    This paper presents design requirements for expressive, avatar-based multi-modal messaging on mobile platforms. It is argued that expressive messaging needs to exploit context of peers, embodied appearance and behaviour, in combination with text. Our approach allows strong expressiveness and yet simple, on-the-fly message composition, as required in a mobile, noisy setting. Technical challenges for a user-ready prototype are sketched. It is argued that the context of usage between work-related stationary terminals and mobile ones is radically different.

  • 118.
    Persson, Per
    et al.
    RISE, Swedish ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Korhonen, Panu
    Galore, Janet
    Tierney, Mark
    RISE, Swedish ICT, SICS.
    Redmon, Chad
    Hemánus, Juha
    Lönnqvist, Peter
    Laaksolahti, Jarmo
    RISE, Swedish ICT, SICS, Computer Systems Laboratory.
    Expressive Messaging on Mobile Platforms. 2001. Report (Other academic)
    Abstract [en]

    We present a design for expressive multimodal messaging on mobile platforms. Strong context, simple text messages, and crude animations combine well to produce surprisingly expressive results.

  • 119.
    Peters, Carol
    et al.
    Clough, Paul
    Gey, Frederic
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Magnini, Bernardo
    Oard, Douglas
    de Rijke, Maarten
    Stempfhuber, Maximilian
    Evaluation of Multilingual and Multi-modal Information Retrieval: 7th Workshop of the Cross-Language Evaluation Forum, CLEF 2006, Alicante, Spain, September 20-22, 2006, Revised Selected Papers. 2007. Conference proceedings (editor) (Refereed)
    Abstract [en]

    This book constitutes the thoroughly refereed postproceedings of the 7th Workshop of the Cross-Language Evaluation Forum, CLEF 2006, held in Alicante, Spain, September 2006. The revised papers presented together with an introduction were carefully reviewed and selected for inclusion in the book. The papers are organized in topical sections on Multilingual Textual Document Retrieval, Domain-Specific Information Retrieval, i-CLEF, QA@CLEF, ImageCLEF, CLSR, WebCLEF and GeoCLEF.

  • 120.
    Sahlgren, Magnus
    et al.
    RISE - Research Institutes of Sweden, ICT, SICS.
    Hansen, Preben
    RISE, Swedish ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    English-Japanese cross-lingual query expansion using random indexing of aligned bilingual text data. 2002. Conference paper (Refereed)
    Abstract [en]

    Vector-space techniques can be used for extracting semantically similar words from the co-occurrence statistics of words in large text data. In this paper, we report on experiments with using the Random Indexing vector-space technique for extracting a cross-lingual thesaurus from aligned English-Japanese bilingual data. The cross-lingual thesaurus has been used for automatic cross-lingual query expansion in the NTCIR patent retrieval task.

  • 121.
    Sahlgren, Magnus
    et al.
    RISE - Research Institutes of Sweden, ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Automatic Bilingual Lexicon Acquisition Using Random Indexing of Parallel Corpora. 2005. In: Natural Language Engineering, ISSN 1351-3249, E-ISSN 1469-8110, Vol. 11, no 3, p. 327-341. Article in journal (Refereed)
    Abstract [en]

    This paper presents a very simple and effective approach to using parallel corpora for automatic bilingual lexicon acquisition. The approach, which uses the Random Indexing vector space methodology, is based on finding correlations between terms based on their distributional characteristics. The approach requires a minimum of preprocessing and linguistic knowledge, and is efficient, fast and scalable. In this paper, we explain how our approach differs from traditional cooccurrence-based word alignment algorithms, and we demonstrate how to extract bilingual lexica using the Random Indexing approach applied to aligned parallel data. The acquired lexica are evaluated by comparing them to manually compiled gold standards, and we report overlap of around 60%. We also discuss methodological problems with evaluating lexical resources of this kind.

  • 122.
    Sahlgren, Magnus
    et al.
    RISE - Research Institutes of Sweden, ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS. Attityd.
    Buzz monitoring in word space. 2008. Conference paper (Refereed)
    Abstract [en]

    This paper discusses the task of tracking mentions of some topically interesting textual entity from a continuously and dynamically changing flow of text, such as a news feed, the output from an Internet crawler or a similar text source - a task sometimes referred to as buzz monitoring. Standard approaches from the field of information access for identifying salient textual entities are reviewed, and it is argued that the dynamics of buzz monitoring call for more accomplished analysis mechanisms than the typical text analysis tools provide today. The notion of word space is introduced, and it is argued that word spaces can be used to select the most salient markers for topicality and to find the associations those observations engender, and that they constitute an attractive foundation for building a representation well suited for the tracking and monitoring of mentions of the entity under consideration.

  • 123.
    Sahlgren, Magnus
    et al.
    RISE - Research Institutes of Sweden, ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Counting Lumps in Word Space: Density as a Measure of Corpus Homogeneity. 2005. Conference paper (Refereed)
    Abstract [en]

    This paper introduces a measure of corpus homogeneity that indicates the amount of topical dispersion in a corpus. The measure is based on the density of neighborhoods in semantic word spaces. We evaluate the measure by comparing the results for five different corpora. Our initial results indicate that the proposed density measure can indeed identify differences in topical dispersion.

  • 124.
    Sahlgren, Magnus
    et al.
    RISE - Research Institutes of Sweden, ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS. Attityd.
    Terminology mining in social media. 2009. Conference paper (Refereed)
    Abstract [en]

    The highly variable and dynamic word usage in social media presents serious challenges for both research and those commercial applications that are geared towards blogs or other user-generated non-editorial texts. This paper discusses and exemplifies a terminology mining approach for dealing with the productive character of the textual environment in social media. We explore the challenges of practically acquiring new terminology, and of modeling similarity and relatedness of terms from observing realistic amounts of data. We also discuss semantic evolution and density, and investigate novel measures for characterizing the preconditions for terminology mining.

  • 125.
    Sahlgren, Magnus
    et al.
    RISE - Research Institutes of Sweden, ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Cöster, Rickard
    RISE, Swedish ICT, SICS.
    SICS at CLEF 2002: automatic query expansion using random indexing. 2002. Conference paper (Refereed)
    Abstract [en]

    Vector-space techniques can be used for extracting semantically similar words from the co-occurrence statistics of words in large text data. We have used a technique called Random Indexing to accumulate context vectors for Swedish, French and Italian. We have then used the context vectors to perform automatic query expansion. In this paper, we report on our CLEF 2002 experiments on Swedish, French and Italian monolingual query expansion.

  • 126.
    Sahlgren, Magnus
    et al.
    RISE - Research Institutes of Sweden, ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Cöster, Rickard
    RISE, Swedish ICT, SICS.
    Järvinen, Timo
    Automatic query expansion using random indexing. 2003. In: Advances in Cross-Language Information Retrieval: Third Workshop of the Cross-Language Evaluation Forum, CLEF 2002. Rome, Italy, September 19-20, 2002: Revised Papers, Springer-Verlag, 2003, 1, p. 311-320. Chapter in book (Refereed)
    Abstract [en]

    Vector-space techniques can be used for extracting semantically similar words from the co-occurrence statistics of words in large text data. We have used a technique called Random Indexing to accumulate context vectors for Swedish, French and Italian. We have then used the context vectors to perform automatic query expansion. In this paper, we report on our CLEF 2002 experiments on Swedish, French and Italian monolingual query expansion.

  • 127.
    Sahlgren, Magnus
    et al.
    RISE - Research Institutes of Sweden, ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Eriksson, Gunnar
    RISE, Swedish ICT, SICS.
    SICS: Valence annotation based on seeds in word space. 2007. Conference paper (Refereed)
  • 128.
    Simaioforidis, Aggis
    et al.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Ereback, Anna-Lena
    RISE, Swedish ICT, SICS.
    Creating a task-specific collaborative information retrieval system. 1998. In: Proceedings of the 2nd European Conference on Research and Advanced Technology for Digital Libraries (ECDL), 1998, 4. Conference paper (Refereed)
  • 129.
    Simsarian, Kristian
    et al.
    RISE, Swedish ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Fahlén, Lennart
    RISE, Swedish ICT, SICS.
    Frecon, Emmanuel
    RISE, Swedish ICT, SICS.
    Bretan, Ivan
    Frost, Niklas
    Jonsson, Lars
    Achieving Virtual Presence with a Semi-Autonomous Robot Through a Multi-Reality and Speech Control Interface. 1996. In: Proceedings of 3d Eurographics Workshop on Virtual Environments, 1996, 1. Conference paper (Refereed)
    Abstract [en]

    This paper describes a model for a complex human-machine system where a human operator controls a remote robot through the mediation of a distributed virtual environment with a language interface. The system combines speech controlled graphical immersive environments with the live video from a robot working in a real environment. The worlds are synchronized and updated based on operator selections, commands and robot actions. This system allows the user to have a powerful tool with a high level of abstraction to create and control autonomous robots, thus making possible the realization of single and multiple autonomous robot applications.

  • 130.
    Strzalkowski, Tomek
    et al.
    Guthrie, Louise
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Leistensnider, Jim
    Lin, Fang
    Perez-Carballo, Jose
    Straszheim, Troy
    Wang, Jin
    Wilding, Jon
    Natural language information retrieval: TREC-5 report. 1996. In: NIST Special Publication 500-238: The Fifth Text REtrieval Conference (TREC-5), November 1996, Gaithersburg, Maryland, USA, 1996, 1, p. 28. Conference paper (Refereed)
    Abstract [en]

    In this paper we report on the joint GE/Lockheed Martin/Rutgers/NYU natural language information retrieval project as related to the 5th Text Retrieval Conference (TREC-5). The main thrust of this project is to use natural language processing techniques to enhance the effectiveness of full-text document retrieval. Since our first TREC entry in 1992 (as the NYU team) the basic premise of our research was to demonstrate that robust if relatively shallow NLP can help to derive a better representation of text documents for statistical search. TREC-5 marks a shift in this approach away from text representation issues and towards query development problems. While our TREC-5 system still performs extensive text processing in order to extract phrasal and other indexing terms, our main focus this year was on query construction using words, sentences, and entire passages to expand initial topic specifications in an attempt to cover their various angles, aspects and contexts. Based on our earlier TREC results indicating that NLP is more effective when long, descriptive queries are used, we allowed for liberal expansion with long passages from related documents imported verbatim into the queries. This method appears to have produced a dramatic improvement in the performance of two different statistical search engines that we tested (Cornell’s SMART and NIST’s Prise), boosting the average precision by at least 40%. The overall architecture of the TREC-5 system has also changed in a number of ways from TREC-4. The most notable new feature is the stream architecture in which several independent, parallel indexes are built for a given collection, each index reflecting a different representation strategy for text documents. Stream indexes are built using a mixture of different indexing approaches, term extracting, and weighting strategies. We used both SMART and Prise base indexing engines, and selected optimal term weighting strategies for each stream, based on a training collection of approximately 500 MBytes. The final results are produced by a merging procedure that combines ranked lists of documents obtained by searching all stream indexes with appropriately preprocessed queries. This allows for an effective combination of alternative retrieval and filtering methods, creating a meta-search where the contribution of each stream can be optimized through training.

  • 131. Strzalkowski, Tomek
    et al.
    Perez-Carballo, Jose
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Hulth, Anette
    Tapanainen, Pasi
    Lahtinen, Timo
    Natural language information retrieval: TREC-8 report1999In: NIST Special Publication 500-246: The Eighth Text REtrieval Conference (TREC-8), 1999, 1, , p. 10Conference paper (Refereed)
    Abstract [en]

    This report describes the adhoc experiments performed by the GE/Rutgers/SICS/SU/Conexor team in the context of TREC-8. The research efforts went in four directions: 1. As in previous years, we performed a full linguistic analysis of the entire corpus, and used the results of the analysis to provide index terms on a higher level of abstraction than can be provided by stems alone. 2. We made use of two different query expansion techniques, one automatic and one manual, both developed for TREC-8. 3. The various analysis models were combined using a stream model architecture, where each stream represents an alternative text indexing method, and the streams' overlapping knowledge was merged using a new merging algorithm derived from first principles. 4. The entire text was analyzed for various stylistic items. Due to the distributed approach, this year's research efforts partly canceled each other out. New experiments in every step of the process did not result in an overwhelming overall result. We are able to determine that the manual query expansion technique developed at General Electric performed very well.

  • 132.
    Svensson, Martin
    et al.
    RISE, Swedish ICT, SICS.
    Johansson, Arnold
    Ereback, Anna-Lena
    RISE, Swedish ICT, SICS.
    Höök, Kristina
    RISE, Swedish ICT, SICS.
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Bretan, Ivan
    Moving Usability Testing onto the Web1998In: Proceedings of WebNet'98, 1998, 1Conference paper (Refereed)
    Abstract [en]

    In order to remotely obtain detailed usability data by tracking user behaviors within a given web site, a server-based usability testing environment has been created. Web pages are annotated in such a way that arbitrary user actions (such as "mouse over link" or "click back button") can be selected for logging. In addition, the system allows the experiment designer to interleave interactive questions into the usability evaluation, which for instance could be triggered by a particular sequence of actions. The system works in conjunction with clustering and visualization algorithms that can be applied to the resulting log file data. A first version of the system has been used successfully to carry out a web usability evaluation.

  • 133.
    Täckström, Oscar
    et al.
    RISE, Swedish ICT, SICS.
    Eriksson, Gunnar
    RISE, Swedish ICT, SICS.
    Velupillai, Sumithra
    Dalianis, Hercules
    Hassel, Martin
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Uncertainty Detection as Approximate Max-Margin Sequence Labelling2010In: Proceedings of the Fourteenth Conference on Computational Natural Language Learning, 2010, 12Conference paper (Refereed)
    Abstract [en]

    This paper reports experiments for the CoNLL 2010 shared task on learning to detect hedges and their scope in natural language text. We have addressed the experimental tasks as supervised linear maximum margin prediction problems. For sentence level hedge detection in the biological domain we use an L1-regularised binary support vector machine, while for sentence level weasel detection in the Wikipedia domain, we use an L2-regularised approach. We model the in-sentence uncertainty cue and scope detection task as an L2-regularised approximate maximum margin sequence labelling problem, using the BIO-encoding. In addition to surface level features, we use a variety of linguistic features based on a functional dependency analysis. A greedy forward selection strategy is used in exploring the large set of potential features. Our official results for Task 1 for the biological domain are 85.2 F1-score, for the Wikipedia set 55.4 F1-score. For Task 2, our official results are 2.1 for the entire task with a score of 62.5 for cue detection. After resolving errors and final bugs, our final results are for Task 1, biological: 86.0, Wikipedia: 58.2; Task 2, scopes: 39.6 and cues: 78.5.

  • 134. Uzuner, Ozlem
    et al.
    Argamon, Shlomo
    Karlgren, Jussi
    RISE, Swedish ICT, SICS.
    Stylistics for text retrieval in practice2006In: ACM SIGIR Forum, Vol. 40, p. 2Article in journal (Refereed)
    Abstract [en]

    The workshop Stylistics for Text Retrieval in Practice was held during SIGIR 2006. With participants from both academia and industry, the workshop spurred interesting discussions on the future of stylistics, its practical uses, and its “killer app”. The papers presented varied from customer feedback evaluation systems to automatic speech generation.
