RUO Principal

Repositorio Institucional de la Universidad de Oviedo

Ver ítem 
  •   RUO Principal
  • Producción Bibliográfica de UniOvi: RECOPILA
  • Artículos
  • Ver ítem
  •   RUO Principal
  • Producción Bibliográfica de UniOvi: RECOPILA
  • Artículos
  • Ver ítem
    • español
    • English
JavaScript is disabled for your browser. Some features of this site may not work without it.

Listar

Todo RUOComunidades y ColeccionesPor fecha de publicaciónAutoresTítulosMateriasxmlui.ArtifactBrowser.Navigation.browse_issnPerfil de autorEsta colecciónPor fecha de publicaciónAutoresTítulosMateriasxmlui.ArtifactBrowser.Navigation.browse_issn

Mi cuenta

AccederRegistro

Estadísticas

Ver Estadísticas de uso

AÑADIDO RECIENTEMENTE

Novedades
Repositorio
Cómo publicar
Recursos
FAQs

Application of Variable Length N-gram Vectors to Monolingual and Bilingual Information Retrieval

Autor(es) y otros:
Gayo Avello, DanielAutoridad Uniovi; Álvarez Gutiérrez, DarioAutoridad Uniovi; Gayo Avello, José
Fecha de publicación:
2005
Editorial:

Springer

Versión del editor:
http://dx.doi.org/10.1007/11519645_7
Citación:
Lecture Notes in Computer Science, 3491, p. 73-82(2005); doi:10.1007/11519645_7
Descripción física:
p. 73-82
Resumen:

Our group in the Department of Informatics at the University of Oviedo has participated, for the first time, in two tasks from CLEF: monolingual (Russian) and bilingual (Spanish-to-English) information retrieval. Our main goal was to test the application to IR of a modified version of n-gram vector space model (codenamed blindLight). This new approach has been successfully applied to other NLP tasks such as language identification or text summarization and the results achieved at CLEF'04, although not exceptional, are encouraging. Major differences between the blindLight approach and classical techniques are two: (1) relative frequencies are no more used as vector weights but replaced by n-gram significances, and (2) cosine distance is abandoned in favor of a new metric inspired by sequence alignment techniques although not so computationally expensive. In order to perform cross-language IR we have developed a naive n-gram pseudo-translator similar to those described by McNamee and Mayfield or Pirkola et al.

Our group in the Department of Informatics at the University of Oviedo has participated, for the first time, in two tasks from CLEF: monolingual (Russian) and bilingual (Spanish-to-English) information retrieval. Our main goal was to test the application to IR of a modified version of n-gram vector space model (codenamed blindLight). This new approach has been successfully applied to other NLP tasks such as language identification or text summarization and the results achieved at CLEF'04, although not exceptional, are encouraging. Major differences between the blindLight approach and classical techniques are two: (1) relative frequencies are no more used as vector weights but replaced by n-gram significances, and (2) cosine distance is abandoned in favor of a new metric inspired by sequence alignment techniques although not so computationally expensive. In order to perform cross-language IR we have developed a naive n-gram pseudo-translator similar to those described by McNamee and Mayfield or Pirkola et al.

URI:
http://hdl.handle.net/10651/8140
ISSN:
0302-9743
DOI:
10.1007/11519645_7
Colecciones
  • Artículos [37544]
Ficheros en el ítem
Métricas
Compartir
Exportar a Mendeley
Estadísticas de uso
Estadísticas de uso
Metadatos
Mostrar el registro completo del ítem
Página principal Uniovi

Biblioteca

Contacto

Facebook Universidad de OviedoTwitter Universidad de Oviedo
El contenido del Repositorio, a menos que se indique lo contrario, está protegido con una licencia Creative Commons: Attribution-NonCommercial-NoDerivatives 4.0 Internacional
Creative Commons Image