Applying proper noun detection in Spanish for information retrieval

Authors

  • Ángel Francisco Zazo Rodríguez, Departamento de Informática y Automática, Facultad de Traducción y Documentación, Universidad de Salamanca, España
  • Carlos García Figuerola Paniagua, Departamento de Informática y Automática, Facultad de Traducción y Documentación, Universidad de Salamanca, España
  • José Luis Alonso Berrocal, Departamento de Informática y Automática, Facultad de Traducción y Documentación, Universidad de Salamanca, España

DOI:

https://doi.org/10.54886/ibersid.v1i.3272

Abstract

This work presents an automatic method for detecting proper nouns in Spanish documents. The objective is to test whether the method can improve retrieval performance. We assumed a priori that an indexing process incorporating more information about the document would also yield better results in classical information retrieval, but our results show that this is not the case. Many tests were carried out to find the best performance in each situation: single proper nouns, compound proper nouns, compound proper nouns plus single proper nouns, different weighting schemes for proper nouns, etc. The results were discouraging: retrieval performance deteriorated in every test. The worst case is detecting compound proper nouns; the effect is less dramatic when the single nouns that make up the compound ones are also considered.

Published

2007-09-15

How to Cite

Zazo Rodríguez, Á. F., García Figuerola Paniagua, C., & Alonso Berrocal, J. L. (2007). Applying proper noun detection in Spanish for information retrieval. Ibersid: Journal of Information and Documentation Systems (ISSNe 2174-081X; ISSN 1888-0967), 1, 109–116. https://doi.org/10.54886/ibersid.v1i.3272

Section

Articles
