La detección de nombres propios en español y su aplicación en recuperación de información

Ángel Francisco Zazo Rodríguez; Carlos García Figuerola Paniagua; José Luis Alonso Berrocal

doi:10.54886/ibersid.v1i.3272

Applying proper noun detection in Spanish for information retrieval

Authors

Ángel Francisco Zazo Rodríguez, Departamento de Informática y Automática, Facultad de Traducción y Documentación, Universidad de Salamanca, España
Carlos García Figuerola Paniagua, Departamento de Informática y Automática, Facultad de Traducción y Documentación, Universidad de Salamanca, España
José Luis Alonso Berrocal, Departamento de Informática y Automática, Facultad de Traducción y Documentación, Universidad de Salamanca, España

Abstract

This work presents an automatic method for detecting proper nouns in Spanish documents. The objective is to check whether the method can be applied to improve retrieval performance. A priori, we assumed that an indexing process that incorporates more information about the document would also yield better results in classical information retrieval. Our results show that this assumption does not hold. Many tests were carried out to find the best performance across all situations: single proper nouns, compound proper nouns, compound proper nouns plus single proper nouns, different weighting schemes for proper nouns, etc. The results were discouraging: retrieval performance deteriorated in every test. The worst case is the detection of compound proper nouns. The effect is less dramatic when the single nouns that make up the compound ones are also considered.

Published

2007-09-15

How to Cite

Zazo Rodríguez, Ángel F., García Figuerola Paniagua, C., & Alonso Berrocal, J. L. (2007). Applying proper noun detection in Spanish for information retrieval. Ibersid: Journal of Information and Documentation Systems (ISSNe 2174-081X; ISSN 1888-0967), 1, 109–116. https://doi.org/10.54886/ibersid.v1i.3272

Issue

Vol. 1 (2007)

Section

Articles

License

Copyright (c) 2007. Authors retain their copyright, but transfer the exploitation rights (reproduction, distribution, public communication and transformation) to the journal in a non-exclusive way and guarantee the journal the right of first publication of their work, which will simultaneously be subject to the CC BY-NC-ND license. Authors take full personal responsibility for complying with all the appropriate ethical codes and laws, and for obtaining all the necessary copyright permissions regarding their articles. Institutional and self-archiving is allowed and encouraged.

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

