Using Stemming Algorithms on a Grid Environment
Valeriana Roncero (COPPE/UFRJ)Myrian Costa (COPPE/UFRJ)
Nelson Ebecken (COPPE/UFRJ)
Abstract:
Stemming algorithms are commonly used in Information Retrieval with the goal of reducing the number of the words which are in the same morphological variant in a common representation. Stemming analysis is one of the tasks of the pre-processing phase on text mining that consumes a lot of time. This study proposes a model of distributed stemming analysis on a grid environment to reduce the stemming processing time; this speeds up the text preparation. This model can be integrated into grid-based text mining tool, helping to improve the overall performance of the text mining process.
Keywords:
Query processing and information retrieval in Grids