PMI Based Clustering Algorithm for Feature Reduction in Text Classification

P.Jeyadurga; Prof. P. R. Vijaya Lakshmi; J.S.Kanchana

Abstract

Feature clustering is a feature reduction method that reduces the dimensionality of feature vectors for text classification. This paper proposes an incremental feature clustering approach that uses semantic similarity to cluster the features. Pointwise Mutual Information (PMI) is a widely used word similarity measure that estimates the semantic similarity between two words and is an alternative to distributional similarity. Computing PMI requires only simple statistics about the two words: the number of co-occurrences (correlations) of the two concepts within a fixed-size context. Once the words from the preprocessed documents are fed in, clusters are formed and one feature (the head word) is identified for each cluster; these head words are used for indexing the document. PMI assumes that a word has a single sense, but clustering can be optimized further if the polysemy of words is considered. Hence PMI may be combined with PMImax, which also estimates the correlation between the closest senses of two words, thereby giving better feature reduction and execution time compared with other approaches.
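
The pipeline described above can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several details are assumptions made only for illustration: co-occurrence is counted per document rather than in a fixed-size window, PMI uses the standard definition log2(p(a, b) / (p(a) p(b))), the first word placed in a cluster serves as its head word, and the function names (pmi_table, incremental_clusters, reduce_document) and the threshold value are hypothetical. The PMImax extension for polysemous words is not covered.

```python
import math
from collections import Counter
from itertools import combinations


def pmi_table(docs):
    """Build a PMI function from document-level co-occurrence counts.

    Assumption: two words co-occur when they appear in the same document.
    """
    n_docs = len(docs)
    word_df = Counter()  # number of documents containing each word
    pair_df = Counter()  # number of documents containing both words
    for doc in docs:
        vocab = sorted(set(doc))
        word_df.update(vocab)
        pair_df.update(combinations(vocab, 2))  # pairs are in sorted order

    def pmi(a, b):
        joint = pair_df[tuple(sorted((a, b)))]
        if joint == 0:
            return float("-inf")  # the words never co-occur
        # log2( p(a, b) / (p(a) * p(b)) ) with document-level probabilities
        return math.log2(joint * n_docs / (word_df[a] * word_df[b]))

    return pmi, word_df


def incremental_clusters(words, pmi, threshold=0.5):
    """Feed words one at a time; join the first cluster whose head word
    has PMI >= threshold with the new word, otherwise start a new cluster."""
    clusters = []  # each cluster is a list; element 0 is its head word
    for w in words:
        for cluster in clusters:
            if pmi(w, cluster[0]) >= threshold:
                cluster.append(w)
                break
        else:
            clusters.append([w])
    return clusters


def reduce_document(doc, clusters):
    """Index a document by the head words of the clusters its words fall in."""
    head_of = {w: c[0] for c in clusters for w in c}
    return Counter(head_of[w] for w in doc if w in head_of)


docs = [
    ["stock", "market", "price"],
    ["stock", "market", "trade"],
    ["goal", "match", "team"],
    ["goal", "team", "score"],
]
pmi, word_df = pmi_table(docs)
# Assumption: feed frequent words first so they become head words.
order = sorted(word_df, key=lambda w: (-word_df[w], w))
clusters = incremental_clusters(order, pmi, threshold=0.5)
print([c[0] for c in clusters])
print(reduce_document(["stock", "price", "goal"], clusters))
```

In this toy corpus the eight distinct words collapse into two clusters, so each document is indexed by two head-word features instead of eight word features. Comparing a new word only against head words keeps each insertion cheap, at the cost of making the result depend on the order in which words are fed.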
