A Concept-based Model for Enhancing Text Categorization

author: Shady Shehata, School of Computer Science, University of Waterloo
published: Sept. 14, 2007,   recorded: September 2007,   views: 8261
Categories

Related content

Report a problem or upload files

If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.
Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Lecture popularity: You need to login to cast your vote.
  Delicious Bibliography

Description

Most of text categorization techniques are based on word and/or phrase analysis of the text. Statistical analysis of a term frequency captures the importance of the term within a document only. However, two terms can have the same frequency in their documents, but one term contributes more to the meaning of its sentences than the other term. Thus, the underlying model should indicate terms that capture the semantics of text. In this case, the model can capture terms that present the concepts of the sentence, which leads to discover the topic of the document. A new concept-based model that analyzes terms on the sentence and document levels rather than the traditional analysis of document only is introduced. The concept-based model can effectively discriminate between non-important terms with respect to sentence semantics and terms which hold the concepts that represent the sentence meaning. The proposed model consists of concept-based statistical analyzer, conceptual ontological graph representation, and concept extractor. The term which contributes to the sentence semantics is assigned two different weights by the concept-based statistical analyzer and the conceptual ontological graph representation. These two weights are combined into a new weight. The concepts that have maximum combined weights are selected by the concept extractor. A set of experiments using the proposed concept-based model on different datasets in text categorization is conducted. The experiments demonstrate the comparison between traditional weighting and the concept-based weighting obtained by the combined approach of the concept-based statistical analyzer and the conceptual ontological graph. The evaluation of results is relied on two quality measures, the Macro-averaged F1 and the Error rate. These quality measures are improved when the newly developed concept-based model is used to enhance the quality of the text categorization

Link this page

Would you like to put a link to this lecture on your homepage?
Go ahead! Copy the HTML snippet !

Reviews and comments:

Comment1 mr aa, October 8, 2008 at 11:34 a.m.:

very nice lecture


Comment2 Khalid Elgazzar, March 23, 2009 at 1 a.m.:

Fabulous speaker


Comment3 jon Smith, June 27, 2019 at 9:25 p.m.:

Thank you so much for this. I was into this issue and tired to tinker around to check if its possible but couldnt get it done. Now https://vidmate.onl/ that i have seen the way you did it, thanks guys
with
regards


Comment4 jon Smith, June 27, 2019 at 9:25 p.m.:

so much for this. I was into this issue and tired to tinker around to check if its possible but couldnt get it done. Now that i have https://showbox.run/ seen the way you did it, thanks guys
with
regards


Comment5 charle ryals, July 29, 2020 at 1:35 p.m.:

This video lecture shows how to improve the quality of this community.The https://teen-stripping-game.pw/ will make them understand about how to play their favorite games at free.Thanks for sharing this video lecture link with us.

Write your own review or comment:

make sure you have javascript enabled or clear this field: