Personal tools
You are here: Home Projects Text genre classification project
Document Actions

Text genre classification project

by Olivier Gevaert last modified 2006-04-26 11:15

Text classification is the automatic categorisation of texts based on features gathered from the text. Automatic text categorisation can classify texts in different types (i.e. scientific, news, story, etc.), according to the subject (football, cars, computers, etc.) or texts can be classified by genre (objective or subjective, positive or negative). Genre means here if the text is positive or negative about a certain topic. In this project texts are classified as positive or negative opinions. The classification is applied on movie reviews, game reviews, restaurant reviews and book reviews. A number of different feature sets are implemented to try to catch the opinion of a text. Then machine learning techniques are applied to train a classifier such as Naïve Bayes or decision tree learning to categorise the texts where a number of different features are extracted. Then the classifier is tested in another domain (i.e. a movie classifier is tested on the restaurant reviews) to look if its performance can be generalised.


« November 2009 »
Su Mo Tu We Th Fr Sa
1 2 3 4 5 6 7
8 9 10 11 12 13 14
15 16 17 18 19 20 21
22 23 24 25 26 27 28
29 30
 

Powered by Plone CMS, the Open Source Content Management System

This site conforms to the following standards: