An Overview of E-Documents Classification

Aurangzeb, Khan and Baharum, Baharudin and Khairullah, Khan (2010) An Overview of E-Documents Classification. In: 2009 International Conference on Machine Learning and Computing.


With the increasing availability of electronic documents and the rapid growth of the World Wide Web, automatic document categorization has become a key method for organizing information and for knowledge and trend detection. As online resources grow in availability and popularity, the classification of e-documents, news, and personal blogs, and the extraction of knowledge and trends from them, has become an active research area, since the World Wide Web is the fastest medium for collecting news and events from around the world. This growing volume of textual data calls for text mining, machine learning, and natural language processing techniques to organize documents and extract patterns and knowledge from them. This overview surveys the existing literature and explores the main techniques and methods for automatic document classification, i.e. document representation, classifier construction, and knowledge extraction, and discusses the open issues along with approaches and opportunities.

Item Type: Conference or Workshop Item (Paper)
Subjects: T Technology > T Technology (General)
Departments / MOR / COE: Departments > Computer Information Sciences
ID Code: 6430
Deposited By: Dr Baharum Baharudin
Deposited On: 26 Sep 2011 09:36
Last Modified: 19 Jan 2017 08:24
