Scalable Text Filtering System

Foong, Oi Mean and Ahmad Izuddin Zainal Abidin, A. and Yong, S.P. (2006) Scalable Text Filtering System. In: M2USIC Conference, 16-17 November 2006, Kuala Lumpur, Malaysia.

[img] PDF - Published Version
Restricted to Registered users only



The advancement in computing enables anyone to become information producer, resulting in rapidly growing information in the Internet. One concern arising from this phenomenon is the easy access to offensive, vulgar or obscene page by anyone with access to Internet. One of the solutions for this concern is filtering software. This paper presents a prototype called DocFilter that filters harmful content of text document without human intervention. The prototype is designed to extract each word of the document, stem the words into its root and compare each word to the list of harmful words in the hash set. Two systems evaluation were conducted to ascertain the performance of DocFilter system. Using various blocking levels, the prototype yields average filtering scores of 73.4%. The system is regarded to have produced an effective filtering accuracy of offensive words for most English text document.

Item Type:Conference or Workshop Item (Paper)
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Departments / MOR / COE:Departments > Computer Information Sciences
ID Code:2569
Deposited By: Foong Oi Mean
Deposited On:30 Jul 2010 07:11
Last Modified:19 Jan 2017 08:27

Repository Staff Only: item control page

Document Downloads

More statistics for this item...