Information extraction using Link Grammar

Zamin, N. (2009) Information extraction using Link Grammar. In: 2009 WRI World Congress on Computer Science and Information Engineering, CSIE 2009, 31 March 2009 through 2 April 2009, Los Angeles, CA.

Official URL: http://www.scopus.com/inward/record.url?eid=2-s2.0...

Abstract

In the last few years, Information Extraction (IE) has become a rapidly expanding field as the volume of machine-readable documents keeps growing exponentially. IE is well suited to transforming factual knowledge from publications into database entries. Many efforts have been made to automatically extract and mine scientific texts, ranging from biochemical literature to reports of terrorist attacks. This study looks into the opportunity to extract important facts from PETRONAS Health, Safety and Environment (HSE) reports for database construction and analysis. The reports are currently managed by PETRONAS Group HSE in Malaysia and contain information on incidents and accidents that occurred during design, construction, operation and maintenance at all PETRONAS Operating Units, locally and worldwide. Automating the processing of these reports would greatly benefit PETRONAS Group HSE by populating database entries automatically, a task that has traditionally been arduous and time-consuming. Many algorithms have been reported for IE, ranging from simple statistical methods to advanced Natural Language Processing (NLP) methods. This study investigates one NLP approach, known as Link Grammar (LG), for extracting relevant information. Based on a limited literature search, LG appears to be the most suitable candidate algorithm. However, an exhaustive literature search will be needed to reveal the algorithm best suited to this application. © 2008 IEEE.

Item Type: Conference or Workshop Item (Paper)
Uncontrolled Keywords: AS-links; Database construction; Database entry; Factual knowledge; Health safety; Information extraction; Link grammar; Literature search; Malaysia; Natural language processing; Operating units; Operation and maintenance; PETRONAS; Scientific texts; Terrorism attack; Computational linguistics; Computer science; Database systems; Information analysis; Mining; Natural language processing systems
Subjects: Q Science > Q Science (General); Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Departments / MOR / COE: Departments > Computer Information Sciences
ID Code: 158
Deposited By: Mrs Norshuhani Zamin
Deposited On: 24 Feb 2010 14:38
Last Modified: 19 Jan 2017 08:25
