Cascade Generalization Based Functional Tree for Website Phishing Detection

Balogun, A.O. and Adewole, K.S. and Bajeh, A.O. and Jimoh, R.G. (2021) Cascade Generalization Based Functional Tree for Website Phishing Detection. Communications in Computer and Information Science, 1487 C. pp. 288-306.

Full text not available from this repository.
Official URL: https://www.scopus.com/inward/record.uri?eid=2-s2....

Abstract

The advent of the web and internet space has seen its adoption for rendering various services, from financial to medical services. This has brought an increase in the rate of cybersecurity issues over the years, and a prominent one is the phishing attack, where malicious websites mimic the appearance and functionalities of another legitimate website to collect users' credentials required for access to services. Several measures have been proposed to mitigate this attack; blacklisting and variants of machine learning approaches have been employed, yielding good performance results. However, there is a need to increase the rate of identification of phishing attacks and reduce the rate of false positives. This study proposes the use of a functional tree (FT) machine learning approach to mitigate phishing attacks. FT, a hybridization of multivariate decision trees and discriminant functions using constructive induction, uses logistic regression for splitting tree nodes and leaf prediction, unlike the conventional decision tree that simply splits nodes based on the data. Furthermore, a variant of the FT is proposed based on cascade generalization (CG-FT). Three datasets with varied instance distributions, both balanced and imbalanced, are used in the empirical investigation of the performance of the proposed CG-FT. The results showed that FT has improved performance over some selected baseline classifiers. Relative to FT, the CG-FT techniques showed improvement in the detection of phishing attacks, with Area Under the Curve (AUC) and True Positive rate (TP-rate) ranging from 98 to 99.6 and from 92 to 97, respectively, in the datasets. Also, the false-positive rate is reduced, with values ranging from 1.7 to 6.1. The proposed CG-FT showed improvement over all the other reviewed approaches based on the studied performance metrics. The use of FT and its hybridization with cascade generalization (CG-FT) showed an improvement in performance in the mitigation of phishing attacks.
© 2021, Springer Nature Singapore Pte Ltd.
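
The abstract reports TP-rate and false-positive rate without defining them. As a minimal sketch, assuming the standard confusion-matrix definitions (TP-rate = TP / (TP + FN), FP-rate = FP / (FP + TN)) and assuming the reported figures are percentages, the two rates can be computed as follows. The function name `rates` and the toy labels are hypothetical illustrations, not taken from the paper or its datasets.

```python
def rates(y_true, y_pred, positive=1):
    """Return (TP-rate, FP-rate) for binary labels.

    Assumes the standard definitions:
      TP-rate = TP / (TP + FN)
      FP-rate = FP / (FP + TN)
    """
    tp = fn = fp = tn = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == positive:
            if pred == positive:
                tp += 1  # phishing site correctly flagged
            else:
                fn += 1  # phishing site missed
        else:
            if pred == positive:
                fp += 1  # legitimate site wrongly flagged
            else:
                tn += 1  # legitimate site correctly passed
    tp_rate = tp / (tp + fn) if (tp + fn) else 0.0
    fp_rate = fp / (fp + tn) if (fp + tn) else 0.0
    return tp_rate, fp_rate


# Toy labels for illustration only (1 = phishing, 0 = legitimate).
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 0, 0, 1]

tp_rate, fp_rate = rates(y_true, y_pred)
print(f"TP-rate: {100 * tp_rate:.1f}%  FP-rate: {100 * fp_rate:.1f}%")
```

A higher TP-rate means more phishing sites are caught, while a lower FP-rate means fewer legitimate sites are wrongly flagged, which matches the two goals stated in the abstract.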

Item Type: Article
Impact Factor: cited By 0
Uncontrolled Keywords: Computer crime; Cybersecurity; Decision trees; Machine learning, Cascade generalization; Cyber security; False positive; Functional tree; Hybridisation; Machine learning approaches; Medical services; Performance; Phishing attacks; Phishing detections, Websites
Depositing User: Ms Sharifah Fahimah Saiyed Yeop
Date Deposited: 25 Mar 2022 01:33
Last Modified: 25 Mar 2022 01:33
URI: http://scholars.utp.edu.my/id/eprint/29328
