Reference Hub4
Improving Auto-Detection of Phishing Websites using Fresh-Phish Framework

Improving Auto-Detection of Phishing Websites using Fresh-Phish Framework

Hossein Shirazi, Kyle Haefner, Indrakshi Ray
Copyright: © 2018 |Volume: 9 |Issue: 1 |Pages: 14
ISSN: 1947-8534|EISSN: 1947-8542|EISBN13: 9781522543787|DOI: 10.4018/IJMDEM.2018010104
Cite Article Cite Article

MLA

Shirazi, Hossein, et al. "Improving Auto-Detection of Phishing Websites using Fresh-Phish Framework." IJMDEM vol.9, no.1 2018: pp.1-14. http://doi.org/10.4018/IJMDEM.2018010104

APA

Shirazi, H., Haefner, K., & Ray, I. (2018). Improving Auto-Detection of Phishing Websites using Fresh-Phish Framework. International Journal of Multimedia Data Engineering and Management (IJMDEM), 9(1), 1-14. http://doi.org/10.4018/IJMDEM.2018010104

Chicago

Shirazi, Hossein, Kyle Haefner, and Indrakshi Ray. "Improving Auto-Detection of Phishing Websites using Fresh-Phish Framework," International Journal of Multimedia Data Engineering and Management (IJMDEM) 9, no.1: 1-14. http://doi.org/10.4018/IJMDEM.2018010104

Export Reference

Mendeley
Favorite Full-Issue Download

Abstract

Denizens of the Internet are under a barrage of phishing attacks of increasing frequency and sophistication. Emails accompanied by authentic looking websites are ensnaring users who, unwittingly, hand over their credentials compromising both their privacy and security. Methods such as the blacklisting of these phishing websites become untenable and cannot keep pace with the explosion of fake sites. Detection of nefarious websites must become automated and be able to adapt to this ever-evolving form of social engineering. There is an improved framework that was previously implemented called “Fresh-Phish”, for creating current machine-learning data for phishing websites. The improved framework uses a total of 28 different website features that query using python, then a large labeled dataset is built and analyze over several machine learning classifiers against this dataset to determine which is the most accurate. This modified framework improves the accuracy of modeling those features by using integer rather than binary values where possible. This article analyzes not just the accuracy of the technique, but also how long it takes to train the model.