OPTIMASI HYPERPARAMETER PADA MODEL REGRESI LOGISTIK UNTUK MENINGKATKAN AKURASI DETEKSI PHISHING BERBASIS KONTEN DAN METADATA

Aditya Putra Bahri

OPTIMASI HYPERPARAMETER PADA MODEL REGRESI LOGISTIK UNTUK MENINGKATKAN AKURASI DETEKSI PHISHING BERBASIS KONTEN DAN METADATA

Repository Analytics

Statistic Details

Updated data

17Viewes

0Downloaded

17Accessed per month

2Countries

Files

4332101044_JURNAL .pdf (625.94 KB)

LAMPIRAN - HALAMAN PENGESAHAN (1).pdf (456.29 KB)

No.FO.31.6.1-V1 Format Pernyataan Publikasi Mahasiswa (1) (3)_signed.pdf (338.43 KB)

Date

2026-01-17

Authors

Aditya Putra Bahri

Publisher

Politeknik Negri Batam

Abstract

This study evaluates and optimizes the performance of the Logistic Regression algorithm for phishing email detection. The primary challenge lies in balancing the use of technical features (metadata) and textual features (content) to prevent overfitting. This research utilizes a large-scale combined dataset consisting of 102,486 emails, comprising the Phishing dataset (Naser Abdullah Alam) and the Valid dataset (Enron), processed using TF-IDF vectorization and metadata feature extraction techniques. Unlike previous studies, this research implements hyperparameter optimization (C regularization) to assess model stability. Experimental results demonstrate that the Content-Only model yields the most superior and stable performance, achieving an Area Under Curve (AUC) of 0.99 and an F1-Score exceeding 95.61%. In contrast, the incorporation of metadata features in the Hybrid model led to a decline in accuracy at high regularization values, indicating that metadata acts as noise. The study concludes that Logistic Regression utilizing content features alone is sufficiently robust and efficient for phishing detection, eliminating the need for the added complexity of metadata.

Keywords

SOCIAL SCIENCES::Statistics, computer and systems science::Informatics, computer and systems science::Information technology

Citation

APA

URI

https://repository.polibatam.ac.id//handle/PL29/4707

Collections

D4 Rekayasa Keamanan Siber

Full item page

OPTIMASI HYPERPARAMETER PADA MODEL REGRESI LOGISTIK UNTUK MENINGKATKAN AKURASI DETEKSI PHISHING BERBASIS KONTEN DAN METADATA

Statistic Details

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections

Endorsement

Review

Supplemented By

Referenced By