OPTIMASI HYPERPARAMETER PADA MODEL REGRESI LOGISTIK UNTUK MENINGKATKAN AKURASI DETEKSI PHISHING BERBASIS KONTEN DAN METADATA

dc.contributor.advisorAntoni Haikal
dc.contributor.authorAditya Putra Bahri
dc.date.accessioned2026-05-19T17:55:03Z
dc.date.issued2026-01-17
dc.description.abstractThis study evaluates and optimizes the performance of the Logistic Regression algorithm for phishing email detection. The primary challenge lies in balancing the use of technical features (metadata) and textual features (content) to prevent overfitting. This research utilizes a large-scale combined dataset consisting of 102,486 emails, comprising the Phishing dataset (Naser Abdullah Alam) and the Valid dataset (Enron), processed using TF-IDF vectorization and metadata feature extraction techniques. Unlike previous studies, this research implements hyperparameter optimization (C regularization) to assess model stability. Experimental results demonstrate that the Content-Only model yields the most superior and stable performance, achieving an Area Under Curve (AUC) of 0.99 and an F1-Score exceeding 95.61%. In contrast, the incorporation of metadata features in the Hybrid model led to a decline in accuracy at high regularization values, indicating that metadata acts as noise. The study concludes that Logistic Regression utilizing content features alone is sufficiently robust and efficient for phishing detection, eliminating the need for the added complexity of metadata.
dc.identifier.citationAPA
dc.identifier.kodeprodiKODEPRODI57302#Rekayasa Keamanan Siber
dc.identifier.nidnNIDN8942560023
dc.identifier.nimNIM4332101044
dc.identifier.urihttps://repository.polibatam.ac.id//handle/PL29/4707
dc.publisherPoliteknik Negri Batam
dc.subjectSOCIAL SCIENCES::Statistics, computer and systems science::Informatics, computer and systems science::Information technology
dc.titleOPTIMASI HYPERPARAMETER PADA MODEL REGRESI LOGISTIK UNTUK MENINGKATKAN AKURASI DETEKSI PHISHING BERBASIS KONTEN DAN METADATA
dc.title.alternativeOPTIMASI HYPERPARAMETER PADA MODEL REGRESI LOGISTIK UNTUK MENINGKATKAN AKURASI DETEKSI PHISHING BERBASIS KONTEN DAN METADATA
dc.typeArticle

Files

Original bundle

Now showing 1 - 3 of 3
Loading...
Thumbnail Image
Name:
4332101044_JURNAL .pdf
Size:
625.94 KB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
LAMPIRAN - HALAMAN PENGESAHAN (1).pdf
Size:
456.29 KB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
No.FO.31.6.1-V1 Format Pernyataan Publikasi Mahasiswa (1) (3)_signed.pdf
Size:
338.43 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: