Статья

Online Recruitment Fraud (ORF) Detection Using Deep Learning Approaches

Natasha AkramSchool of Electrical Engineering and Computer Science (SEECS), National University of Sciences and Technology (NUST), Islamabad, PakistanRabia IrfanSchool of Electrical Engineering and Computer Science (SEECS), National University of Sciences and Technology (NUST), Islamabad, PakistanAhmad Sami Al‐ShamaylehDepartment of Data Science and Artificial Intelligence, Faculty of Information Technology, Al-Ahliyya Amman University, Amman, JordanAdila KousarSchool of Electrical Engineering and Computer Science (SEECS), National University of Sciences and Technology (NUST), Islamabad, PakistanAbdul QaddosDepartment of Computer Science, COMSATS University Islamabad (CUI), Islamabad, PakistanMuhammad ImranDepartment of Computer Science, COMSATS University Islamabad (CUI), Islamabad, PakistanAdnan AkhunzadaDepartment of Data and Cybersecurity, College of Computing and Information Technology, University of Doha for Science and Technology, Doha, Qatar

2024en

ABI

Аннотация

Most companies nowadays are using digital platforms for the recruitment of new employees to make the hiring process easier. The rapid increase in the use of online platforms for job posting has resulted in fraudulent advertising. The scammers are making money through fraudulent job postings. Online recruitment fraud has emerged as an important issue in cybercrime. Therefore, it is necessary to detect fake job postings to get rid of online job scams. In recent studies, traditional machine learning and deep learning algorithms have been implemented to detect fake job postings; this research aims to use two transformer-based deep learning models, i.e., Bidirectional Encoder Representations from Transformers (BERT) and Robustly Optimized BERT-Pretraining Approach (RoBERTa) to detect fake job postings precisely. In this research, a novel dataset of fake job postings is proposed, formed by the combination of job postings from three different sources. Existing benchmark datasets are outdated and limited due to knowledge of specific job postings, which limits the existing models’ capability in detecting fraudulent jobs. Hence, we extend it with the latest job postings. Exploratory Data Analysis (EDA) highlights the class imbalance problem in detecting fake jobs, which tends the model to act aggressively toward the minority class. Responding to overcome this problem, the work at hand implements ten top-performing Synthetic Minority Oversampling Technique (SMOTE) variants. The models’ performances balanced by each SMOTE variant are analyzed and compared. All implemented approaches are performed competitively. However, BERT+SMOBD SMOTE achieved the highest balanced accuracy and recall of about 90%.

Перевод пока недоступен

Идентификаторы

DOI: 10.1109/access.2024.3435670

Цитирования и источники

Цитирований: 5Использованных источников: 0

Показатели — AkademScholar