Modified adaptive synthetic SMOTE to improve classification performance in imbalanced datasets
dc.contributor.author | Gameng, Hazel A. | |
dc.contributor.author | Gerardo, Bobby D. | |
dc.contributor.author | Medina, Ruji P. | |
dc.date.accessioned | 2024-08-16T06:35:04Z | |
dc.date.available | 2024-08-16T06:35:04Z | |
dc.date.issued | 2020-06-16 | |
dc.identifier.citation | Gameng, H. A., Gerardo, B. B., & Medina, R. P. (2019). Modified adaptive synthetic SMOTE to improve classification performance in imbalanced datasets. In 2019 IEEE 6th International Conference on Engineering Technologies and Applied Sciences (ICETAS) (pp. 1-5). Kuala Lumpur, Malaysia: IEEE. | en |
dc.identifier.isbn | 978-1-7281-4082-7 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14353/609 | |
dc.description.abstract | The oversampling technique in the data preprocessing has been utilized to mitigate the imbalanced data problem in the real research scenario. This imbalance may reduce the ability of classification algorithms to recognize cases of interest leading to misclassification of positive samples as negative class or the false positive generation. Synthetic Minority Oversampling Technique (SMOTE) is one of the oversampling techniques existing and the Adaptive Synthetic (Adasyn) SMOTE is one of its many variants. K-Nearest Neighbor (KNN) is incorporated in Adasyn. In this study, Manhattan distance is applied in the KNN computations. This modified Adasyn was evaluated in terms of its effectiveness in the performance measure of overall accuracy, precision, recall and F1 measure on the six imbalanced datasets using logistic regression as the classification algorithm. The modified Adasyn dominated over SMOTE and the original Adasyn by 66.67 percent of the total performance metric count. It leads the accuracy and recall count with 4 out of 6, precision count with 3 out of 6, and the F1 measure count with 5 over 6. Thus, proving that the modified Adasyn can provide an efficient solution in decreasing misclassification on imbalanced datasets. | en |
dc.description.sponsorship | Acknowledgment: The authors are appreciative to the reviewers for the positive remarks and suggestions. This effort is supported by Philippine Government through the Commission on Higher Education. The authors are grateful for the financial provision provided on this study. | en |
dc.language.iso | en | en |
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | en |
dc.subject | Adaptive Synthetic SMOTE | en |
dc.subject | Classification | en |
dc.subject | Imbalanced dataset | en |
dc.subject | Manhattan Distance | en |
dc.subject | SMOTE | en |
dc.subject | Classification performance | en |
dc.subject | ADASYN | en |
dc.subject | Oversampling methods | en |
dc.subject.lcsh | Logistic regression analysis | en |
dc.subject.lcsh | Nearest neighbor analysis (Statistics) | en |
dc.subject.lcsh | Algorithms | en |
dc.subject.lcsh | Information storage and retrieval systems--Classification | en |
dc.subject.lcsh | Problem solving--Data processing | en |
dc.title | Modified adaptive synthetic SMOTE to improve classification performance in imbalanced datasets | en |
dc.type | Conference paper | en |
dcterms.accessRights | Limited public access | en |
dc.citation.firstpage | 1 | en |
dc.citation.lastpage | 5 | en |
dc.identifier.doi | 10.1109/ICETAS48360.2019.9117287 | |
dc.citation.conferencetitle | 6th IEEE International Conference on Engineering, Technologies and Applied Sciences, ICETAS 2019 | en |
local.isIndexedBy | Scopus | en |
local.subject.agrovoc | algorithms | en |
local.subject.agrovoc | data processing | en |
local.subject.agrovoc | classification systems | en |
Dateien zu dieser Ressource
Dateien | Größe | Format | Anzeige |
---|---|---|---|
Zu diesem Dokument gibt es keine Dateien. |
Das Dokument erscheint in:
-
Conference Papers [14]
Conference papers published externally, written by WVSU faculty members, staff, and students