Machine learning in detecting and interpreting business incubator success data and datasets
Abstract
This research proposes an architectural model that combines several machine learning (ML) algorithms, heatmap correlation, and ML interpretation. The algorithms range from K-nearest neighbors (KNN) to adaptive boosting (AdaBoost), heatmap correlation is used to examine the relationships between variables, and select K-best is used to interpret the results. The results show that several ML algorithms in the proposed model, such as AdaBoost, CatBoost, and XGBoost, achieve accuracy, precision, and recall of 94% and an F1-score of 93%. In terms of computing time, however, AdaBoost is the best at 0.081 s. Interpreting AdaBoost with select K-best identifies “last revenue” and “first revenue” as the best features, with k feature values of 0.58 and 0.196; these features influence the success of the business. The results show that the proposed model successfully combines classification, correlation, and interpretation. The proposed model still has weaknesses: the ML models are outdated and the interpretation features are limited. Future research might improve on it with the latest ML models and interpretation methods, for example ML algorithms that are more robust to data uncertainty and interpretation of results on wider data.
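The correlation and feature-selection steps summarized above can be illustrated with a minimal sketch. It is not the authors' implementation: the data are synthetic, the column "team_size" and the helper names are hypothetical, and absolute Pearson correlation with the label is used as a stand-in scoring function, since the abstract does not state which score select K-best was run with.

```python
import math
import random


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy) if sx and sy else 0.0


def correlation_matrix(columns):
    """Pairwise correlations between features (the numbers behind a heatmap)."""
    names = list(columns)
    return {a: {b: pearson(columns[a], columns[b]) for b in names} for a in names}


def select_k_best(columns, target, k):
    """Score each feature on its own against the label and keep the top k.

    Assumption: the score is |Pearson r| with the label, chosen only for
    illustration; it is not necessarily the score used in the paper.
    """
    scores = {name: abs(pearson(vals, target)) for name, vals in columns.items()}
    ranked = sorted(scores, key=scores.get, reverse=True)
    return ranked[:k], scores


# Synthetic data for illustration only; none of these values come from the paper.
rng = random.Random(0)
n = 500
first_revenue = [rng.gauss(50, 10) for _ in range(n)]
last_revenue = [f + rng.gauss(20, 10) for f in first_revenue]
team_size = [rng.randint(1, 20) for _ in range(n)]  # hypothetical, unrelated feature
success = [1 if lr + rng.gauss(0, 5) > 70 else 0 for lr in last_revenue]

features = {
    "first_revenue": first_revenue,
    "last_revenue": last_revenue,
    "team_size": team_size,
}

matrix = correlation_matrix(features)
best, scores = select_k_best(features, success, k=2)

print("selected features:", best)
for name in features:
    print(f"{name}: score={scores[name]:.3f}")
```

Because the synthetic label is derived from the revenue columns, the two revenue features score well above the unrelated one here. On real incubator data the ranking would depend on the dataset and on the scoring function chosen.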
Keywords
AdaBoost; Feature; Interpretation; Machine learning; Model architecture; Select K-best
DOI: http://doi.org/10.11591/ijict.v14i2.pp446-456
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
The International Journal of Informatics and Communication Technology (IJ-ICT)
p-ISSN 2252-8776, e-ISSN 2722-2616
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).