CLASSIFYING IRIS FLOWER DATA USING ALGORITHMS NAÏVE BAYES, RANDOM FOREST AND KNN | Núi | TNU Journal of Science and Technology

CLASSIFYING IRIS FLOWER DATA USING ALGORITHMS NAÏVE BAYES, RANDOM FOREST AND KNN

About this article

Received: 03/06/21                Revised: 02/07/21                Published: 14/07/21

Authors

Nguyen Van Nui Email to author, TNU - University of Information and Communication Technology

Abstract


Iris is a beautiful flower, representing luck and love courage, loyalty, and wisdom. Therefore, the classification and accurate prediction of Iris flower brings many important meanings in practice. Although there have been many scientific publications related to classification and prediction of Iris flowers, the classification and prediction performance of these publications still have certain limitations that need to be studied for further improvement. In this paper, the author proposes model to classify and predict Iris flowers on the basis of the application of the Weka toolkit and the Naïve Bayes, Random Forest and KNN algorithms. The results reveal that all three algorithms above give high accuracy (over 95%), so it is suitable for building model to classify Iris flowers. However, the two algorithms, Random Forest and KNN (k=3), show better stability and objectivity than the Naïve Bayes algorithm.

Keywords


Data classifying Naïve Bayes; Random Forest; KNN; Iris; Iris flower

References


[1] J. P. Pinto, S. Kelur, and J. Shetty, “Iris Flower Species Identification Using Machine Learning Approach,” 2018 4th International Conference for Convergence in Technology (I2CT), SDMIT Ujire, Mangalore, India. Oct 27-28, 2018.

[2] M. Swain, S. K. Dash, S. Dash, and A. Mohapatra, “An approach for Iris Plant Classification Using Neural Network,” International Journal on Soft Computing (ÍC), vol. 3, no. 1, pp. 79-89, February 2012.

[3] C. Geetha, R. Ram, and N. Vali, “Iris-flower Classification,” Eurasian Journal of Analytical Chemistry, vol. 12, no. 3, pp. 51-63, 2017.

[4] A. Eldem, H. Eldem, and D. Üstün, A model of Deep Neural Network for Iris Classification with Different Activation Functions, 978-1-5386-6878-8/18/$31.00 ©2018 IEEE, 2018.

[5] T. Cao, Some examples of classification using SOM and MLP Neural Network, July 11, 2013.

[6] T. X. Tran and V. N. Nguyen, "Classifying protein s-farnesylation sites with support vector machine and decision tree," TNU Journal of Science and Technology, vol. 204, no. 11, pp. 149-154, 2019.

[7] H. J. Kao, V. N. Nguyen, K. Y. Huang, W. C. Chang, and T. Y. Lee, "SuccSite: Incorporating Amino Acid Composition and Informative k-spaced Amino Acid Pairs to Identify Protein Succinylation Sites," Genomics, Proteomics and Bioinformatics (Q1, SCI, IF: 6.615), June 2020.

[8] R. A. Fisher, “The Use of Multiple Measurements in Taxonomic Problems,” Annals of Eugenics, vol. 7, pp. 179-188, 1936.

[9] E. Anderson, “The Species Problem in Iris,” Annals of the Missouri Botanical Garden, vol. 23, no. 3, pp. 457-509, 1936.

[10] D. Dua and C. Graff, UCI Machine Learning Repository. Irvine, CA: University of California, School of Information and Computer Science, 2019.

[11] K. Lee and V. N. Nguyen, "SNARE-CNN: a 2D convolutional neural network architecture to identify SNARE proteins from high-throughput sequencing data," Peer J Computer Science, vol. 5, 2019, Art. no. e177, doi: https://doi.org/10.7717/peerj-cs.177.

[12] V. N. Nguyen and H. M. Nguyen, “Identification of protein S-Farnesyl cysteine prenylation sites based on substrate specificities,” International Journal of Science and Research (IJSR), vol. 7, no. 6, pp. 758-763, June 2018.

[13] V. N. Nguyen, T. X. Tran, H. M. Nguyen, H. T. Nguyen, and T. Y. Lee, “A new schema to identify S-farnesyl cysteine prenylation sites with substrate motifs,” in Advances in Intelligent Systems and Computing ICTA 2016, in Advances in Information and Communication Technology, vol. 538, Springer, Cham., 2017, doi: 10.1007/978-3-319-49073-1.

[14] V. M. Bui and V. N. Nguyen, "The prediction of Succinylation site in protein by analyzing amino acid composition" in Advances in Information and Communication Technology. ICTA 2016, in Advances in Intelligent Systems and Computing, vol. 538, Springer, Cham., doi: 10.1007/978-3-319-49073-1.




DOI: https://doi.org/10.34238/tnu-jst.4594

Refbacks

  • There are currently no refbacks.
TNU Journal of Science and Technology
Rooms 408, 409 - Administration Building - Thai Nguyen University
Tan Thinh Ward - Thai Nguyen City
Phone: (+84) 208 3840 288 - E-mail: jst@tnu.edu.vn
Based on Open Journal Systems
©2018 All Rights Reserved