Genetic Programming for Medicinal Plant Family Identification System
AbstractInformation about medicinal plants that is available in text documents is generally quite easy to access, however, one needs some efforts to use it. This research was aimed at utilizing crucial information taken from a text document to identify the family of several species of medicinal plants using a heuristic approach, i.e. genetic programming. Each of the species has its unique features. The genetic program puts the characteristics or special features of each family into a tree form. There are a number of processes involved in the investigated method, i.e. data acquisition, booleanization, grouping of training and test data, evaluation, and analysis. The genetic program uses a training process to select the best individual, initializes a generate-rule process to create several individuals and then executes a fitness evaluation. The next procedure is a genetic operation process, which consists of tournament selection to choose the best individual based on a fitness value, the crossover operation and the mutation operation. These operations have the purpose of complementing the individual. The best individual acquired is the expected solution, which is a rule for classifying medicinal plants. This process produced three rules, one for each plant family, displaying a feature structure that distinguishes each of the families from each other. The genetic program then used these rules to identify the medicinal plants, achieving an average accuracy of 86.47%.
Bappenas, Indonesia Biodiversity and Action Plan 2003-2020, Jakarta, http://www.bappenas.go.id/node/82/406/strategi-dan-rencana-aksi-keane karagaman-hayati-indonesia--indonesian-biodiversity-strategy-and-action -plan---ibsap-nglish-version/ (7 December 2012).
Groombridge, B., & Jenkins, M., World Atlas of Biodiversity, Earth's Living Resources in the 21st Century, Berkeley University of California Press, https://archive.org/details/worldatlasofbiod02groo (10 September 2012)
Stadler, P.F., Fitness Landscape, https://www.bioinf.uni-leipzig.de/~studla/Publications/PREPRINTS/01-pfs-004.pdf (15 Septem-ber 2012).
Yuningsih, F., Image Searching Using Heuristic Method for Image Retrieval System, Master Thesis, Department of Computer Science, Faculty of Mathematics and Natural Sciences, Bogor Agricultural University, Bogor, 2009. (Text in Bahasa Indonesia).
Walker, M., Introduction to Genetic Programming, http://www.cs. montana.edu/~bwall/cs580/introduction_to_gp.pdf (9 September 2012).
Yuan, J.Y., Lin, J.Y., RenKe, H. & Yang, W.P., Learning to Rank for Information Retrieval Using Genetic Programming, http://www2.sdufe. edu.cn/wangsq/teaching/2012-09_CI/readings6-RankGP.pdf (13 Februa-ry 2012).
Bramer, M., Principles of Data Mining, London, Springer. 2007.
Lukas, I.A., The Ending of A King-Rook-King (KRK) Chess Gameby Using Genetic Programming, National Conference of System and Informatics, November 15th, 2008, Bali, Indonesia KNS, pp. 328-334, 2008. (Text in Bahasa Indonesia)
Koza, J.R., Genetic Programming On the Programming of Computers by Means of Natural Selection. London, MIT Press, 1992.
Poli, R., Langdon W.B., & McPhee, N.F., A Field Guide to Genetic Programming, Creative Commons, 2008.
Carvalho, M.G., Laender, A.H., Goncalves, M.A., & Silvia, A.S., A Genetic Programming Approach to Record Deduplication, IEEE Transactions on Knowledge and Data Engineering, 24(3), pp. 399-412, 2012.
Davis, J. & Goadrich, M., The Relationship Between Precision-Recall and ROC Curves, International Conference on Machine Learning. Appearing in Proceedings of the 23rd International Conference on Machine Learning, pp. 233-240, 2006.
Keng, H., Orders and Family of Malayan Seed Plants, Singapore University Press, 1978.
Balgooy, M.M.J.V., Malesian Seed Plants, Portraits of Non-Tree Families, 3nd ed, National Herbarium Nederland-University Leiden Branch, 2001.
Tjitrosoepomo, G., Morphology of Plants, Yogyakarta, Indonesia: UGM University Press, 2009. (Text in Bahasa Indonesia)