BibTeX Citation Data:
@article{JOIV1813,
  author = {Wirarama Wedashwara and Budi Irmawati and Heri Wijayanto and I Wayan Agus Arimbawa and Vandha Pradwiyasma Widartha},
  title = {Text Classification Using Genetic Programming with Implementation of Map Reduce and Scraping},
  journal = {JOIV : International Journal on Informatics Visualization},
  volume = {7},
  number = {2},
  year = {2023},
  keywords = {Text Classification; Genetic Programming; Web Scraping; Map-reduce},
  abstract = {Classification of text documents on online media is a big data problem and requires automation. Text classification accuracy can decrease if there are many ambiguous terms between classes. Hadoop Map Reduce is a parallel processing framework for big data that has been widely used for text processing on big data. The study presented text classification using genetic programming by pre-processing text using Hadoop map-reduce and collecting data using web scraping. Genetic programming is used to perform association rule mining (ARM) before text classification to analyze big data patterns. The data used are articles from science-direct with the three keywords. This study aims to perform text classification with ARM-based data pattern analysis and data collection system through web-scraping, pre-processing using map-reduce, and text classification using genetic programming. Through web scraping, data has been collected by reducing duplicates as much as 17718. Map-reduce has tokenized and stopped-word removal with 36639 terms with 5189 unique terms and 31450 common terms. Evaluation of ARM with different amounts of multi-tree data can produce more and longer rules and better support. The multi-tree also produces more specific rules and better ARM performance than a single tree. Text classification evaluation shows that a single tree produces better accuracy (0.7042) than a decision tree (0.6892), and the lowest is a multi-tree (0.6754). The evaluation also shows that the ARM results are not in line with the classification results, where a multi-tree shows the best result (0.3904) from the decision tree (0.3588), and the lowest is a single tree (0.356).},
  issn = {2549-9904},
  pages = {384--390},
  doi = {10.30630/joiv.7.2.1813},
  url = {https://joiv.org/index.php/joiv/article/view/1813}
}
RefWorks Citation Data:
@article{{JOIV}{1813},
  author = {Wedashwara, W., Irmawati, B., Wijayanto, H., Arimbawa, I., Widartha, V.},
  title = {Text Classification Using Genetic Programming with Implementation of Map Reduce and Scraping},
  journal = {JOIV : International Journal on Informatics Visualization},
  volume = {7},
  number = {2},
  year = {2023},
  doi = {10.30630/joiv.7.2.1813},
  url = {}
}

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
__________________________________________________________________________
JOIV : International Journal on Informatics Visualization
ISSN 2549-9610 (print) | 2549-9904 (online)
Organized by the Department of Information Technology, Politeknik Negeri Padang; the Institute of Visual Informatics, UKM; and the Soft Computing and Data Mining Centre, UTHM
W : http://joiv.org
E : joiv@pnp.ac.id, hidra@pnp.ac.id, rahmat@pnp.ac.id