การเปรียบเทียบวิธีการแก้ปัญหาข้อมูลไม่สมดุล สำหรับการจำแนกกลุ่มรายได้ของผู้ประกอบการร้านยาประเภท ข.ย.1

นพมาศ อัครจันทโชติ; ดิเรก พนิตสุภากมล; Noppamas Akarachantachote; Direk Panitsupakamol

Please use this identifier to cite or link to this item: https://has.hcu.ac.th/xmlui/handle/123456789/2995

Title:	การเปรียบเทียบวิธีการแก้ปัญหาข้อมูลไม่สมดุล สำหรับการจำแนกกลุ่มรายได้ของผู้ประกอบการร้านยาประเภท ข.ย.1
Other Titles:	Comparison of Imbalanced Data Problem Solving for Income Classification of Type I Pharmacies Entrepreneur
Authors:	นพมาศ อัครจันทโชติ ดิเรก พนิตสุภากมล Noppamas Akarachantachote Direk Panitsupakamol Huachiew Chalermprakiet University. Faculty of Science and Technology Huachiew Chalermprakiet University. Faculty of Science and Technology. Undergraduate Research
Keywords:	ข้อมูลไม่สมดุล Imbalanced data การสุ่มตัวอย่าง (สถิติ) Sampling (Statistics) การจำแนก Classification รายได้ Income ร้านขายยา Drugstores วิทยาศาสตร์และเทคโนโลยี
Issue Date:	2007
Abstract:	ข้อมูลไม่สมดุลพบได้ในหลายสถานการณ์ ซึ่งโดยทั่วไปวิธีการจำแนกประเภทข้อมูลมีแนวโน้มที่จะทำนายข้อมูลเป็นกลุ่มส่วนมาก อันจะส่งผลถึงประสิทธิภาพที่ต่ำในการทำนายกลุ่มส่วนน้อย การสุ่มตัวอย่างเพิ่มสำหรับกลุ่มส่วนน้อยเป็นแนวทางหนึ่งในการจัดการกับปัญหาการจำแนกข้อมูลไม่สมดุล วัตถุประสงค์ของการวิจัยนี้ 1) เพื่อเปรียบเทียบประสิทธิภาพ การแก้ปัญหาข้อมูลไม่สมดุลด้วยการสุ่มตัวอย่างซ้ำระหว่างวิธีการสุ่มตัวอย่างเพิ่มข้อมูลเริ่มต้นอย่างสุ่ม และการสุ่มตัวอย่าง เพิ่มกลุ่มส่วนน้อยด้วยการสังเคราะห์ 2) เพื่อเปรียบเทียบประสิทธิภาพในการจำแนกข้อมูลระหว่างการถดถอยลอจิสติก และ ต้นไม้ตัดสินใจ สำหรับการจำแนกกลุ่มรายได้ผู้ประกอบการร้านยาประเภท ข.ย.1 โดยค่าวัดประสิทธิภาพที่ใช้ในการเปรียบเทียบได้แก่ ค่าความแม่นยำ อัตราความถูกต้องในการทำนายกลุ่มส่วนน้อย อัตราความถูกต้องในการทำนายกลุ่มส่วนมาก และค่าการวัดเอฟ ผลที่ได้ปรากฏว่า การสุ่มตัวอย่างเพิ่มกลุ่มส่วนน้อยด้วยการสังเคราะห์มีประสิทธิภาพสูงที่สุดใน การจำแนกสำหรับทุกวิธีการจำแนกประเภท Imbalanced data are frequently found in many situations. General classification techniques tend to biased toward the majority class. This causes low performance in predicting minority class. Oversampling for minority class is a strategy to handle class imbalance classification. This research intends 1) to compare the efficiency of resampling method between Random over-sampling (ROS) and Synthetic Minority Over-sampling TEchnique (SMOTE), and 2) to compare the efficiency of classifiers between logistic regression and decision tree in solving the imbalance data of Type I Pharmacies Entrepreneur. Performance measures for this this comparison are accuracy, true positive rate, true negative rate, and measure. The results show that over-sampling by SMOTE has high performance on classifying the data from minority class for all classification techniques.
Description:	การประชุมเสนอผลงานวิจัยระดับชาติ มหาวิทยาลัยสุโขทัยธรรมาธิราช ครั้งที่ 9 (The 9th STOU National Research Conference) วันที่ 29 พฤศจิกายน 2562 ณ อาคารเฉลิมพระเกียรติ 80 พรรษา 5 ธันวาคม 2550 : หน้า 1577-1586. สามารถเข้าถึงบทความฉบับเต็ม (Full text) ได้ที่ : https://www.stou.ac.th/thai/grad_stdy/Masters/%E0%B8%9D%E0%B8%AA%E0%B8%AA/research/9th/FullPaper/SS/Poster/P-SS%20012%20นางสาวนพมาศ%20อัครจันทโชติ.pdf
URI:	https://has.hcu.ac.th/jspui/handle/123456789/2995
Appears in Collections:	Science and Technology - Proceeding Document

Files in This Item:

File	Description	Size	Format
Comparison-of-Imbalanced-Data-Problem-Solving-for-Income-Classification.pdf		100.24 kB	Adobe PDF	View/Open

Show full item record

HAS(Huachiew Archiving System) Welcome to our digital repository of Huachiew Chalermprakiet University

HAS(Huachiew Archiving System)
Welcome to our digital repository of Huachiew Chalermprakiet University