TY - JOUR
T1 - Feature selection methods and sampling techniques to financial distress prediction for Vietnamese listed companies
AU - Vu, Loan Thi
AU - Vu, Lien Thi
AU - Nguyen, Nga Thu
AU - Do, Phuong Thi Thuy
AU - Dao, Daniel
N1 - © Loan Thi Vu, Lien Thi Vu, Nga Thu Nguyen, Phuong Thi Thuy Do, Dong Phuong Dao, 2019
Loan Thi Vu, Lien Thi Vu, Nga Thu Nguyen, Phuong Thi Thuy Do and Dong Phuong Dao (2019). Feature selection methods and sampling techniques to financial distress prediction for Vietnamese listed companies. Investment Management and Financial Innovations, 16(1), 276-290. doi:10.21511/imfi.16(1).2019.22
PY - 2019/3/25
Y1 - 2019/3/25
N2 - The research is taken to integrate the effects of variable selection approaches, as well as sampling techniques, to the performance of a model to predict the financial distress for companies whose stocks are traded on securities exchanges of Vietnam. A firm is financially distressed when its stocks are delisted as requirement from Vietnam Stock Exchange because of making a loss in 3 consecutive years or having accumulated a loss greater than the company’s equity. There are 12 models, constructed differently in feature selection methods, sampling techniques, and classifiers. The feature selection methods are factor analysis and F-score selection, while 3 sets of data samples are chosen by choice-based method with different percentages of financially distressed firms. In terms of classifying technique, logistic regression together with SVM are used in these models. Data are collected from listed firms in Vietnam from 2009 to 2017 for 1, 2 and 3 years before the announcement of their delisting requirement. The experiment’s results highlight the outperformance of the SVM model with F-score selection method in a data sample containing the highest percentage of non-financially distressed firms.
AB - The research is taken to integrate the effects of variable selection approaches, as well as sampling techniques, to the performance of a model to predict the financial distress for companies whose stocks are traded on securities exchanges of Vietnam. A firm is financially distressed when its stocks are delisted as requirement from Vietnam Stock Exchange because of making a loss in 3 consecutive years or having accumulated a loss greater than the company’s equity. There are 12 models, constructed differently in feature selection methods, sampling techniques, and classifiers. The feature selection methods are factor analysis and F-score selection, while 3 sets of data samples are chosen by choice-based method with different percentages of financially distressed firms. In terms of classifying technique, logistic regression together with SVM are used in these models. Data are collected from listed firms in Vietnam from 2009 to 2017 for 1, 2 and 3 years before the announcement of their delisting requirement. The experiment’s results highlight the outperformance of the SVM model with F-score selection method in a data sample containing the highest percentage of non-financially distressed firms.
KW - financial distress prediction
KW - feature selection
KW - sampling technique
KW - logistic regression model
KW - support vector machine (SVM)
KW - variable selection approaches
KW - financially distressed firms
U2 - 10.21511/imfi.16(1).2019.22
DO - 10.21511/imfi.16(1).2019.22
M3 - Article
VL - 16
SP - 276
EP - 290
JO - Investment Management and Financial Innovations
JF - Investment Management and Financial Innovations
IS - 1
ER -