uEFS: An efficient and comprehensive ensemble-based feature selection methodology to select informative features

Ali, M; Ali, SI; Kim, D; Hur, T; Bang, J; Lee, S; Kang, Byeong; Hussain, M

128254 - An efficient and comprehensive ensemble-based feature selection methodology to select informative features.pdf (4.61 MB)

uEFS: An efficient and comprehensive ensemble-based feature selection methodology to select informative features

journal contribution

posted on 2023-05-19, 21:10 authored by Ali, M, Ali, SI, Kim, D, Hur, T, Bang, J, Lee, S, Byeong KangByeong Kang, Hussain, M

Feature selection is considered to be one of the most critical methods for choosing appropriate features from a larger set of items. This task requires two basic steps: ranking and filtering. Of these, the former necessitates the ranking of all features, while the latter involves filtering out all irrelevant features based on some threshold value. In this regard, several feature selection methods with well-documented capabilities and limitations have already been proposed. Similarly, feature ranking is also nontrivial, as it requires the designation of an optimal cutoff value so as to properly select important features from a list of candidate features. However, the availability of a comprehensive feature ranking and a filtering approach, which alleviates the existing limitations and provides an efficient mechanism for achieving optimal results, is a major problem. Keeping in view these facts, we present an efficient and comprehensive univariate ensemble-based feature selection (uEFS) methodology to select informative features from an input dataset. For the uEFS methodology, we first propose a unified features scoring (UFS) algorithm to generate a final ranked list of features following a comprehensive evaluation of a feature set. For defining cutoff points to remove irrelevant features, we subsequently present a threshold value selection (TVS) algorithm to select a subset of features that are deemed important for the classifier construction. The uEFS methodology is evaluated using standard benchmark datasets. The extensive experimental results show that our proposed uEFS methodology provides competitive accuracy and achieved (1) on average around a 7% increase in f-measure, and (2) on average around a 5% increase in predictive accuracy as compared with state-of-the-art methods.

Funding

Ministry of Trade, Industry and Energy

History

Publication title

PLOS One

Volume

13

Issue

8

Article number

e0202705

Number

e0202705

Pagination

1-28

ISSN

1932-6203

Department/School

School of Information and Communication Technology

Publisher

Public Library of Science

Place of publication

USA

Rights statement

Repository Status

Open

Socio-economic Objectives

Information systems, technologies and services not elsewhere classified

Usage metrics

Keywords

univariate ensemble-based feature selection (uEFS) methodology

Licence

In Copyright

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

uEFS: An efficient and comprehensive ensemble-based feature selection methodology to select informative features

Funding

Ministry of Trade, Industry and Energy

History

Publication title

Volume

Issue

Article number

Number

Pagination

ISSN

Department/School

Publisher

Place of publication

Rights statement

Repository Status

Socio-economic Objectives

Usage metrics

Categories

Keywords

Licence

Exports