Applications of nature-inspired algorithms for dimension reduction: Enabling efficient data analytics Book Chapter

Mohammadi, FG, Amini, MH, Arabnia, HR. (2020). Applications of nature-inspired algorithms for dimension reduction: Enabling efficient data analytics . 1123 67-84. 10.1007/978-3-030-34094-0_4

cited authors

  • Mohammadi, FG; Amini, MH; Arabnia, HR

abstract

  • In Mohammadi et al. (Evolutionary computation, optimization and learning algorithms for data science. arXiv preprint, arXiv: 1908.08006, 2019), we have explored the theoretical aspects of feature selection and evolutionary algorithms. In this chapter, we focus on optimization algorithms for enhancing data analytic process, i.e., we propose to explore applications of nature-inspired algorithms in data science. Feature selection optimization is a hybrid approach leveraging feature selection techniques and evolutionary algorithms process to optimize the selected features. Prior works solve this problem iteratively to converge to an optimal feature subset. Feature selection optimization is a non-specific domain approach. Data scientists mainly attempt to find an advanced way to analyse data n with high computational efficiency and low time complexity, leading to efficient data analytics. Thus, by increasing generated/measured/sensed data from various sources, analysis, manipulation and illustration of data grow exponentially. Due to the large scale datasets, curse of dimensionality (CoD) is one of the NP-hard problems in data science. Hence, several efforts have been focused on leveraging evolutionary algorithms (EAs) to address the complex issues in large scale data analytics problems. Dimension reduction, together with EAs, lends itself to solve CoD and solve complex problems, in terms of time complexity, efficiently. In this chapter, we first provide a brief overview of previous studies that focused on solving CoD using feature extraction optimization process. We then discuss practical examples of research studies that successfully tackled some application domains, such as image processing, sentiment analysis, network traffics/anomalies analysis, credit score analysis and other benchmark functions/datasets analysis.

publication date

  • January 1, 2020

Digital Object Identifier (DOI)

start page

  • 67

end page

  • 84

volume

  • 1123