Alignment-free comparison of metagenomics sequences via approximate string matching

Chen, J, Yang, L, Li, L et al. (2022). Alignment-free comparison of metagenomics sequences via approximate string matching. Bioinformatics Advances, 2(1), 10.1093/bioadv/vbac077

cited authors

Chen, J; Yang, L; Li, L; Goodison, S; Sun, Y

authors

Chen, Ji'An

abstract

Summary: Quantifying pairwise sequence similarities is a key step in metagenomics studies. Alignment-free methods provide a computationally efficient alternative to alignment-based methods for large-scale sequence analysis. Several neural network-based methods have recently been developed for this purpose. However, existing methods do not perform well on sequences of varying lengths and are sensitive to the presence of insertions and deletions. In this article, we describe the development of a new method, referred to as AsMac, that addresses these issues. We proposed a novel neural network structure that uses approximate string matching to extract pertinent information from biological sequences, and we developed an efficient gradient computation algorithm for training the constructed neural network. We performed a large-scale benchmark study using real-world data that demonstrated the effectiveness and potential utility of the proposed method.
