A bioinformatics approach to identify telomere sequences

Biotechniques. 2018 Jul;65(1):20-25. doi: 10.2144/btn-2018-0057.

Abstract

Conventional approaches to identify a telomere motif in a new genome are laborious and time-intensive. An efficient new methodology based on next-generation sequencing (NGS), de novo sequence repeat finder (SERF) and fluorescence in situ hybridization (FISH) is presented. Unlike existing heuristic approaches, SERF utilizes an exhaustive analysis of raw NGS reads or assembled contigs for rapid de novo detection of conserved tandem repeats representing telomere motifs. SERF was validated using the NGS data from Ipheion uniflorum and Allium cepa with known telomere motifs. The analysis program was then used on NGS data to investigate the telomere motifs in several additional plant species and together with FISH proved to be an efficient approach to identify as yet unknown telomere motifs.

Keywords: NGS; SERF; bioinformatics; next-generation sequencing; sequence repeat finder; tandem repeats; telomere.

MeSH terms

  • Allium / genetics*
  • Amaryllidaceae / genetics*
  • Computational Biology*
  • Conserved Sequence / genetics
  • High-Throughput Nucleotide Sequencing
  • In Situ Hybridization, Fluorescence
  • Nucleotide Motifs / genetics
  • Sequence Analysis, DNA
  • Tandem Repeat Sequences / genetics
  • Telomere / genetics*