DOME ID (Diagnostic Oligo Motifs for Explicit IDentification): a simple SIDE (sequence identification engine)

Damon P. Little¹

¹Lewis B. and Dorothy Cullman Program for Molecular Systematics, The New York Botanical Garden, Bronx, NY, USA

description

This set of scripts is designed to transform a set of FASTA formated sequences into a queriable DNA barcoding reference database. These scripts were first used by Little and Stevenson (2007).

script usage

(1) Create a MySQL database: “mysql -u root -p barcode < db-tables.sql”.

(2) Create a table of motifs: “patterns.pl size motifs”.
8-10 bp motifs are recommended.

(3) Import fasta formatted sequences: “fst2mysql.pl sequences.fasta division locus”.
The file “sequences.fasta” is assumed to be DNA sequences in FASTA format (either GenBank FASTA, or FASTA with id_genus_species).

(4) Barcode sequences: “barcode.pl locus”.

(5) Identify sequences: “dome.pl sequence”.

requirements

PERL interpreter
MySQL

citation

Little, D. P. 2007. DOME ID (Diagnostic Oligo Motifs for Explicit IDentification): a simple SIDE (sequence identification engine). Program distributed by the author.

download

DOME ID