DOME ID (Diagnostic Oligo Motifs for Explicit IDentification): a simple SIDE (sequence identification engine)
Damon P. Little
Lewis B. and Dorothy Cullman Program for Molecular Systematics, The New York Botanical Garden, Bronx, NY, USA
description
This set of scripts transforms a set of FASTA-formatted sequences into a queryable DNA barcoding reference database. The scripts were first used by Little and Stevenson (2007).
script usage
(1) Create a MySQL database: “mysql -u root -p barcode < db-tables.sql”.
(2) Create a table of motifs: “patterns.pl size motifs”.
8-10 bp motifs are recommended.
(3) Import FASTA-formatted sequences: “fst2mysql.pl sequences.fasta division locus”.
The file “sequences.fasta” is assumed to contain DNA sequences in FASTA format, with either GenBank-style headers or headers of the form id_genus_species.
(4) Barcode sequences: “barcode.pl locus”.
(5) Identify sequences: “dome.pl sequence”.
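To make the ideas behind steps (2) and (3) concrete, the following Python sketch enumerates every possible motif of a given size, reads FASTA records whose headers follow the id_genus_species layout, and reports which motifs occur in a sequence. It is an illustration only, not the logic of the distributed Perl scripts: the function names, the assumption that a motif table holds all oligos over A, C, G and T, and the placeholder records are all hypothetical.

```python
from itertools import product


def all_motifs(size, alphabet="ACGT"):
    """Return every possible oligo of the given length, in lexicographic order.

    Assumption: a motif table is the full set of oligos of one size.
    """
    return ["".join(p) for p in product(alphabet, repeat=size)]


def parse_fasta(text):
    """Parse FASTA text into a list of (header, sequence) pairs."""
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                records.append((header, "".join(chunks).upper()))
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks).upper()))
    return records


def split_header(header):
    """Split an id_genus_species header into (id, genus, species)."""
    seq_id, genus, species = header.split("_", 2)
    return seq_id, genus, species


def motifs_present(sequence, motifs):
    """Return the set of motifs that occur at least once in the sequence."""
    size = len(motifs[0])
    windows = {sequence[i:i + size] for i in range(len(sequence) - size + 1)}
    return windows & set(motifs)


if __name__ == "__main__":
    # Made-up records with placeholder taxon names, for illustration only.
    example = ">s1_Aus_bus\nACGT\nac\n>s2_Aus_cus\nTTTT\n"
    motifs = all_motifs(2)
    for header, sequence in parse_fasta(example):
        print(split_header(header), sorted(motifs_present(sequence, motifs)))
```

The number of possible motifs grows as 4 to the power of the motif size, so a size of 8 gives 65,536 motifs and a size of 10 gives 1,048,576.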
requirements
Perl interpreter
MySQL
citation
Little, D. P. 2007. DOME ID (Diagnostic Oligo Motifs for Explicit IDentification): a simple SIDE (sequence identification engine). Program distributed by the author.
download
DOME ID