Purpose: To prepare HFV sequence sets, together with related data, for submission to GenBank.
Before you use this tool:
Please verify that the sequences to be submitted are correct.
This is a beta version! Please use with care.
You will be prompted to enter annotation information that GenBank does not have a place for.
These data can be entered directly from a comma delimited (.csv) file.
To save your data in Excel to comma delimited format, go to File> Save As, select CSV (Comma delimited)(*.csv).
Each row in the comma delimited (.csv) file should correspond to a sequence in the Fasta file. The first column should contain the names of the sequences exactly as they appear in the Fasta file. Any differences in sequence names will lead to errors. The order of sequences need not match the Fasta file. Annotation data are associated with sequences by matching sequence names, rather than the order in the files.
Columns may contain sequence annotation data, such as the
sample date, sample country, etc. For details about supported annotation and the requisite format, see the
Annotation fields. This information will be stored in human-readable form in the comment field of the GenBank entry, making it available to researchers
worldwide. Once the GenBank record is released, this information will be automatically loaded into the Los Alamos Sequence Database,
allowing the data to be searchable from our
See an example of Fasta sequences and their CSV annotation data.
Annotation fields help