A BED-like tab-separated value (TSV) file with no header row and with 4 or 5 columns:
accession
: each sequence accession as it appears in Custom Reference FASTA heaer
start
: start position (always set to 0)
end
: end position (sequence length)
genome
: full name of the virus the sequence belongs to (e.g. Influenza A H1N1)
(optional) segment
: how this sequence is labeled within the virus (e.g. Segment 4 (HA)). Set to 'Full' if the sequence is the full genome
This file affects how sequences are labeled in the output.
Sequence names must match those in Custom Reference FASTA. The same set of sequences must appear in both.
If there are multiple viruses, their names should be unique. For example, if there are multiple Influenza genomes, they should not be labeled with the same virus name in the 4th column.
If the Custom Reference FASTA includes sequences from multiple segments, it is strongly recommended to provide this BED file. Otherwise, each segment will be treated independently and not all of them may be used as reference.