Tertiary analysis of STR variants
Short tandem repeats (STRs) are genomic regions composed of repeated short DNA sequences. When STRs expand beyond the normal range, they can cause mutations known as repeat expansions, which may alter gene expression or function.
Expansion of these sequences in certain genes is the underlying mechanism for a group of inherited conditions called repeat expansion disorders. These disorders primarily affect the nervous and muscular systems, leading to diseases such as Fragile X syndrome, amyotrophic lateral sclerosis and Huntington’s disease.
The platform shows repeat counts per allele and highlights values that fall in normal, intermediate or pathogenic ranges (where known).
STR loci filtering and prioritization
STR variants outside the genomic regions specified in the DRAGEN v4.2 expanded variant catalog are excluded before tertiary analysis. This applies regardless of whether DRAGEN (internal or external) or another pipeline was used for the secondary analysis and irrespective of the DRAGEN version.
Variant calling reliability and availability of meaningful annotations for STR loci can vary. To reduce noise and ensure high-confidence calls, only a curated subset of STR loci is included in AI Shortlist analysis (Table 1). STR loci were curated by Emedgene based on strict criteria to ensure technical accuracy:
Validated genotype–phenotype associations and established pathogenicity. Loci must have well-documented links to disease and specific phenotypes. This avoids reporting variants with unclear implications.
Reliable calling performance. STR calling is technically challenging. Some loci are prone to false positives or inaccurate sizing. The subset includes loci that DRAGEN-STR can call consistently and accurately. This avoids reporting low-quality variants.
Expansion thresholds. Loci with defined pathogenic thresholds (e.g., number of repeats linked to disease) are prioritized. This avoids reporting variants that lack clear interpretation guidelines.
STRs that are not tagged by AI Shortlist are still included in the analysis. This means they will not be automatically prioritized or tagged as Most Likely or Candidate by the AI. However, these loci remain available for manual review and tagging.
Table 1. STR loci considered by AI Shortlist
AR
X:67545316-67545385
Spinal and bulbar muscular atrophy (SBMA)
XR
GCA
ATN1
12:6936716-6936773
Dentatorubral-pallidoluysian atrophy (DRPLA)
AD
CAG
ATN1
12:6923960-6923977
TG
ATN1
12:6925117-6925140
GT
ATN1
12:6930800-6930827
AATA
ATXN1
6:16327633-16327723
Spinocerebellar ataxia 1 (SCA1)
AD
TGC
ATXN10
22:45795354-45795424
Spinocerebellar ataxia 10 (SCA10)
AD
ATTCT
ATXN2
12:111598949-111599018
Spinocerebellar ataxia 2 (SCA2)
AD
GCT
ATXN2
12:111461472-111461481
AAAAT
ATXN2
12:111464247-111464282
GT
ATXN2
12:111483194-111483237
AC
ATXN2
12:111487453-111487472
TTTA
ATXN2
12:111568991-111569015
TTTTG
ATXN2
12:111577256-111577279
TTAT
ATXN2
12:111588188-111588219
AAAT
ATXN2
12:111591364-111591375
TAAAAA
ATXN2
12:111591756-111591773
TTG
ATXN2
12:111596205-111596244
AC
ATXN2
12:111596373-111596388
CA
ATXN2
12:111600935-111600958
AAAT
ATXN2
12:111604087-111604106
AT
ATXN3
14:92071009-92071041
Spinocerebellar ataxia 3 (SCA3)
AD
GCT
ATXN3
14:92059829-92059848
AAAG
ATXN3
14:92065889-92065900
AAT
ATXN3
14:92078361-92078376
TTAT
ATXN7
3:63912684-63912714
Spinocerebellar ataxia 7 (SCA7)
AD
GCA
ATXN7
3:63912714-63912725
GCC
ATXN8OS
13:70139383-70139428
CTG
ATXN8OS
13:70139353-70139382
Spinocerebellar ataxia 8 (SCA8)
AD
CTA
ATXN8OS
13:70115027-70115046
GT
ATXN8OS
13:70126607-70126620
TA
ATXN8OS
13:70129375-70129402
TG
C9ORF72
9:27573528-27573546
Frontotemporal dementia and/or amyotrophic lateral sclerosis 1 (FTDALS1)
AD
GGCCCC
CACNA1A
19:13207858-13207897
Spinocerebellar ataxia 6 (SCA6)
AD
CTG
CNBP
3:129172576-129172656
CAGA
DMPK
19:45770204-45770264
Myotonic dystrophy 1 (DM1)
AD
CAG
FMR1
X:147912050-147912110
Fragile X syndrome (FXS)
XD
CGG
FXN
9:69037286-69037304
GAA
FXN
9:69037261-69037285
Friedreich ataxia (FRDA)
AR
A
HTT
4:3074876-3074933
Huntington disease (HD)
AD
CAG
HTT
4:3074939-3074965
CCG
JPH3
16:87604287-87604329
Huntington disease-like 2 (HDL2)
AD
CTG
NOP56
20:2652733-2652757
Spinocerebellar ataxia 36 (SCA36)
AD
GGCCTG
NOP56
20:2652757-2652774
CGCCTG
PPP2R2B
5:146878727-146878757
Spinocerebellar ataxia 12 (SCA12)
AD
GCT
TBP
6:170561906-170562017
Spinocerebellar ataxia 17 (SCA17)
AD
GCA
Last updated
Was this helpful?
