This documentation explains the FASTA format used to define nucleotide sequences for regulatory regions, such as promoters and terminators, and the header metadata that is parsed so tools can process each sequence. An example record:
>NM_004562 PRKN | Homo sapiens chromosome 6, GRCh38.p14 Primary Assembly NC_000006.12 | Strand: minus | Promoter | TSS (on chromosome): 162727766 | TSS (on sequence): 2000
TATGAATACAGGTTTAGGAAAAAACAGAAAAGAACCCCAACCAGTAAAAAAAAAATTAAAGTATAACATTAAAAAACATCAAAATTGTAAATATTGTGTAGAAGAAAAACTAAATGATTAACCTGAATGG...
Header line (starts with >): Contains metadata about the sequence, with fields separated by | characters. Key fields include:
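Transcript accession and gene symbol: NM_004562 PRKN
Organism and chromosome: Homo sapiens chromosome 6
Genome assembly and chromosome accession: GRCh38.p14 Primary Assembly NC_000006.12
Strand: minus
Region type: Promoter or Terminator
TSS (on chromosome): 162727766 (transcription start site in chromosome coordinates)
TSS (on sequence): 2000 (transcription start site position within the sequence in this record)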
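The header fields above can be pulled apart with a few lines of Python. This is a minimal sketch, not the parser any particular tool uses: the function name parse_header and the dictionary keys "id", "location", and "region" are hypothetical, and it assumes fields are separated by | and that labeled fields use a "Label: value" form, as in the example header.

```python
def parse_header(header: str) -> dict:
    """Split a '>'-prefixed FASTA header into pipe-separated metadata fields."""
    if not header.startswith(">"):
        raise ValueError("FASTA header must start with '>'")

    # Drop the leading '>' and split on the '|' separator.
    fields = [f.strip() for f in header[1:].split("|")]

    # Assumption: the first field is the accession and gene symbol.
    meta = {"id": fields[0], "location": None, "region": None}

    for field in fields[1:]:
        if ":" in field:
            # Labeled field such as "Strand: minus" or "TSS (on sequence): 2000".
            key, _, value = field.partition(":")
            meta[key.strip()] = value.strip()
        elif field in ("Promoter", "Terminator"):
            meta["region"] = field
        else:
            # Unlabeled free text, e.g. organism, chromosome, and assembly.
            meta["location"] = field
    return meta
```

Calling parse_header on the example header yields "minus" under the key "Strand" and "Promoter" under the key "region". Values are kept as strings; converting the TSS positions to integers is left to the caller.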
Sequence lines: Contain the nucleotide sequence, written with the bases A, T, G, and C.

You can upload FASTA files containing regulatory region sequences. Headers must include the following metadata for proper processing:
Region type: Promoter or Terminator.
Strand: plus or minus.
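Before uploading, a record can be checked against these requirements. The following is a hedged sketch only: check_record is a hypothetical helper, not part of any upload tool, and it assumes the strand appears as a "Strand: ..." field, that the region type appears as a standalone field, that lowercase bases are acceptable, and that any character other than A, T, G, or C (including N) should be flagged.

```python
VALID_REGIONS = {"Promoter", "Terminator"}
VALID_STRANDS = {"plus", "minus"}
VALID_BASES = set("ATGC")


def check_record(header: str, sequence: str) -> list:
    """Return a list of problems found in one FASTA record (empty if none)."""
    problems = []
    if not header.startswith(">"):
        problems.append("header does not start with '>'")

    # Split the header into its '|'-separated fields.
    fields = [f.strip() for f in header.lstrip(">").split("|")]

    # Region type must appear as its own field.
    if not any(f in VALID_REGIONS for f in fields):
        problems.append("missing region type (Promoter or Terminator)")

    # Strand must appear as "Strand: plus" or "Strand: minus".
    strands = [f.partition(":")[2].strip() for f in fields if f.startswith("Strand:")]
    if not strands or strands[0] not in VALID_STRANDS:
        problems.append("missing or invalid strand (plus or minus)")

    # Assumption: only A, T, G, C are allowed; case is ignored.
    bad = set(sequence.upper()) - VALID_BASES
    if bad:
        problems.append("unexpected characters in sequence: " + "".join(sorted(bad)))
    return problems
```

An empty list means the record passed every check in this sketch; it does not guarantee that a given tool will accept the file.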