This documentation explains the FASTA format used for defining nucleotide sequences, such as promoters and terminators, with metadata parsing for better integration into tools.
>NM_004562 PRKN | Homo sapiens chromosome 6, GRCh38.p14 Primary Assembly NC_000006.12 | Strand: minus | Promoter | TSS (on chromosome): 162727766 | TSS (on sequence): 2000
TATGAATACAGGTTTAGGAAAAAACAGAAAAGAACCCCAACCAGTAAAAAAAAAATTAAAGTATAACATTAAAAAACATCAAAATTGTAAATATTGTGTAGAAGAAAAACTAAATGATTAACCTGAATGG...
>): Contains metadata about the sequence. Key fields include:
NM_004562 PRKNHomo sapiens chromosome 6GRCh38.p14 Primary Assembly NC_000006.12Strand: minusPromoter or TerminatorTSS (on chromosome): 162727766TSS (on sequence): 2000A, T, G, C).You can upload FASTA files containing regulatory region sequences. Headers must include metadata for proper processing:
Promoter or Terminator.plus or minus.