Go to MPEP - Table of Contents
2423.01 Format and Symbols To Be Used in Sequence Listings - 2400 Biotechnology
2423.01 Format and Symbols To Be Used in Sequence Listings
37 CFR 1.822 sets forth the format and symbols to be used for listing nucleotide and/or amino acid sequence data. The codes for representing the nucleotide and/or amino acid characters in the sequences are set forth in the tables of WIPO Standard ST.25 (1998), Appendix 2, Tables 1 and 3. See MPEP § 2422. No other symbols shall be used in nucleotide and amino acid sequences. The "modified base" and "modified and unusual amino acid" codes appearing in WIPO Standard ST.25 (1998), Appendix 2, Tables 2 and 4 (see 37 CFR 1.822 and MPEP § 2422) are not to be set forth in the sequences recited in the Sequence Listing. However, "modified base" or "modified and unusual amino acid" codes may be used in the written description and/or drawing portions of the specification. To properly enter notations for modified codes in the Sequence Listing, the Feature section of the Sequence Listing should be used. That is, a modified base or amino acid may be presented in a given sequence as the corresponding unmodified base or amino acid if the modified base or amino acid is one of those listed in WIPO Standard ST.25 (1998), Appendix 2, Table 2 or 4 and the modification is also set forth in the Feature section of the Sequence Listing. Otherwise, all bases or amino acids not appearing in WIPO Standard ST.25 (1998), Appendix 2, Table 1 or 3 must be listed in a given sequence as "n" or "Xaa," respectively, with further information given in the Feature section of the "Sequence Listing." See 37 CFR 1.823(b).
In 37 CFR 1.822(b) and 37 CFR 1.822(d), the use of three-letter codes for amino acids is required. The use of the three-letter codes for amino acids is preferred over the one-letter codes from the perspective of facilitating the examiner's review of the application papers, including the "Sequence Listing", and the public's, as well as the examiner's, use of the printed patents. The three-letter codes must be presented using the upper case for the first character and lower case for the remaining two characters.
37 CFR 1.822(c) through (e) set forth the format for presenting sequence data. These paragraphs set forth the manner in which the characters in sequences are to be grouped, spaced, presented and numbered.
Go to MPEP - Table of Contents