Skip to content

Commit

Permalink
More docs on FASTA and chain IDs
Browse files Browse the repository at this point in the history
  • Loading branch information
benmwebb committed Sep 10, 2024
1 parent 3648ed8 commit 8172b86
Showing 1 changed file with 7 additions and 2 deletions.
9 changes: 7 additions & 2 deletions pyext/src/topology/__init__.py
Original file line number Diff line number Diff line change
Expand Up @@ -1368,11 +1368,16 @@ class TopologyReader:
from the file is assumed to be a protein sequence. If it should instead
be treated as RNA or DNA, add an ',RNA' or ',DNA' suffix. For example,
a `fasta_id` of 'myseq,RNA' will read the sequence 'myseq' from the
FASTA file and treat it as RNA.
FASTA file and treat it as RNA. The FASTA header may contain multiple
fields split by pipe (|) characters. If so, the FASTA sequence name is
the first field and the second field (if present) is the UniProt
accession. For example, ">cop9|Q13098" yields a FASTA sequence name
of "cop9" and UniProt accession of "Q13098".
- `pdb_fn`: Name of PDB or mmCIF file with coordinates (if available).
If left empty, will set up as BEADS (you can also specify "BEADS")
Can also write "IDEAL_HELIX".
- `chain`: Chain ID of this domain in the PDB file.
- `chain`: Chain ID of this domain in the PDB or mmCIF file. This is
the "author-provided" chain ID for mmCIF files, not the asym_id.
- `residue_range`: Comma delimited pair defining range.
Can leave empty or use 'all' for entire sequence from PDB file.
The second item in the pair can be END to select the last residue in the
Expand Down

0 comments on commit 8172b86

Please sign in to comment.