Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

You can search for proteins by the following names:

  • Gene Locus name: For sequenced genes, the locus name corresponds to the orf name determined by AGI orf naming convention. For genetic loci (e.g., genes identified by mutation but not yet associated with a sequence) the name corresponds to the accepted symbolic name.AGI orf names have the format AT(1-5 or C,M)gXXXXXX.Where , where the value in parenthesis here corresponds to the chromosome number or organelle genome.
  • Product Name: The name of the protein (e.g. Agamous protein or Dihydroflavanol 4-reductase).
  • Gene Description: Brief summaries of the gene which encodes the protein of interest.This option is useful for including aspects of protein function or localization that are not indicated in the gene or product nameGene Symbol: This is the symbolic name of the locus.
  • GenPeptID: Use this if you know the unique GenBank identifier for the protein.

...

This feature allows you to restrict your search to include only those proteins belonging to the specified structural class Image Removed. You may select multiple options within each parameter by clicking on one selection and then clicking on additional ones while holding down either the CTRL key (PCs) or the Apple key (Mac).

Structural class assignment was performed from annotations of SCOP's superfamilies using HMM models against TIGR's 4.0 Release by Drs JulianGough . Julian Gough and Martin Madera at SCOP database. More information can be found in the following papers.:

Gough, J., Hughey ,R., Karplus ,K., and Chothia ,C.(2001). Assignment ofgenome of genome sequences using a library of hidden Markov models that represent all proteins of known structure. J Mol Biol. 2001 Nov 2;313(4):903-19

...

The predicted molecular weight in kiloDaltons. Molecular weights were calculated using the ATH1.pep file provided by TIGR including all predicted, experimentally verified and hypothetical proteins using the Bioperl function get_mol_wt (found in the SeqStats object at www.bioperl.org).

Calculated PI

The predicted isoelectric point.The isoelectric point is the point at which, on an isoelectric focusing gel the pH at which a protein has a net charge of zero. The pI of a protein is determined by its amino acid composition-and the net contribution of positive and negative charges of the side chains. The calculated pIs were determined using the ATH1.pep file from TIGR and the iep program from EMBOSS www.hgmp.mrc.ac.uk/Software/EMBOSS/. Values are between pH 1-14.

...

Use this option if you want to restrict your search to include only proteins having a specified domain composition. The drop down menus allow you to select the number of occurances occurrences of each domain along with the syntax for identifying the domain. For example, to search for proteins that have one or more occurances occurrences of domain PS00027 IPR045176 do the following:

  • Select the greater than symbol in the first column.
  • Enter one zero '0' in the adjacent input box.
  • Select Prosite INTERPRO from the domain type drop down menu
  • Choose exact match
  • Enter the name of the domain (PS00027IPR045176)

If searching for more than one domain, the search is treated as a logical AND. Therefore, inputing PROSITE entering INTERPRO domain PS00027 IPR045176 AND PFAM ALPHAFOLD domain PF04618 AFA0A1I9LRD1F1 will limit your search to proteins having BOTH domains.

Protein domains are conserved regions of amino acid /structural similarity in protein sequences. Domains generally represent functional units having some form of biological activity. Domains are useful in grouping proteins with little overall sequence similarity. This search allows you to specify both the type and number of domains and use either the domain name , or unique domain identifier. Leaving this option blank will include all proteins in your search. The domains were identified using the interproscan.pl program and INTERPRO databases supplied with the programare the source of the data for this information. For more information see: INTERPRO database.