...
The PLAIN data warehouse is a series of data marts, each focusing on a single central concept and the related information relevant to that concept. Each mart gives the client user the ability to query information about the main concept in a way that optimizes both the amount of effort spent in constructing a query and the processing time required to retrieve the results.
- Shared Resources: This subsystem contains the basic resources shared across all data marts, such as taxon and species variant.
- Locus Detail: This subsystem centers on the concept of locus, a specific location on a chromosome. This data mart contains information on gene models, gene structure, functional annotations, polymorphisms, germplasms, and meta data relevant to all available loci.
- Genomic Region:This subsystem centers on the concepts of a reference genome, a collection of sequences (often chromosomes) taken as the standard for a given organism and genome assembly, and a region in such a genome, a sequence feature with an extent greater than zero. A nucleotide region is composed of bases and a polypeptide region is composed of amino acids. This data mart contains information on genes, transcripts, chromosomes, and contigs, with additional information about polypeptides, CDNAs, and ESTs.
- Protein: This subsystem centers on the concept of a protein, an amino acid possibly related to a transcript. It contains information about the related transcripts, protein domains, and also resource links to additional information about the protein.
...