Protein and Function Databases, Tutorial 7. Today’s menu: -SwissProt/TrEMBL -PROSITE -Pfam -Gene Ontology


1 Protein and Function Databases, Tutorial 7. Today’s menu: -SwissProt/TrEMBL -PROSITE -Pfam -Gene Ontology

2 Characterized proteins / Hypothetical proteins

3

4

5

6

7

8 Pfam (http://www.sanger.ac.uk/Software/Pfam/): Pfam is a database of multiple alignments of protein domains or conserved protein regions.

9

10

11

12

13

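14 ls mode: a hit is reported if it aligns globally to the family model built from the seed alignment (the match must span the whole domain).
fs mode: a hit is reported if it aligns locally to that model (fragment matches are allowed).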

15 Description, Structure info, Gene Ontology, Links

16

17 What kind of domains can we find in Pfam?
-Trusted domains
-Repeats and motifs
-Fragment domains
-Nested domains
-Disulfide bonds
-Important residues (e.g., active sites)
-Transmembrane domains

18 What kind of domains can we find in Pfam?
-Low complexity regions
-Coiled coils: two or three alpha helices that wind around each other
-Pfam-B
-Context domains: domains that do not score above the family threshold but are expected to be real, based on the other domains found in the protein
-Signal peptides: indicate a protein that will be secreted

19 PROSITE (http://www.expasy.org/tools/scanprosite): PROSITE is a database of protein domains and motifs, each described by either a regular-expression-like pattern or a sequence profile, against which a query sequence can be scanned.
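To make the "regular expression pattern" idea concrete, here is a minimal sketch of how a PROSITE-style pattern can be translated into a Python regular expression. The function name prosite_to_regex is hypothetical, and the sketch covers only the common syntax elements (x, [..], {..}, repeat counts, and the < and > anchors); it is not how ScanProsite itself is implemented.

```python
import re

def prosite_to_regex(pattern: str) -> str:
    """Translate a PROSITE-style pattern into a Python regex (simplified sketch)."""
    pattern = pattern.strip().rstrip(".")      # patterns conventionally end with a period
    parts = []
    for element in pattern.split("-"):         # elements are separated by '-'
        prefix = suffix = ""
        if element.startswith("<"):            # '<' anchors the match at the N-terminus
            prefix, element = "^", element[1:]
        if element.endswith(">"):              # '>' anchors the match at the C-terminus
            suffix, element = "$", element[:-1]
        # Split off an optional repeat count such as (3) or (1,3).
        m = re.fullmatch(r"(.+?)(?:\((\d+)(?:,(\d+))?\))?", element)
        core, lo, hi = m.group(1), m.group(2), m.group(3)
        if core == "x":                        # 'x' means any residue
            core = "."
        elif core.startswith("{") and core.endswith("}"):
            core = "[^" + core[1:-1] + "]"     # {..} means any residue except these
        repeat = ""
        if lo is not None:
            repeat = "{" + lo + ("," + hi if hi is not None else "") + "}"
        parts.append(prefix + core + repeat + suffix)
    return "".join(parts)

print(prosite_to_regex("A-x(1,3)-A"))          # A.{1,3}A
print(prosite_to_regex("<M-{P}-[ST](2)-x."))   # ^M[^P][ST]{2}.
```

The second pattern is made up purely to exercise the syntax; it is not a real PROSITE entry.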

20

21 Search results: domain architecture

22

23 PRATT (http://www.expasy.org/tools/pratt/): makes a pattern from sequences in FASTA format.

24 PRATT

25

26 Greed, Overlap and Include. Search A-x(1,3)-A on ABACADAEAFA
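The effect of greed and overlap on this example can be sketched with ordinary Python regular expressions. This is an analogy under stated assumptions, not ScanProsite's own matching code: the pattern A-x(1,3)-A is hand-translated to A.{1,3}A, greedy and non-greedy quantifiers stand in for the greed option, and a lookahead stands in for overlapping matches. The include option is not modelled here.

```python
import re

sequence = "ABACADAEAFA"

# Greedy: x(1,3) consumes as many residues as possible; matches do not overlap.
greedy = [m.group() for m in re.finditer(r"A.{1,3}A", sequence)]

# Non-greedy: x(1,3) consumes as few residues as possible.
lazy = [m.group() for m in re.finditer(r"A.{1,3}?A", sequence)]

# Overlapping: a zero-width lookahead lets a match start at every position.
overlapping = [(m.start(), m.group(1))
               for m in re.finditer(r"(?=(A.{1,3}A))", sequence)]

print(greedy)       # ['ABACA', 'AEAFA']
print(lazy)         # ['ABA', 'ADA', 'AFA']
print(overlapping)  # [(0, 'ABACA'), (2, 'ACADA'), (4, 'ADAEA'), (6, 'AEAFA'), (8, 'AFA')]
```

The same pattern on the same sequence gives two, three or five hits depending on these settings, which is why they matter when comparing search results.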

27

28 Gene Ontology (GO)
-It is a database of biological processes, molecular functions and cellular components.
-GO contains neither sequence information nor gene or protein descriptions.
-GO is linked to gene and protein databases.
-GO is organized as a hierarchy. Strictly, it is a directed acyclic graph rather than a tree, because a term may have more than one parent.

29 Three principal branches: biological process, molecular function, cellular component (http://www.genedb.org/amigo/perl/go.cgi)

30 GO structure is a Directed Acyclic Graph
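What separates a directed acyclic graph from a tree is that a term can have several parents. A minimal sketch, using a hypothetical toy ontology (the term names are GO-like, but the relationships are illustrative only and not copied from the GO database):

```python
# Hypothetical toy ontology: each term maps to the list of its parent terms.
parents = {
    "molecular_function": [],
    "binding": ["molecular_function"],
    "catalytic activity": ["molecular_function"],
    "nucleic acid binding": ["binding"],
    "DNA binding": ["nucleic acid binding"],
    "helicase activity": ["catalytic activity"],
    # Two parents: this is what makes the structure a DAG rather than a tree.
    "DNA helicase activity": ["DNA binding", "helicase activity"],
}

def ancestors(term, graph):
    """Return every term reachable by following parent links upward."""
    seen = set()
    stack = list(graph[term])
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(graph[current])
    return seen

for name in sorted(ancestors("DNA helicase activity", parents)):
    print(name)
```

A protein annotated with the most specific term is implicitly annotated with all of its ancestors, along every path up to the root.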

31 Important: note the source (evidence code) of each GO entry

32 GO sources
ISS  Inferred from Sequence/Structural Similarity
IDA  Inferred from Direct Assay
IPI  Inferred from Physical Interaction
TAS  Traceable Author Statement
NAS  Non-traceable Author Statement
IMP  Inferred from Mutant Phenotype
IGI  Inferred from Genetic Interaction
IEP  Inferred from Expression Pattern
IC   Inferred by Curator
ND   No Data available
IEA  Inferred from Electronic Annotation
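Because the source of an annotation affects how much it can be trusted, it is often useful to filter annotations by evidence code. A minimal sketch: the annotation records and the helper filter_by_evidence are hypothetical, and treating IDA, IPI, IMP, IGI and IEP as the experimental group follows the usual GO grouping of evidence codes.

```python
# Evidence codes as listed on the slide.
EVIDENCE_CODES = {
    "ISS": "Inferred from Sequence/Structural Similarity",
    "IDA": "Inferred from Direct Assay",
    "IPI": "Inferred from Physical Interaction",
    "TAS": "Traceable Author Statement",
    "NAS": "Non-traceable Author Statement",
    "IMP": "Inferred from Mutant Phenotype",
    "IGI": "Inferred from Genetic Interaction",
    "IEP": "Inferred from Expression Pattern",
    "IC": "Inferred by Curator",
    "ND": "No Data available",
    "IEA": "Inferred from Electronic Annotation",
}

# Codes backed by a wet-lab experiment.
EXPERIMENTAL = {"IDA", "IPI", "IMP", "IGI", "IEP"}

# Hypothetical annotations: (protein id, GO term label, evidence code).
annotations = [
    ("protA", "DNA binding", "IDA"),
    ("protA", "nucleus", "IEA"),
    ("protB", "kinase activity", "ISS"),
    ("protB", "signal transduction", "IMP"),
    ("protC", "membrane", "ND"),
]

def filter_by_evidence(records, allowed):
    """Keep only the records whose evidence code is in `allowed`."""
    unknown = set(allowed) - set(EVIDENCE_CODES)
    if unknown:
        raise ValueError(f"unknown evidence codes: {sorted(unknown)}")
    return [record for record in records if record[2] in allowed]

experimental_only = filter_by_evidence(annotations, EXPERIMENTAL)
for protein, term, code in experimental_only:
    print(protein, term, code, "-", EVIDENCE_CODES[code])
```

In this toy data only the IDA and IMP annotations survive; the IEA entry, which was assigned electronically without curator review, is dropped.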

