Presentation is loading. Please wait.

Presentation is loading. Please wait.

TEXTAL: A System for Automated Model Building Based on Pattern Recognition Thomas R. Ioerger Department of Computer Science Texas A&M University.

Similar presentations


Presentation on theme: "TEXTAL: A System for Automated Model Building Based on Pattern Recognition Thomas R. Ioerger Department of Computer Science Texas A&M University."— Presentation transcript:

1 TEXTAL: A System for Automated Model Building Based on Pattern Recognition Thomas R. Ioerger Department of Computer Science Texas A&M University

2 Main Stages of TEXTAL electron density map CAPRA C-alpha chains LOOKUP model (initial coordinates) model (final coordinates) Post-processing routines Reciprocal-space refinement/ML DM Human Crystallographer (editing) build-in side-chain and main-chain atoms locally around each CA example: real-space refinement

3 F=

4 CAPRA: C-Alpha Pattern Recognition Algorithm

5

6

7

8

9

10

11

12 Overview of CAPRA goal: predict CA chains from density map not just “tracing” - more than Bones desire 1:1 correspondence, ~3.8A apart based on principles of pattern recognition –use neural net to estimate which pseudo-atoms in trace “look” closest to true C-alphas –use feature extraction to capture 3D patterns in density for input to neural net –use other heuristics for “linking” together into chains, including geometric analysis (s.s.)

13 CAPRA: C-Alpha Pattern-Recognition Algorithm Tracer - remove lattice points from map (lowest density first) without breaking connectivity Neural nework - for each pseudo atom, extract features, input to network, predict distances to CAs (1:10 in trace), trained on example points in real maps Linking - desire long chains, good CA predictions (not in side-chains), “structurally plausible” (e.g. linear, helical) Density Trace Neural Network Linking into C-alpha chains pseudo atoms predictions of distance to true CA map C-alpha coordinates

14 Steps in CAPRA

15 Examples of CAPRA Steps

16 Tracer + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +

17 Neural Network

18 Feature Extraction characterize 3D patterns in local density must be “rotation invariant” examples: –average density in region –standard deviation, kurtosis... –distance to center of mass –moments of inertia, ratios of moments –“spoke angles” calculated over spheres of 3A and 4A radius

19 Forward Propagation: Backward Propagation:

20 Selection of Candidate C-alpha’s method: –pick candidates in order of lowest predicted distance first, –among all pseudo-atoms in trace, –as long as not closer than 2.5A notes: –no 3.8A constraint; distance can be as high as 5A –don’t rely on branch points (though often near) –picked in random order throughout map –initially covers whole map, including side-chains and disconnected regions (e.g. noise in solvent)

21 Linking into Chains initial connectivity of CA candidates based on the trace “over-connected” graph - branches, cycles... start by computing connected components (islands, or clusters) two strategies: –for small clusters (<=20 candidates), find longest internal chain with “good” atoms –for large clusters (>20 candidates), incrementally clip branch points using heuristics

22 Extracting Chains from Small Clusters exhaustive depth-first search of all paths scoring function: –length –penalty for inclusion of points with high predicted distance to true CA by neural net –preference for following secondary structure (locally straight or helical)

23 Secondary Structure Analysis generate all 7-mers (connected fragments of candidate CAs of length 7) evaluate “straightness” –ratio of sum of link lengths to end-to-end distance –straightness>0.8 ==> potential beta-strand evaluate “helicity” –average absolute deviation of angles and torsions along 7-mer from ideal values (95º and 50º) –helicity potential alpha-helix

24 Handling Large Clusters start by breaking cycles (near “bad” atoms) clip links at branch points till only linear chains remain clip the most “obvious” links first, e.g. –if other two links are part of sec. struct. –if clipped branch has “bad” atom nearby –if clipped branch is small and other 2 are large ?? ?

25 Example of CA-chains for CzrA fit by CAPRA

26 Results for MVK

27 Results

28 Availability Textal web site: –http://textal.tamu.edu:12321 –server-side processing –free access to Capra –beta-testing of Textal To contact us, email: textal@tamu.edu

29 Acknowledgements Funding –National Institutes of Health –Welch Foundation People –Dr. James C. Sacchettini –The rest of the TEXTAL Group: Tod Romo Kreshna Gopal Reetal Pai


Download ppt "TEXTAL: A System for Automated Model Building Based on Pattern Recognition Thomas R. Ioerger Department of Computer Science Texas A&M University."

Similar presentations


Ads by Google