Presentation is loading. Please wait.

Presentation is loading. Please wait.

Path Profile Estimation and Superblock Formation Jeff Pang Jimeng Sun.

Similar presentations


Presentation on theme: "Path Profile Estimation and Superblock Formation Jeff Pang Jimeng Sun."— Presentation transcript:

1 Path Profile Estimation and Superblock Formation Jeff Pang Jimeng Sun

2 Motivation Why Continuous Profiling? –Continuous Optimization –Dynamic Optimization –Realistic Profiles OptimizeCompileRun Profile Challenges: –Automated –Low overhead –Accuracy Related Work: H. Chen, et al. Dynamic Trace Selection Using Performance Hardware Sampling. CGO, 2003. A. Shye, et al. Analysis of Path Profiling Information Gathered with Performance Monitoring Hardware. ICCA, 2005.

3 Goals Take advantage of modern Performance Monitoring Units –Like in Pentium 4, Itanium, PPC 970, etc. –Allows sampling of last couple branches –“Simulated” for our project using instrumentation Estimate full path profile using samples Validate by doing Superblock formation –Optimization to improve scheduling on VLIW processors –Path-based Superblocks based on Young (1997) Superblock Formation Run with Simulated PMU Path ProfileSample Path Profile Estimation

4 Design Overview Implemented PMU simulator and Superblock optimization as SUIF passes Implemented Estimator offline using sampled branch profiles and SUIF CFG sourcefrontend instrument (pmu sim) backend instrumented program superblock optimized program Offline estimator estimated path profile sampled profile

5 Path Sampling Exact path profile: –Accurate –But expensive Edge profile –Inaccurate (due to the independence assumption) –Cheap It is hard (impossible) to reconstruct the path information Sampling path profile –Periodically sample 4 consecutive branches (branch trace buffer) –Cheap to collect and more accurate than edge profile A C G B FE D 50 Exact paths: ABDEG ACDFG Edge Profile: ABDEG ACDFG and ABDFG ACDEG Sampling: {AB, DE} {AC, DF} => ABDEG ACDFG

6 Hot Path Formation Sampling paths are short Sampling paths => longer paths –Join 2 paths if they can merge into one simple path and the frequencies about both paths are large –e.g. 5000 ABCD, 4000 CDEF => 4000 ABCDEF

7 Path Estimation Accuracy We compare the top 100 paths captured by the exact path profile and the estimated path profile The success rate is Σ est ∩ act cycle act / Σ act cycle act

8 Superblock Formation Creates larger regions to schedule over for hot paths A B C F DG E A B C F DG E C D E A A A A A A B A A BA B Tail DuplicationLoop UnrollingCombinations

9 Superblock Performance Performance results pending –Waiting for CASH simulator setup… Superblock formation on P4 useless –Causes 0-5% slowdown on tested benchmarks (probably due to icache misses) –Need multi-issue architecture to see sched. benefits?


Download ppt "Path Profile Estimation and Superblock Formation Jeff Pang Jimeng Sun."

Similar presentations


Ads by Google