Introduction to Clock Tree Synthesis

Slides:



Advertisements
Similar presentations
Topics Electrical properties of static combinational gates:
Advertisements

Transmission Gate Based Circuits
Advanced Interconnect Optimizations. Buffers Improve Slack RAT = 300 Delay = 350 Slack = -50 RAT = 700 Delay = 600 Slack = 100 RAT = 300 Delay = 250 Slack.
4/22/ Clock Network Synthesis Prof. Shiyan Hu Office: EREC 731.
Introduction to CMOS VLSI Design Sequential Circuits
L 18 : Circuit Level Design 성균관대학교 조 준 동 교수
© Digital Integrated Circuits 2nd Inverter EE4271 VLSI Design The Inverter Dr. Shiyan Hu Office: EERC 518 Adapted and modified from Digital.
Minimizing Clock Skew in FPGAs
1 Lecture 28 Timing Analysis. 2 Overview °Circuits do not respond instantaneously to input changes °Predictable delay in transferring inputs to outputs.
Introduction to CMOS VLSI Design Lecture 19: Design for Skew David Harris Harvey Mudd College Spring 2004.
A Look at Chapter 4: Circuit Characterization and Performance Estimation Knowing the source of delays in CMOS gates and being able to estimate them efficiently.
Introduction to CMOS VLSI Design Clock Skew-tolerant circuits.
EE141 © Digital Integrated Circuits 2nd Timing Issues 1 Digital Integrated Circuits A Design Perspective Timing Issues Jan M. Rabaey Anantha Chandrakasan.
Synchronous Digital Design Methodology and Guidelines
CSE477 L19 Timing Issues; Datapaths.1Irwin&Vijay, PSU, 2002 CSE477 VLSI Digital Circuits Fall 2002 Lecture 19: Timing Issues; Introduction to Datapath.
Clock Design Adopted from David Harris of Harvey Mudd College.
RTL Hardware Design by P. Chu Chapter 161 Clock and Synchronization.
Chapter 11 Timing Issues in Digital Systems Boonchuay Supmonchai Integrated Design Application Research (IDAR) Laboratory August 20, 2004; Revised - July.
ECEN 248: INTRODUCTION TO DIGITAL SYSTEMS DESIGN Dr. Shi Dept. of Electrical and Computer Engineering.
Design of Variable Input Delay Gates for Low Dynamic Power Circuits
ENGIN112 L28: Timing Analysis November 7, 2003 ENGIN 112 Intro to Electrical and Computer Engineering Lecture 28 Timing Analysis.
EE42/100, Spring 2006Week 14a, Prof. White1 Week 14a Propagation delay of logic gates CMOS (complementary MOS) logic gates Pull-down and pull-up The basic.
S. Reda EN160 SP’08 Design and Implementation of VLSI Systems (EN1600) Lecture 14: Power Dissipation Prof. Sherief Reda Division of Engineering, Brown.
Lecture 8: Clock Distribution, PLL & DLL
© Digital Integrated Circuits 2nd Inverter CMOS Inverter: Digital Workhorse  Best Figures of Merit in CMOS Family  Noise Immunity  Performance  Power/Buffer.
Interconnect Optimizations
S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN0160) Lecture 22: Material Review Prof. Sherief Reda Division of Engineering, Brown University.
S. Reda EN160 SP’07 Design and Implementation of VLSI Systems (EN0160) Lecture 13: Power Dissipation Prof. Sherief Reda Division of Engineering, Brown.
Lecture 5 – Power Prof. Luke Theogarajan
Lecture 7: Power.
CS 151 Digital Systems Design Lecture 28 Timing Analysis.
Lecture 21, Slide 1EECS40, Fall 2004Prof. White Lecture #21 OUTLINE –Sequential logic circuits –Fan-out –Propagation delay –CMOS power consumption Reading:
9/20/2004EE 42 fall 2004 lecture 91 Lecture #9 Example problems with capacitors Next we will start exploring semiconductor materials (chapter 2). Reading:
Power, Energy and Delay Static CMOS is an attractive design style because of its good noise margins, ideal voltage transfer characteristics, full logic.
M.S.P.V.L. Polytechnic College, Pavoorchatram
The CMOS Inverter Slides adapted from:
On-Chip Communication Architectures
1 Delay Estimation Most digital designs have multiple data paths some of which are not critical. The critical path is defined as the path the offers the.
1. Department of Electronics Engineering Sahand University of Technology NMOS inverter with an n-channel enhancement-mode mosfet with the gate connected.
A comparison between different logic synthesis techniques from the digital switching noise viewpoint G. Boselli, V. Ciriani, V. Liberali G. Trucco Dept.
Logic Synthesis For Low Power CMOS Digital Design.
ASIC Design Flow – An Overview Ing. Pullini Antonio
1 EE 587 SoC Design & Test Partha Pande School of EECS Washington State University
Chapter 07 Electronic Analysis of CMOS Logic Gates
EE4800 CMOS Digital IC Design & Analysis
1 Interconnect/Via. 2 Delay of Devices and Interconnect.
Basics of Energy & Power Dissipation
Bi-CMOS Prakash B.
FPGA-Based System Design: Chapter 2 Copyright  2004 Prentice Hall PTR Topics n Logic gate delay. n Logic gate power consumption. n Driving large loads.
Interconnect/Via.
March 28, Glitch Reduction for Altera Stratix II devices Tomasz S. Czajkowski PhD Candidate University of Toronto Supervisor: Professor Stephen D.
Solid-State Devices & Circuits
FPGA-Based System Design: Chapter 6 Copyright  2004 Prentice Hall PTR Topics n Low power design. n Pipelining.
Modern VLSI Design 3e: Chapter 3 Copyright  1998, 2002 Prentice Hall PTR Topics n Electrical properties of static combinational gates: –transfer characteristics;
Clock Distribution Network
Z. Feng MTU EE4800 CMOS Digital IC Design & Analysis 6.1 EE4800 CMOS Digital IC Design & Analysis Lecture 6 Power Zhuo Feng.
Static Timing Analysis - II Sushant SinghSushant Singh.
Circuit Delay Performance Estimation Most digital designs have multiple signal paths and the slowest one of these paths is called the critical path Timing.
ELEC Digital Logic Circuits Fall 2015 Delay and Power Vishwani D. Agrawal James J. Danaher Professor Department of Electrical and Computer Engineering.
EE141 Timing Issues 1 Chapter 10 Timing Issues Rev /11/2003 Rev /28/2003 Rev /05/2003.
EE141 Timing Issues 1 Chapter 10 Timing Issues Rev /11/2003.
ELEC Digital Logic Circuits Fall 2014 Delay and Power Vishwani D. Agrawal James J. Danaher Professor Department of Electrical and Computer Engineering.
Chapter 9 CAPACITOR.
COE 360 Principles of VLSI Design Delay. 2 Definitions.
High Speed Properties of Digital Gates, Copyright F. Canavero, R. Fantino Licensed to HDT - High Design Technology
A lecture for Arduino Course, Winter 2017/18
Reading: Hambley Ch. 7; Rabaey et al. Sec. 5.2
Limitations of STA, Slew of a waveform, Skew between Signals
Timing Analysis 11/21/2018.
Chapter 10 Timing Issues Rev /11/2003 Rev /28/2003
Presentation transcript:

Introduction to Clock Tree Synthesis

Clock Jargon : Important Terms Clock to an SOC (System on a Chip) is like blood to a human body. Just the way blood flows to each and every part of the human body and regulates metabolism, clock reached each and every sequential device and controls the digital events inside an SOC. Clock Latency: Delay the clock signal takes between any two points. It can be from the source (PLL) to the source pin (Clock pin) of registers. Source Insertion Delay: This is the clock delay from the clock origin point, which could be PLL to the clock definition point. Network Insertion Delay: This refers to the clock delay from the clock definition point to the sink pink of the registers.

Representation of Clock based delays in clock path

Clock Skew:

The Clock Design Problem:

Uneven spread of flops on a chip:

Clock Tree:

Clock Trees:

What is Clock Tree Synthesis (CTS)? It is a process to ensure that clock gets distributed evenly to all sequential elements in a design. CTS is the design step to form clock tree distribution. Goal of the CTS is to minimize clock latency and skew. It is the step where the actual clock tree is synthesized. Before CTS timing tools use ideal clock arrival times. After CTS real clock distribution is available, therefore real clock arrival times are used.

CTS Quality Check- I:

False Data Capture (late clock/clock skew)

CTS Quality Check – II (Maintain PW of 1ns)

CTS Quality Check -III

CTS Quality Check - IV

CTS Quality Check - V

CTS Quality Check –VI

Skew generated due to close proximity of nets

Crosstalk might lead to a functional failure:

CTS Check List: SKEW PULSE WIDTH DUTY CYCLE LATENCY CLOCK TREE POWER SIGNAL INTEGRITY AND CROSS-TALK

H-Tree Algorithm Goal : The clock reaches each and every flop end point and fulfills all the quality check parameters as discussed. It works on divide and conquer approach. Divide the chip into equal parts.

Implemented H-Tree Quality Parameter Check : Skew

Modeling the RC Network Delay on a chip:

Degradation of Pulse Width due to RC Network

Pulse Width degradation kills performance: Duty Cycle Check is a more comprehensive Pulse Width check.

Implemented H-Tree Parameter check: Latency The clock input and output transition waveforms are way off. We need to optimize the RC network delays by reducing the RC stages to improve latency.

Implemented H-Tree Parameter Check : Power Clock Tree Power dissipation is a function of latency and the output transition. Clock signal continuously running from start to end without any breaks or help (Analogy: Athlete’s race). Energy consumed is humongous.

The bigger picture of the problem:

Clock Tree Modeling: Anything having a physical dimension has a finite resistance and capacitance. Clock end points are connected to flip flops which are load Capacitance(CL). This whole wire model can be represented as a lumped RC model. The clock port has a resistance Rd or the driver resistance.

The simplified RC Model:

Dependence of Time Constant on Length:

Clock Tree Buffering: Initially the entire Load Capacitance of wire was on the flop, leading to poor output transitions.

Quantifying the no. of buffers added

Buffer implementation despite few drawbacks: Buffer adds to the area cost, imagine area consumed if few buffers are added on each of the million clock paths on a chip. Load on the buffer is twice than the max capacity of decent operation. How about building a buffer tree at node A? (Analogy: Man pushing 4 blocks alone)

Critical conditions to be satisfied for a satisfactory clock tree built:

Validating the design conditions to be met:

Validating the Quality check parameters (Skew):

Quality Check parameter for H-Tree (PW)

Why Skew test passed and not the PW check? The level of buffering and load driven by each buffer being the same, results in a successful minimal (~0) skew. The output transition is better with the H- Tree as compared to previous results but the rise transition is greater than the fall transition at the output. This results in Pulse Width increase (Time period) and decrease of frequency. Further analysis of the buffer is required to get to our solution. The buffer used in the data path is not the same as the ones used in the clock path. Choice of clock buffers is a tradeoff between the performance and symmetry.

Examining unequal rise and fall times in PW check:

More resistive PMOS allows less current to pass:

Manipulating the size of the PMOS to achieve equal rise, fall times but leakage power(Clock Buffer):

Analyzing quality checks with clock buffers:

Power check as a function of latency:

Dynamic Power Issue: At every switch you consume a finite amount of power. Charging, discharging of capacitance can be related to filling and emptying water buckets. At any time instant all buffers are consuming power (this is small design).

Short Circuit Power: At an intermediate value of input both PMOS and NMOS are on, leading to a common path to ground and short circuit current flows through it.

Leakage Power: When Vgs > Vt, current flows from Source to Drain and Transistor is ON. Even when Vgs < Vt, small current flows called leakage current (steady state).