CSE477 L11 Fast Logic.1Irwin&Vijay, PSU, 2002 CSE477 VLSI Digital Circuits Fall 2002 Lecture 11: Designing for Speed Mary Jane Irwin ( www.cse.psu.edu/~mji.

Slides:



Advertisements
Similar presentations
COMBINATIONAL LOGIC [Adapted from Rabaey’s Digital Integrated Circuits, ©2002, J. Rabaey et al.]
Advertisements

COMBINATIONAL LOGIC DYNAMICS
Designing Static CMOS Logic Circuits
Topics Electrical properties of static combinational gates:
9/15/05ELEC / Lecture 71 ELEC / (Fall 2005) Special Topics in Electrical Engineering Low-Power Design of Electronic Circuits.
Progettazione di circuiti e sistemi VLSI La logica combinatoria
Elettronica T A.A Digital Integrated Circuits © Prentice Hall 2003 Inverter CMOS INVERTER.
Digital Integrated Circuits A Design Perspective
Lecture 9: Combinational Circuit Design. CMOS VLSI DesignCMOS VLSI Design 4th Ed. 10: Combinational Circuits2 Outline  Bubble Pushing  Compound Gates.
A Look at Chapter 4: Circuit Characterization and Performance Estimation Knowing the source of delays in CMOS gates and being able to estimate them efficiently.
Fall 06, Sep 19, 21 ELEC / Lecture 6 1 ELEC / (Fall 2005) Special Topics in Electrical Engineering Low-Power Design of Electronic.
Combinational Circuits
Designing Combinational Logic Circuits: Part2 Alternative Logic Forms:
Digital Integrated Circuits A Design Perspective
Logical Effort.
© Digital Integrated Circuits 2nd Inverter CMOS Inverter: Digital Workhorse  Best Figures of Merit in CMOS Family  Noise Immunity  Performance  Power/Buffer.
Lecture #24 Gates to circuits
Outline Noise Margins Transient Analysis Delay Estimation
Lecture 4 – Logical Effort
EE141 © Digital Integrated Circuits 2nd Combinational Circuits 1 Logical Effort - sizing for speed.
Copyright Agrawal, 2007 ELEC6270 Fall 07, Lecture 6 1 ELEC 5270/6270 Fall 2007 Low-Power Design of Electronic Circuits Dynamic Power: Device Sizing Vishwani.
Digital Integrated Circuits A Design Perspective
Digital Integrated Circuits A Design Perspective
Digital Integrated Circuits A Design Perspective
CMOS VLSI For Computer Engineering Lecture 4 – Logical Effort Prof. Luke Theogarajan parts adapted form Harris – and Rabaey-
S. RossEECS 40 Spring 2003 Lecture 24 Today we will Review charging of output capacitance (origin of gate delay) Calculate output capacitance Discuss fan-out.
Digital Integrated Circuits© Prentice Hall 1995 Combinational Logic COMBINATIONAL LOGIC.
Lecture 21, Slide 1EECS40, Fall 2004Prof. White Lecture #21 OUTLINE –Sequential logic circuits –Fan-out –Propagation delay –CMOS power consumption Reading:
EE415 VLSI Design THE INVERTER DYNAMICS [Adapted from Rabaey’s Digital Integrated Circuits, ©2002, J. Rabaey et al.]
1 Delay Estimation Most digital designs have multiple data paths some of which are not critical. The critical path is defined as the path the offers the.
Mary Jane Irwin ( ) Modified by Dr. George Engel (SIUE)
Mary Jane Irwin ( ) CSE477 VLSI Digital Circuits Fall 2002 Lecture 04: CMOS Inverter (static view) Mary Jane.
Review: CMOS Inverter: Dynamic
EE415 VLSI Design DYNAMIC LOGIC [Adapted from Rabaey’s Digital Integrated Circuits, ©2002, J. Rabaey et al.]
Elmore Delay, Logical Effort
CSE477 L17 Static Sequential Logic.1Irwin&Vijay, PSU, 2002 CSE477 VLSI Digital Circuits Fall 2002 Lecture 17: Static Sequential Circuits Mary Jane Irwin.
Modern VLSI Design 2e: Chapter 3 Copyright  1998 Prentice Hall PTR Topics n Electrical properties of static combinational gates: –transfer characteristics;
Review: Designing Inverters for Performance  Reduce C L l internal diffusion capacitance of the gate itself l interconnect capacitance l fanout  Increase.
Notices You have 18 more days to complete your final project!
CSE477 L24 RAM Cores.1Irwin&Vijay, PSU, 2002 CSE477 VLSI Digital Circuits Fall 2002 Lecture 24: RAM Cores Mary Jane Irwin ( )
Logical Effort and Transistor Sizing Digital designs are usually expected to operate at high frequencies, thus designers often have to choose the fastest.
Introduction to CMOS VLSI Design Lecture 5: Logical Effort GRECO-CIn-UFPE Harvey Mudd College Spring 2004.
EE141 © Digital Integrated Circuits 2nd Combinational Circuits 1 Digital Integrated Circuits A Design Perspective Designing Combinational Logic Circuits.
ECE442: Digital ElectronicsSpring 2008, CSUN, Zahid Static CMOS Logic ECE442: Digital Electronics.
Designing Combinational Logic Circuits
Chapter 6 (I) Designing Combinational Logic Circuits Static CMOS
Linear Delay Model In general the propagation delay of a gate can be written as: d = f + p –p is the delay due to intrinsic capacitance. –f is the effort.
CMOS Inverter: Dynamic V DD RnRn V out = 0 V in = V DD CLCL t pHL = f(R n, C L )  Transient, or dynamic, response determines the maximum speed at which.
CSE477 L07 Pass Transistor Logic.1Irwin&Vijay, PSU, 2003 CSE477 VLSI Digital Circuits Fall 2003 Lecture 07: Pass Transistor Logic Mary Jane Irwin (
Inverter Chapter 5 The Inverter April 10, Inverter Objective of This Chapter  Use Inverter to know basic CMOS Circuits Operations  Watch for performance.
EE141 © Digital Integrated Circuits 2nd Inverter 1 Digital Integrated Circuits A Design Perspective The Inverter Jan M. Rabaey Anantha Chandrakasan Borivoje.
Spring 2010, Mar 10ELEC 7770: Advanced VLSI Design (Agrawal)1 ELEC 7770 Advanced VLSI Design Spring 2010 Gate Sizing Vishwani D. Agrawal James J. Danaher.
Switch Logic EE141.
Digital Integrated Circuits A Design Perspective
Combinatorial Logic Circuits
EE141 Combinational Circuits 1 Chapter 6 Designing Combinational Logic Circuits November 2002.
Chapter 6 Static CMOS Circuits Boonchuay Supmonchai Integrated Design Application Research (IDAR) Laboratory August, 2004; Revised - June 28, 2005.
EE141 © Digital Integrated Circuits 2nd Combinational Circuits 1 Digital Integrated Circuits A Design Perspective Designing Combinational Logic Circuits.
EE210 Digital Electronics Class Lecture 10 April 08, 2009
EE141 © Digital Integrated Circuits 2nd Combinational Circuits 1 A few notes for your design  Finger and multiplier in schematic design  Parametric analysis.
Modern VLSI Design 3e: Chapter 3 Copyright  1998, 2002 Prentice Hall PTR Topics n Electrical properties of static combinational gates: –transfer characteristics;
EE415 VLSI Design. Read 4.1, 4.2 COMBINATIONAL LOGIC.
EE534 VLSI Design System Summer 2004 Lecture 12:Chapter 7 &9 Transmission gate and Dynamic logic circuits design approaches.
Modern VLSI Design 3e: Chapter 4 Copyright  1998, 2002 Prentice Hall PTR Topics n Transistor sizing: –Spice analysis. –Logical effort.
EE141 Combinational Circuits 1 Chapter 6 (I) Designing Combinational Logic Circuits Static CMOSStatic CMOS Pass Transistor LogicPass Transistor Logic V1.0.
Circuit Delay Performance Estimation Most digital designs have multiple signal paths and the slowest one of these paths is called the critical path Timing.
CSE477 L06 Static CMOS Logic.1Irwin&Vijay, PSU, 2003 CSE477 VLSI Digital Circuits Fall 2003 Lecture 06: Static CMOS Logic Mary Jane Irwin (
CSE477 VLSI Digital Circuits Fall 2002 Lecture 14: Midterm Review
COMBINATIONAL LOGIC - 2.
Presentation transcript:

CSE477 L11 Fast Logic.1Irwin&Vijay, PSU, 2002 CSE477 VLSI Digital Circuits Fall 2002 Lecture 11: Designing for Speed Mary Jane Irwin ( ) [Adapted from Rabaey’s Digital Integrated Circuits, ©2002, J. Rabaey et al.]

CSE477 L11 Fast Logic.2Irwin&Vijay, PSU, 2002 Cray was a legend in computers … said that he liked to hire inexperienced engineers right out of school, because they do not usually know what’s supposed to be impossible. The Soul of a New Machine, Kidder, pg. 77

CSE477 L11 Fast Logic.3Irwin&Vijay, PSU, 2002 Review: CMOS Inverter: Dynamic V DD RnRn V out V in = V DD CLCL t pHL = f(R n, C L ) t pHL = 0.69 R eqn C L t pHL = 0.69 (3/4 (C L V DD )/ I DSATn ) = 0.52 C L / (W/L n k’ n V DSATn )

CSE477 L11 Fast Logic.4Irwin&Vijay, PSU, 2002 Review: Designing Inverters for Performance  Reduce C L l internal diffusion capacitance of the gate itself l interconnect capacitance l fanout  Increase W/L ratio of the transistor l the most powerful and effective performance optimization tool in the hands of the designer l watch out for self-loading!  Increase V DD l only minimal improvement in performance at the cost of increased energy dissipation  Slope engineering - keeping signal rise and fall times smaller than or equal to the gate propagation delays and of approximately equal values l good for performance l good for power consumption

CSE477 L11 Fast Logic.5Irwin&Vijay, PSU, 2002 Switch Delay Model A R eq A RpRp A RpRp A RnRn CLCL A C int CLCL A RnRn A RpRp B RpRp B RnRn NAND INVERTER B RpRp A RpRp A RnRn B RnRn CLCL NOR

CSE477 L11 Fast Logic.6Irwin&Vijay, PSU, 2002 Input Pattern Effects on Delay  Delay is dependent on the pattern of inputs  Low to high transition l both inputs go low -delay is 0.69 R p /2 C L since two p-resistors are on in parallel l one input goes low -delay is 0.69 R p C L  High to low transition l both inputs go high -delay is R n C L  Adding transistors in series (without sizing) slows down the circuit CLCL A RnRn A RpRp B RpRp B RnRn C int

CSE477 L11 Fast Logic.7Irwin&Vijay, PSU, 2002 Delay Dependence on Input Patterns A=B=1  0 A=1, B=1  0 A=1  0, B=1 time, psec Voltage, V Input Data Pattern Delay (psec) A=B=0  1 69 A=1, B=0  1 62 A= 0  1, B=1 50 A=B=1  0 35 A=1, B=1  0 76 A= 1  0, B= input NAND with NMOS = 0.5  m/0.25  m PMOS = 0.75  m/0.25  m C L = 10 fF

CSE477 L11 Fast Logic.8Irwin&Vijay, PSU, 2002 Transistor Sizing CLCL B RnRn A RpRp B RpRp A RnRn C int B RpRp A RpRp A RnRn B RnRn CLCL

CSE477 L11 Fast Logic.9Irwin&Vijay, PSU, 2002 Transistor Sizing a Complex CMOS Gate OUT = !(D + A (B + C)) D A BC D A B C

CSE477 L11 Fast Logic.10Irwin&Vijay, PSU, 2002 Transistor Sizing a Complex CMOS Gate OUT = !(D + A (B + C)) D A BC D A B C

CSE477 L11 Fast Logic.11Irwin&Vijay, PSU, 2002 Fan-In Considerations DCBA D C B A CLCL C3C3 C2C2 C1C1 Distributed RC model (Elmore delay) t pHL = 0.69 R eqn (C 1 +2C 2 +3C 3 +4C L ) Propagation delay deteriorates rapidly as a function of fan-in – quadratically in the worst case.

CSE477 L11 Fast Logic.12Irwin&Vijay, PSU, 2002 t p as a Function of Fan-In t pH L t pL H t p (psec) fan-in quadratic function of fan-in linear function of fan-in  Gates with a fan-in greater than 4 should be avoided. tptp

CSE477 L11 Fast Logic.13Irwin&Vijay, PSU, 2002 Fast Complex Gates: Design Technique 1  Transistor sizing l as long as fan-out capacitance dominates  Progressive sizing In N CLCL C3C3 C2C2 C1C1 In 1 In 2 In 3 M1 M2 M3 MN Distributed RC line M1 > M2 > M3 > … > MN (the fet closest to the output should be the smallest) Can reduce delay by more than 20%; decreasing gains as technology shrinks

CSE477 L11 Fast Logic.14Irwin&Vijay, PSU, 2002 Fast Complex Gates: Design Technique 2  Input re-ordering l when not all inputs arrive at the same time C2C2 C1C1 In 1 In 2 In 3 M1 M2 M3 CLCL C2C2 C1C1 In 3 In 2 In 1 M1 M2 M3 CLCL critical path 1 010 101 charged

CSE477 L11 Fast Logic.15Irwin&Vijay, PSU, 2002 Fast Complex Gates: Design Technique 2  Input re-ordering l when not all inputs arrive at the same time C2C2 C1C1 In 1 In 2 In 3 M1 M2 M3 CLCL C2C2 C1C1 In 3 In 2 In 1 M1 M2 M3 CLCL critical path charged 1 0101 1 delay determined by time to discharge C L, C 1 and C 2 delay determined by time to discharge C L 1 1 0101 charged discharged

CSE477 L11 Fast Logic.16Irwin&Vijay, PSU, 2002 Sizing and Ordering Effects DCBA D C B A CLCL C3C3 C2C2 C1C1 Progressive sizing in pull-down chain gives up to a 23% improvement. Input ordering saves 5% critical path A – 23% critical path D – 17% = 100 fF

CSE477 L11 Fast Logic.17Irwin&Vijay, PSU, 2002 Fast Complex Gates: Design Technique 3  Alternative logic structures F = ABCDEFGH

CSE477 L11 Fast Logic.18Irwin&Vijay, PSU, 2002 Fast Complex Gates: Design Technique 4  Isolating fan-in from fan-out using buffer insertion CLCL CLCL  Real lesson is that optimizing the propagation delay of a gate in isolation is misguided.

CSE477 L11 Fast Logic.19Irwin&Vijay, PSU, 2002 Fast Networks: Design Technique 5 - Logical Effort  The optimum fan-out for a chain of N inverters driving a load C L is f =  (C L /C in ) l so, if we can, keep the fan-out per stage around 4.  Can the same approach (logical effort) be used for any combinational circuit? l For a complex gate, we expand the inverter equation t p = t p0 (1 + C ext /  C g ) = t p0 (1 + f/  ) to t p = t p0 (p + g f/  ) -t p0 is the intrinsic delay of an inverter -f is the effective fan-out (C ext /C g ) – also called the electrical effort -p is the ratio of the instrinsic (unloaded) delay of the complex gate and a simple inverter (a function of the gate topology and layout style) -g is the logical effort N

CSE477 L11 Fast Logic.20Irwin&Vijay, PSU, 2002 Intrinsic Delay Term, p  The more involved the structure of the complex gate, the higher the intrinsic delay compared to an inverter Gate Typep Inverter1 n-input NANDn n-input NORn n-way mux2n XOR, XNORn 2 n-1 Ignoring second order effects such as internal node capacitances

CSE477 L11 Fast Logic.21Irwin&Vijay, PSU, 2002 Logical Effort Term, g  g represents the fact that, for a given load, complex gates have to work harder than an inverter to produce a similar (speed) response l the logical effort of a gate tells how much worse it is at producing an output current than an inverter (how much more input capacitance a gate presents to deliver it same output current) Gate Typeg (for 1 to 4 input gates) 1234 Inverter1 NAND4/35/3(n+2)/3 NOR5/37/3(2n+1)/3 mux222 XOR412

CSE477 L11 Fast Logic.22Irwin&Vijay, PSU, 2002 Example of Logical Effort  Assuming a pmos/nmos ratio of 2, the input capacitance of a minimum-sized inverter is three times the gate capacitance of a minimum-sized nmos (C unit ) A + B A B AB A B A B AB A A A

CSE477 L11 Fast Logic.23Irwin&Vijay, PSU, 2002 Example of Logical Effort  Assuming a pmos/nmos ratio of 2, the input capacitance of a minimum-sized inverter is three times the gate capacitance of a minimum-sized nmos (C unit ) A + B A B AB A B A B AB A A A 2 1 C unit = C unit = C unit = 5

CSE477 L11 Fast Logic.24Irwin&Vijay, PSU, 2002 Delay as a Function of Fan-Out  The slope of the line is the logical effort of the gate  The y-axis intercept is the intrinsic delay normalized delay fan-out f NAND2: g=4/3, p = 2 INV: g=1, p=1 intrinsic delay effort delay  Can adjust the delay by adjusting the effective fan-out (by sizing) or by choosing a gate with a different logical effort  Gate effort: h = fg

CSE477 L11 Fast Logic.25Irwin&Vijay, PSU, 2002 Path Delay of Complex Logic Gate Network  Total path delay through a combinational logic block t p =  t p,j = t p0  (p j + (f j g j )/  )  So, the minimum delay through the path determines that each stage should bear the same gate effort f 1 g 1 = f 2 g 2 =... = f N g N  Consider optimizing the delay through the logic network how do we determine a, b, and c sizes? 1 a b c CLCL 5

CSE477 L11 Fast Logic.26Irwin&Vijay, PSU, 2002 Path Delay Equation Derivation  The path logical effort, G =  g i  And the path effective fan-out (path electrical effort) is F = C L /g 1  The branching effort accounts for fan-out to other gates in the network b = (C on-path + C off-path )/C on-path  The path branching effort is then B =  b i  And the total path effort is then H = GFB  So, the minimum delay through the path is D = t p0 (  p j + (N  H)/  ) N

CSE477 L11 Fast Logic.27Irwin&Vijay, PSU, 2002 Path Delay of Complex Logic Gates, con’t  For gate i in the chain, its size is determined by s i = (g 1 s 1 )/g i  (f j /b j ) j=1 i-1 1 a b c CLCL 5  For this network l F = C L /C g1 = 5 l G = 1 x 5/3 x 5/3 x 1 = 25/9 l B = 1 (no branching) l H = GFB = 125/9, so the optimal stage effort is  H = Fan-out factors are f 1 =1.93, f 2 =1.93 x 3/5 = 1.16, f 3 = 1.16, f 4 = 1.93 l So the gate sizes are a = f 1 g 1 /g 2 = 1.16, b = f 1 f 2 g 1 /g 3 = 1.34 and c = f 1 f 2 f 3 g 1 /g 4 =

CSE477 L11 Fast Logic.28Irwin&Vijay, PSU, 2002 Fast Complex Gates: Design Technique 6  Reducing the voltage swing l linear reduction in delay l also reduces power consumption l requires use of “sense amplifiers” on the receiving end to restore the signal level (will look at their design when covering memory design) t pHL = 0.69 (3/4 (C L V DD )/ I DSATn ) = 0.69 (3/4 (C L V swing )/ I DSATn )

CSE477 L11 Fast Logic.29Irwin&Vijay, PSU, 2002 TG Logic Performance  Effective resistance of the TG is modeled as a parallel connection of R p (= (V DD – V out )/(-I Dp )) and R n (=V DD – V out )/I Dn ) V out, V Resistance, k  RpRp RnRn 2.5V 0V 2.5VV out RpRp RnRn R eq = R n || R p W/L n =0.50/0.25 W/L p =0.50/0.25  So, the assumption that the TG switch has a constant resistive value, R eq, is acceptable

CSE477 L11 Fast Logic.30Irwin&Vijay, PSU, 2002 Delay of a TG Chain CCCC VNVN V1V1 ViVi V i CCCC R eq V in VNVN V1V1 ViVi V i+1 V in  Delay of the RC chain (N TG’s in series) is t p (V n ) = 0.69  kCR eq = 0.69 CR eq (N(N+1))/2  0.35 CR eq N 2 k=1 N

CSE477 L11 Fast Logic.31Irwin&Vijay, PSU, 2002 TG Delay Optimization  Can speed it up by inserting buffers every M switches  Delay of buffered chain (M TG’s between buffer) t p = 0.69  N/M CR eq (M(M+1))/2  + (N/M - 1) t pbuf M opt = 1.7  (t pbuf /CR eq )  3 or 4 C V in VNVN CC C 5 0 C M

CSE477 L11 Fast Logic.32Irwin&Vijay, PSU, 2002 Next Lecture and Reminders  Next lecture l Designing energy efficient logic -Reading assignment – Rabaey, et al, 5.5 &  Reminders l HW3 due Oct 10 th (hand in to TA) l Class cancelled on Oct 10 th as make up for evening midterm l I will be out of town Oct 10 th through Oct 15 th and Oct 18 th through Oct 23 rd, so office hours during those periods are cancelled l We will have a guest lecturer on Oct 22 nd l Evening midterm exam scheduled -Wednesday, October 16 th from 8:15 to 10:15pm in 260 Willard -Only one midterm conflict filed for so far