Presentation is loading. Please wait.

Presentation is loading. Please wait.

COMPARISON OF ADAPTIVE VOLTAGE/FREQUENCY SCALING AND ASYNCHRONOUS PROCESSOR ARCHITECTURES FOR NEURAL SPIKE SORTING J. Leverett A. Pratt R. Hochman May.

Similar presentations


Presentation on theme: "COMPARISON OF ADAPTIVE VOLTAGE/FREQUENCY SCALING AND ASYNCHRONOUS PROCESSOR ARCHITECTURES FOR NEURAL SPIKE SORTING J. Leverett A. Pratt R. Hochman May."— Presentation transcript:

1 COMPARISON OF ADAPTIVE VOLTAGE/FREQUENCY SCALING AND ASYNCHRONOUS PROCESSOR ARCHITECTURES FOR NEURAL SPIKE SORTING J. Leverett A. Pratt R. Hochman May 2013 – EE241 Final Project

2 Introduction to ‘Spikes’ Frequency Range10-500Hz Duration3-5ms Resolution8-bit ‘Spikes,’ also called ‘action potentials,’ are when the electrical potential of a neuron rises shortly. It is part of the communication protocol of the brain.

3 Spike Sorting Systems Operate at constant 37°C Extremely low power Extremely low area

4 Purpose Statement Adaptive Voltage and Frequency Scaling Asynchronous Timing vs. Determine the conditions under which adaptive voltage and frequency scaling provides a better power performance than asynchronous timing in a neural spike processor. under which asynchronous timing provides a better power performance than adaptive voltage and frequency scaling in a neural spike processor.

5 Theory E = I*VDD*t If you can decrease voltage and/or current faster than the increase in delay for an operation, you save energy. Asynchronous circuits process data at the same energy ‘every time.’ When data rates are low, I and VDD may be reduced such that the energy per spike of a synchronous solution is lower than the asynchronous solution. The issue is that at some point, leakage current dominates, making increased operating time undesirable.

6 Verilog Function Detection of the largest value in a spike and the point at which it occurs facilitates spike sorting. Sorting spikes on chip results in a reduced amount of transmitted data.

7 Synchronous Logic

8 Prior Art

9 Asynchronous Logic

10 Data Collection Methodology Determination of max. frequency and E vs. VDD. Known Simplifications Process Corners Entire Verification Space Noise/Margins Critical Path Monitors Adaptive control circuitry Synthesis Place and Route.sp netlist E vs. f Data VerilogCadencehSpice Matlab sim LVS sim *RC Parasitic extraction Logic Demonstrations of functionality with synch and asynch.

11 Spice Data: Energy/Op vs. Frequency Asynchronous: 300 less MOSFETS >5000 less Rs and Cs At 20kHz: Synch: 1.9E-9A – 7.6pJ Asynch: 1.71E-9A, 6.8pJ

12 Spice Data: Current Breakdown - Synchronous

13 Assumptions Process Corners Only used TT as effects should be linear on both architectures. Verification Space Test vector data was designed to be representative, not the worse case scenario. Testing across a wide range of frequencies and voltages is an open problem. Noise/Margins Assumed no noise for this analysis. Noise should hurt both circuits equally.

14 Subthreshold critical path monitors are highly sensitive to PVT both globally and locally. Current is exponential with changes in,, and T. Implies a currently unachievable level of specificity. First order approximation might be to double energy. Critical Path Monitor Adaptive Control Circuitry, Clock, + Vdd Assuming that the adaptive frequencies are less than the global clock, we can implement the change in frequency with a clock divider (counter). Voltage scaling circuitry would take the global supply and reduce it using one of many possible techniques. –Linear Regulators: Inefficient for large changes in voltage. –Capacitive Regulator: Results in supply ripples. –Magnetic Regulator: Highest efficiency, but requires external parts

15 Matlab: Transient State Machine at.5MHz

16 Matlab: Transient State Machine at.6Mhz

17 Matlab: Transient State Machine at 5Mhz

18 Matlab: Total Energy + Vdd vs. Event Frequency

19 Conclusion Asynchronous logic operates at a higher frequency for all supply values than the synchronous version. Similarly, for an equal operating frequencies, asynchronous can operate at a lower VDD. For the speed of a neural processor, both circuits operate at the lowest supply voltage and leakage current dominates. There is no means to perform adaptive analysis. In an application where incoming data frequency is on the order of magnitude of the intrinsic asynchronous circuit operating frequency, the adaptive circuit requires less energy, as long as the average voltage required of the adaptive circuit is less than the intrinsic voltage of the asynchronous circuit.

20 References and Acknowledgements Acknowledgements: Brian Zimmer, Nathan Narevsky, Jan Rabaey, Stevo Bailey, Tsung-Te Liu, Bora Nikolic

21 APPENDIX

22 Spice Data: Power vs. Max Operating Freq.


Download ppt "COMPARISON OF ADAPTIVE VOLTAGE/FREQUENCY SCALING AND ASYNCHRONOUS PROCESSOR ARCHITECTURES FOR NEURAL SPIKE SORTING J. Leverett A. Pratt R. Hochman May."

Similar presentations


Ads by Google