Presentation is loading. Please wait.

Presentation is loading. Please wait.

High-performance Cortex™-M4 MCU

Similar presentations

Presentation on theme: "High-performance Cortex™-M4 MCU"— Presentation transcript:

1 High-performance Cortex™-M4 MCU
STM32 F4 series High-performance Cortex™-M4 MCU

2 Announcement highlights
The STM32 F4 series brings to the market the world’s highest performance Cortex™-M microcontrollers 168 MHz FCPU/210 DMIPS 363 Coremark score The STM32 F4 series extends the STM32 portfolio 250+ compatible devices already in production, including the F1 series, F2 series and ultra-low-power L1 series The STM32 F4 series reinforces ST’s current leadership in Cortex-M microcontrollers, with 45% world market share by units in (2010 or cumulated 2007 to Q1/11) according to ARM reporting 1 2 3

3 STM32 F4 series: Most powerful Cortex-M
10-Apr-17 Key features The STM32 microcontroller family brings a new degree of freedom to MCU users by combining five key benefits: -168 MHz CPU speed, ART Accelerator and multi-AHB bus offers faster response time and more processing capability. - Leading-edge performance and digital signal processing capability thanks to its 32-bit ARM Cortex-M4 core with floating point unit - Outstanding dynamic power efficiency with scaling (multi gear voltage regulator: mid performance and high performance settings), with low-voltage operation, low-power modes and a sub microamp real-time clock - A rich set of first-class peripherals, such as a 2x full duplex I²S, fast 12-bit ADC, dual CAN, advanced timers, Ethernet and USB OTG Maximum integration, with everything you need to operate the MCU actually included in the device, such as oscillators, PLL, regulators and reset circuit - And an excellent tool and software ecosystem with one of the broadest offers for IDE, Metalanguage tools such as Matlab, DSP library, starter kits, software libraries and stacks 1 3

4 STM32 F4: World’s #1 in performance
Dhrystone It takes ART to be #1 in performance: It is a combination of core, embedded Flash design, process, acceleration techniques. 1

5 STM32F4 versus competitors (Coremark)

6 ST’s ART Accelerator™ The adaptive real-time memory accelerator unleashes the Cortex-M4 core’s maximum processing performance equivalent to 0-wait state execution Flash up to 168 MHz The ART Accelerator is an ST technology using a prefetch queue and branch cache technology to access 128-bit wide Flash. Based on CoreMark benchmark, the result is a performance equivalent to 0-wait state execution from Flash. The branch cache is made of 64 × 128-bit buffers for code and 8 x 128-bit buffers for data. Each time an event such as subroutine call, an interrupt or a branch occurs and breaks the linear execution of the code, the ART Accelerator checks if the first instructions of the event are already stored in the cache, and if so, the instruction is immediately pushed to the prefetch queue with no performance loss. The prefetch cache is so deep that most applications achieve a 0-wait state execution performance from Flash. 1

7 Real-time performance
User interface: DMA transfers of the graphical icons from Flash to display Decompressed audio stream to 112kByte SRAM block Access to the MP3 data for decompression Compressed audio stream (MP3) to 16kByte SRAM block DMA transfer to audio output stage (I2S) MP3 decoder code execution by core 32-bit multi-AHB bus matrix In addition to a performance equivalent to 0-wait execution from Flash using the ART Accelerator, the 32-bit multi-AHB bus matrix interconnects all masters (at the top of the chart: CPU, DMAs, Ethernet, USB HS) and slaves (on the right-hand side: Flash memory, SRAM, FSMC, AHB and APB peripherals) and ensures a seamless and efficient operation even when several high-speed peripherals are working simultaneously. The nodes on the matrix represent the actual connections over the 7-layer matrix, between master and slave. For instance, let’s take the example of an MP3 audio player with graphical user interface. A compressed audio stream coming from the USB high speed peripheral is directly stored in the 16kB system RAM block thanks to the DMA controller dedicated to the USB high speed peripheral. In parallel, the CPU can process the audio data (MP3 decompression and/or equalizer) taking full advantage of the DSP extension of CM4. The processed data are written into the 112kB system RAM block and are transferred automatically with the system DMA controller to the I2S interface which transmit synchronously the audio data to an external CODEC. In parallel, the second system DMA controller can fetch graphical content from the flash to build automatically the Graphical User Interface on the external display connected to the STM32F4 through the FSMC interface. 1

8 STM32 F4 series High-performance digital signal controller
Single precision Ease of use Better code efficiency Faster time to market Eliminate scaling and saturation Easier support for meta-language tools FPU What is Cortex-M4? Ease of use of C programming Interrupt handling Ultra-low power MCU Harvard architecture Single-cycle MAC Barrel shifter DSP Based on the ARM Cortex-M4 core, the STM32 F4 series combines the control performances of the Cortex-M3 core and the DSP capability of a single cycle DSP MAC for data processing. In addition, the STM32 F4 embeds a single precision FPU. Cortex-M4 1

9 Outstanding power efficiency
230 μA/MHz, 38.6 mA at 168 MHz executing Coremark benchmark from Flash memory (with peripherals off), made possible with: ST’s 90 nm process allowing the CPU core to run at only 1.2 V 230 µA/MHz, 38.6 mA at 168 MHz (executing Coremark benchmark from Flash memory, with ART Accelerator™ enabled and all peripherals off), 1.2 V is supplied by an internal regulator with power scaling support: the user can choose between 2 settings of the regulator (lower voltage for mid performance with 10% less dynamic consumption, higher voltage for high performance). VDD min down to 1.7 V (on all packages except the LQFP64 which offers 1.8 V min) Besides the LQFP64, all packages offer the option, through a dedicated pin, to turn off the brownout protection which is trimmed to reset the microcontroller below 1.8 V. When the brownout is disabled, the product can work down to 1.7 V min. This feature allows the product to be used in low-voltage applications where a 1.8 V regulator (+/- 5%) requires the microprocessor to work down to 1.7 V. Such applications usually provide a reset circuit that handles the reset management of the microcontroller. Low power modes with backup and SRAM and RTC support. For instance, in Vbat mode, the RTC and backup SRAM typically draw less 1uA each Typical values in VBAT mode ART Accelerator™ reducing the number of accesses to Flash Voltage scaling to optimize performance/power consumption VDD min down to 1.7 V Low-power modes with backup SRAM and RTC support 1

10 Low power and real life applications
Low power in real life applications is not just Low-power mode Need to consider the % of time spend in LP mode and in Run mode Run Low power % Low power mode time mA/MHz % Mode Average consumption time mA/MHz 1

11 Average consumption If competitors are claiming better low-power modes, these are only an advantage if the overall system is spending more than 90% of the time doing nothing in low-power mode MHz 50 Competitor F 40 Competitor R 30 STM32 F4 20 10 % Run 1 20 40 60 80 100

12 Superior and innovative peripherals
168 MHz and ADC 2.4 MSPS Ethernet with IEEE 1588v2 HW crypto/hash coprocessor and <1 µA RTC Audio architecture 2 USB OTG 2 full duplex I²S The audio class architecture of the F4 series adds important features compared to the F2 series: The 2 I²S peripherals are now full duplex. Features available in the F2 series and maintained in the F4 series: The USB start of frame output pin is still present to simplify synchronization of the USB flow with the I²S flow. The dedicated audio PLL allows independent configuration of the I²S clock from the rest of the system (USB, Ethernet, CPU, etc.). The camera interface is a universal 8- to 14-bit parallel interface (no industry standing name). It supports the following data formats: - 8-bit progressive video monochrome or raw Bayer - YCbCr 4:2:2 progressive video - RGB 565 progressive video compressed data (like JPEG) It also supports the following features: - continuous mode or snapshot (single-frame) mode - automatic image cropping - 8-word FIFO - AHB slave interface with the capability of controlling the GP-DMA (request/acknowledge) using 1 channel - Various interrupt flags such as end-of-line, end-of-frame, vertical-synchronization, overrun or errors flags 1

13 Maximum integration The 1-Mbyte Flash and 192-Kbyte SRAM memories available in the product accommodate advanced software stacks and user data, with no need for external memories 4-Kbyte SRAM battery back-up: EEPROM used to save application state, calibration data In addition, 528 bytes of OTP memory make it possible to store critical user data such as Ethernet MAC addresses or cryptographic keys 1

14 STM32 – leading Cortex-M portfolio
Over 250 pin-to-pin compatible part numbers 2

15 STM32 product series 4 product series 2

16 Extensive tools and SW Evaluation board for full product feature evaluation Hardware evaluation platform for all interfaces Possible connection to all I/Os and all peripherals Discovery kit for cost-effective evaluation and prototyping Large choice of development IDE solutions from the STM32 and ARM ecosystem STM3240G-EVAL $349 STM32F4DISCOVERY   $14.90 2

17 STM32 F4 block diagram 2 Feature highlight 168 MHz Cortex-M4 CPU
Floating point unit (FPU) ART Accelerator TM Multi-level AHB bus matrix 1-Mbyte Flash, 192-Kbyte SRAM 1.7 to 3.6 V supply RTC: <1 µA typ, sub second accuracy 2x full duplex I²S 3x 12-bit ADC µs/2.4 MSPS 168 MHz timers 1.7 V on all packages except LQFP64 which offers 1.8 V VDDmin 3x 2.4 MSPS ADC (up to 7.2 MSPS in interleaved mode) 2

18 STM32 F4 portfolio 2 18 Scalable product offer
- Get the right product from more than 30 derivatives Develop your platform or upgrade your application by taking a larger package or larger memory size or richer set of peripherals All part numbers are in production except the WLCSP available in TBC. 2 18

19 STM32 F4 series – applications served
Building Security/fire/HVAC Test and measurement Consumer Communication Points of sale/inventory management Industrial automation and solar panels Transportation Medical Points of sale/inventory: 2-D bar code scanners using the camera interface, FSMC and DSP+FPU processing power. Industrial automation and solar panels: communication gateways or control nodes using the dual CAN, advanced timers running up to 168 MHz, Ethernet PTP, FSMC and MCU processing power. Transportation: communication gateways using the high-density Flash and SRAM for SW stacks, CAN, multiple USARTs. Medical: single-chip solution for motor control and user interfaces, using the advanced timers, processing power and 2.4 MSPS ADCs. Building: control panels using the FSMC, processing power, high-density Flash and SRAM. Security/fire/HVAC: IP cameras, access control, HVAC controllers using the Ethernet, 2 advanced timers, processing power and 2.4 MSPS ADCs. Test and measurement: industrial sensors using the camera interface, DSP and FPU processing power, the low dynamic power, the 4-Kbyte back-up SRAM and 2.4 MSPS ADCs. Consumer: Bluetooth headsets, mobile phone accessories using the 1.7 V min, the WLCSP package, the DSP processing power, high-density SRAM, and audio architecture with full duplex I²S. Communications: office phones using Ethernet, the high-density Flash and SRAM, the FSMC and DSP processing power. 2

20 Market update 3 Final Cortex-M MCU volume in 2010: 144 M units
Growth into 2011 continues to be strong and healthy Strong ARM growth also acknowledged by Semicast STM32 = ~ 45% market share in cumulated Cortex-M shipments 3

21 STM32 platform leadership: Google trends
STM32 leading the MCU trends Still growing STM32 1.00 Competitor A 0.36 Competitor B 0.66 Competitor C 1.56 Optional slide TBC. Need updated figures with RX600, NXP LPC1800 and LPC4300, Freescale Kinetis, TI Stellaris… 3

22 Key messages to remember
STM32 F4 series World’s highest performance Extends the STM32 portfolio to over 250+ compatible devices One-in-two Cortex-M MCUs shipped worldwide is an STM32 Discovery kits available now STM32F4DISCOVERY

23 Thank you

24 Glossary ART Accelerator ™ : ST’s adaptive real-time accelerator
CMSIS: Cortex™ microcontroller software interface standard MCU: microcontroller unit DSC: digital signal controller DSP: digital signal processor FPU: floating point unit RTC: real-time clock MPU: memory protection unit FSMC: flexible static memory controller

Download ppt "High-performance Cortex™-M4 MCU"

Similar presentations

Ads by Google