
New Cortex-A Targets Edge AI

March 11, 2025
Arm’s Cortex-A320/Ethos-U85 NPU platform delivers low-power, high-performance AI.

Arm’s low-end edge-computing solution combines the Cortex-M85 with an Ethos-U85 neural processing unit (NPU). The platform can do some amazing stuff when it comes to artificial intelligence and machine learning (AI/ML), but sometimes the compute chores need a bit more horsepower. Enter the Cortex-A320 (Fig. 1).

Quad-Core Cortex-A320 Handles Many AI/ML Tasks Itself

The Arm Cortex-A320 cores are designed to be low power and very efficient compared to the rest of the Cortex-A line. They’re based on the Armv9.2-A architecture. A typical quad Cortex-A320 layout adds cache and NEON/SVE2 SIMD engines with crypto support (Fig. 2). Multicore versions utilize Arm’s DSU-120T DynamIQ Shared Unit (DSU).

The Ethos-U85 NPU continues to do the heavy AI/ML lifting, but the Cortex-A320 provides improved AI/ML model handling, especially when compared to the low-end Cortex-A35 and the Cortex-M85. Arm touts up to a 70% performance boost plus a 10X increase for AI/ML chores. It’s also 50% more power-efficient than an Arm Cortex-A520.

Dimosthenis Rossidis, Senior Product Manager, IoT Line of Business with Arm, notes that the Cortex-A320 gets some of its improvements from “a narrow fetch and decode datapath, densely banked L1 caches, a reduced-port integer register file, and other optimizations.” The cores have an 8-stage pipeline.

The Cortex-A320 includes support for NEON matrix multiply instructions. It supports the BFloat16 floating-point format that’s popular with compact AI/ML models. It also handles Scalable Vector Extension 2 (SVE2) instructions.
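
As a rough illustration of what those extensions look like in code, the sketch below multiplies two small BFloat16 tiles with the NEON BFMMLA intrinsic from Arm’s C Language Extensions (ACLE). It assumes a GCC or Clang toolchain targeting AArch64 with BF16 support; the flags and intrinsic names come from ACLE, not from Arm’s Cortex-A320 documentation, so check your toolchain’s version before relying on them.

```c
/* Minimal sketch: one BFMMLA step via Arm ACLE NEON intrinsics.
 * Assumed build (GCC/Clang for AArch64 with BF16 enabled):
 *   gcc -O2 -march=armv9.2-a+bf16 bfmmla_demo.c
 */
#include <arm_neon.h>
#include <stdio.h>

int main(void)
{
    /* Two 2x4 tiles, stored row-major; converted from FP32 to BFloat16. */
    float a_f[8] = {1, 2, 3, 4,  5, 6, 7, 8};
    float b_f[8] = {1, 0, 0, 0,  0, 1, 0, 0};

    bfloat16_t a_bf[8], b_bf[8];
    for (int i = 0; i < 8; i++) {
        a_bf[i] = vcvth_bf16_f32(a_f[i]);   /* narrow FP32 -> BF16 */
        b_bf[i] = vcvth_bf16_f32(b_f[i]);
    }

    bfloat16x8_t a = vld1q_bf16(a_bf);
    bfloat16x8_t b = vld1q_bf16(b_bf);
    float32x4_t acc = vdupq_n_f32(0.0f);

    /* BFMMLA: acc += A (2x4) * B^T (4x2), producing a 2x2 FP32 tile. */
    acc = vbfmmlaq_f32(acc, a, b);

    float c[4];
    vst1q_f32(c, acc);
    printf("%.1f %.1f\n%.1f %.1f\n", c[0], c[1], c[2], c[3]);
    return 0;
}
```

In practice, compilers and libraries such as Arm’s compute libraries generate these instructions automatically; hand-written intrinsics like this show up mainly in tuned inner loops.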

>>Check our coverage of embedded world 2025, and this TechXchange for similar articles and videos


The combination of the Cortex-A320 and Ethos-U85 is designed to run large language models (LLMs) with over a billion parameters locally.

Cortex-A320 Boasts Improved Security

Arm’s edge-AI application solution also pushes high-level security support (Fig. 3). This includes features like Arm’s Memory Tagging Extension (MTE) and Pointer Authentication (PAC) with Branch Target Identification (BTI). These are designed to improve memory safety at the hardware level.

MTE utilizes 4-bit address tags and 4-bit memory tags. The former adds tag information to the upper bits of a pointer, while the latter is associated with each 16-byte region of memory, also known as a tag granule. If the address and memory tags agree, the pointer reference works; otherwise, an error is flagged.
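
To make the mechanism concrete, here’s a minimal sketch of tagging a buffer on a Linux system with MTE enabled, using the ACLE MTE intrinsics. The prctl settings and PROT_MTE flag come from the Linux UAPI headers; kernel and toolchain support vary, so treat this as an illustration rather than production code.

```c
/* Minimal MTE sketch, assuming an MTE-capable CPU and Linux kernel.
 * Assumed build: gcc -O2 -march=armv8.5-a+memtag mte_demo.c
 */
#include <arm_acle.h>
#include <stdio.h>
#include <sys/mman.h>
#include <sys/prctl.h>

#ifndef PROT_MTE
#define PROT_MTE 0x20          /* ask the kernel for tagged memory */
#endif

int main(void)
{
    /* Opt this process in to synchronous tag-check faults. */
    prctl(PR_SET_TAGGED_ADDR_CTRL,
          PR_TAGGED_ADDR_ENABLE | PR_MTE_TCF_SYNC |
          (0xffffUL << PR_MTE_TAG_SHIFT),
          0, 0, 0);

    /* Map one page with MTE enabled. */
    char *page = mmap(NULL, 4096, PROT_READ | PROT_WRITE | PROT_MTE,
                      MAP_PRIVATE | MAP_ANONYMOUS, -1, 0);
    if (page == MAP_FAILED)
        return 1;

    /* Give the pointer a random 4-bit address tag, then store that tag
     * as the memory tag of the first 16-byte granule. */
    char *p = __arm_mte_create_random_tag(page, 0);
    __arm_mte_set_tag(p);

    p[0] = 'A';                /* tags match: the access succeeds */
    printf("tagged write OK: %c\n", p[0]);

    /* Touching the same granule through a pointer with a different tag
     * (e.g., the original untagged 'page') would fault here. */
    return 0;
}
```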

PAC/BTI is designed to mitigate jump- and return-oriented programming attacks. PAC adds instructions that insert an authentication code into the otherwise unused upper bits of the 64-bit return-address pointer. If an attacker tampers with the pointer, the authentication check fails and the error is caught at runtime. BTI support is similar, but it addresses indirect jumps. These features are typically generated by the compiler rather than written by hand.
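
Because the compiler emits the PAC and BTI instructions, application source code rarely changes. The sketch below is an ordinary indirect call; building it for AArch64 with GCC or Clang’s -mbranch-protection=standard flag (a toolchain assumption, not something specific to the Cortex-A320) is enough to get return-address signing and BTI landing pads.

```c
/* Sketch: PAC/BTI are added by the compiler, not written by hand.
 * Assumed build: gcc -O2 -mbranch-protection=standard pac_demo.c
 * With that flag, functions that spill the link register get
 * "paciasp"/"autiasp" (sign/authenticate the return address), and
 * indirect-branch targets get a "bti" landing pad.
 */
#include <stdio.h>

/* Called through a function pointer, so its entry needs a BTI landing pad. */
static void handler(const char *msg)
{
    printf("handler: %s\n", msg);
}

static void dispatch(void (*fn)(const char *), const char *msg)
{
    fn(msg);   /* indirect branch: must land on a BTI instruction */
}

int main(void)
{
    dispatch(handler, "protected call");
    return 0;
}
```

Inspecting the generated assembly (e.g., with the compiler’s -S option) shows the inserted instructions, which is a quick way to confirm the protection is actually enabled in a build.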

The Cortex-A320 supports Secure EL2 (Exception Level 2), which enhances Arm’s TrustZone security isolation.
