M5Stack LLM-8850 Expansion Card with Axera AX8850 — 24 TOPS M.2 AI Accelerator

M5Stack has introduced the LLM-8850 expansion card, a compact AI acceleration module built on the M.2 M-Key 2242 standard. At its core is the Axera AX8850 SoC, delivering up to 24 TOPS of INT8 performance. Designed for versatility, the module can slot into a wide range of hosts, including the Raspberry Pi 5, Rockchip RK3588-based SBCs, and x86 mini-PCs with an available M.2 M-Key interface.

The card integrates 8GB of LPDDR4x RAM and 32Mbit of SPI NOR flash, along with a capable video subsystem. It supports 8Kp30 H.264/H.265 video encoding and 8Kp60 decoding, with the ability to process up to 16 simultaneous 1080p streams. To maintain stability under load, the board is equipped with active cooling—a small turbine fan paired with a CNC-milled aluminum heatsink—preventing thermal throttling.

M5Stack LLM-8850 — Specs & Features

M5Stack LLM-8850 — Specifications & Feature Summary

Ready-to-drop-in M.2 M-Key AI acceleration module powered by the Axera AX8850. Includes a concise spec table and feature list for documentation or product pages.

SoCAxera AX8850
CPU8 × Cortex-A55, up to 1.7 GHz
NPU24 TOPS (INT8)
VPU — EncodeH.264 / H.265, up to 8K @30fps; scaling & cropping
VPU — DecodeH.264 / H.265, up to 8K @60fps; up to 16× 1080p concurrent streams; scaling & cropping
Memory8GB LPDDR4x, 64-bit, 4266 Mbps
Storage32 Mbit QSPI NOR (bootloader only)
Host InterfaceM.2 M-Key (2242), PCIe 2.0 ×2
CoolingTurbine fan + CNC aluminum heatsink
Power3.3V via M.2 connector; < 7 W max
Dimensions42.6 × 24.0 × 9.7 mm
Weight14.7 g
Operating Temp.0–60 °C (sustained load ~70 °C at room temp)
OS SupportUbuntu 20.04 / 22.04 / 24.04, Debian 12 (Linux only; driver: axcl-smi)
Typical ApplicationsLLM inference, vision processing, multimodal and audio models
MSRP / Channels$99 — M5Stack store, AliExpress
AI Performance
24 TOPS (INT8) NPU suitable for edge LLM inference and multimodal workloads.
Compact M.2 Form Factor
M.2 M-Key 2242—fits Raspberry Pi 5 (with adapter), Rockchip SBCs, and x86 mini-PCs with M-Key slot.
High-Res Video
8K encode/decode support and up to 16× 1080p concurrent decode streams for video analytics use-cases.
Memory & Boot
8GB LPDDR4x for model runtime; 32Mbit QSPI NOR for bootloader only.
Linux-First
Driver (axcl-smi) and demos target Ubuntu / Debian—no Windows/macOS support at present.
Active Cooling
Integrated micro-turbine and CNC heatsink to avoid thermal throttling under sustained loads.
Power Efficient
Sub-7W maximum draw via the M.2 connector—suitable for edge and embedded hosts.
Model Ecosystem
Official demos for vision, LLMs, multimodal, audio, and generative models are available on the vendor wiki.
M5Stack LLM-8850 Expansion Card with Axera AX8850

Software Support and Compatibility

The LLM-8850 runs exclusively on Linux. It supports Ubuntu 20.04 / 22.04 / 24.04 and Debian 12, but does not currently support Windows, macOS, or even WSL. This is because its axcl-smi driver is only available for Linux environments.

Once the driver is installed on a Raspberry Pi 5, Linux SBC, or mini-PC, developers can access demo programs and model packages via the official wiki. Supported models cover a broad range of AI tasks:

  • Vision: YOLO11, Yolo-World-V2, Yolov7-face, Depth-Anything-V2, MixFormer-V2, Real-ESRGAN, Super-Resolution, RIFE
  • Large Language Models (LLMs): Qwen3-0.6B, Qwen3-1.7B, Qwen2.5-0.5B-Instruct, Qwen2.5-1.5B-Instruct, DeepSeek-R1-Distill-Qwen-1.5B, MiniCPM4-0.5B
  • Multimodal: InternVL3-1B, Qwen2.5-VL-3B-Instruct, SmolVLM2-500M-Video-Instruct, LibCLIP
  • Audio: Whisper, MeloTTS, SenseVoice, CosyVoice2, 3D-Speaker-MT
  • Generative: lcm-lora-sdv1-5, SD1.5-LLM8850, LivePortrait

Using the LLM-8850 with Raspberry Pi 5

To connect the LLM-8850 to a Raspberry Pi 5, M5Stack provides an M.2 HAT+ M-Key adapter board, enabling stable PCIe communication between the two.

Although official benchmark results are not yet available, early data from the wiki indicates, for example, that the Qwen3-0.6B model can reach 12.88 tokens/s with w8a16 quantization.

From a raw performance perspective, the LLM-8850 is competitive with other edge AI modules, such as the 26 TOPS Hailo-8. While Hailo-8 excels in computer vision workloads, the AX8850 shows stronger efficiency in large language model inference, giving the LLM-8850 an edge in certain AI domains.

M5Stack LLM-8850 Expansion Card

In terms of pricing, the LLM-8850 sits in the same bracket as most AI accelerator modules. At $99, it is more affordable than the Raspberry Pi AI HAT+ (26 TOPS, $110) and significantly undercuts the Hailo-8 M.2 expansion card (~$200).

The module is available through two main channels:

Both list the card at $99.

Like it? Share it:

Embedsbc related posts:

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top