Summary
Hailo Technologies is a Tel Aviv-based fabless AI chip company developing dedicated edge inference accelerators — the Hailo-8, Hailo-10, and Hailo-15 chip families — available as M.2, PCIe, and mPCIe add-in modules for embedding AI inference into existing compute platforms. Hailo’s chips are notable for the highest documented TOPS/watt ratio in the add-in accelerator market: the Hailo-8 delivers 26 TOPS at 2.5W in an M.2 2280 module — enabling AI inference on camera systems, embedded computers, drones, vehicles, and industrial equipment without a GPU. Hailo has achieved broad OEM adoption across automotive ADAS (including Toyota, Denso, and Tier 1 auto suppliers), smart camera vendors, industrial AI companies, and defense-adjacent integrators.
Key Facts
- Founded: 2017
- HQ: Tel Aviv, Israel
- Funding: ~$250M+ raised; Series C completed (2022); investors include Poalim Equity, ABB Technology Ventures, Cardium, Mann+Hummel Ventures, and others; valuation estimated >$1B post-Series C
- Founders: Orr Danon (CEO), Avi Baum (CTO) — both veteran Israeli chip engineers with prior experience at EZchip (acquired by Mellanox/NVIDIA)
- Value chain position: Fabless chip designer; sells chips in bulk to OEM customers and sells Hailo AI Modules (M.2, PCIe, mPCIe) directly to integrators and developers
- Fabrication: TSMC — Hailo-8 at TSMC 16nm FinFET; Hailo-15 at TSMC 5nm
- Primary markets: Automotive ADAS, smart cameras and surveillance, industrial AI inspection, robotics, medical imaging, defense-adjacent (drone AI, tactical sensors)
Key Products
Hailo-8 — M.2 / PCIe / mPCIe AI Accelerator (26 TOPS, 2.5W)
The Hailo-8 is the chip that established Hailo’s market position. Fabricated at TSMC 16nm, the Hailo-8 delivers 26 TOPS (tera-operations per second) at INT8 precision while consuming only 2.5W — a TOPS/watt ratio that substantially outperforms Intel integrated NPUs and Google’s Edge TPU. The chip is available in M.2 2280, PCIe (x1 or x4), and mini-PCIe form factors, making it compatible with an enormous base of existing embedded computers, industrial PCs, smart cameras, and single-board computers.
Key specifications:
- Performance: 26 TOPS (INT8)
- Power: 2.5W typical; <3.5W peak
- Fabrication: TSMC 16nm FinFET
- Form factors available: M.2 2280 (key M or B+M), PCIe half-height half-length, mPCIe
- Memory: 32MB on-chip SRAM (no external DRAM required for inference — eliminates DRAM power)
- Interface: PCIe Gen3 x1 (standard); Gen3 x4 (full-bandwidth)
- Software: HailoRT runtime; TAPPAS (vision AI application framework); integration with GStreamer pipelines; ONNX/TFLite model compilation via Hailo Dataflow Compiler
- Notable deployment: Raspberry Pi AI HAT+ (announced 2024) — Hailo-8L (13 TOPS, lower-TDP variant) on an official Raspberry Pi accessory; dramatically expanded developer ecosystem
Hailo-8L variant: Lower-performance (13 TOPS) version of Hailo-8 at lower cost and even lower power, used in the Raspberry Pi AI HAT+ and camera modules.
Hailo-10H — Ultra-Low Power Inference
The Hailo-10H is a lower-power, lower-cost accelerator targeting applications where even the Hailo-8’s 2.5W is too much — battery-powered cameras, IoT sensors, wearable devices, and ultra-compact embedded systems. Performance is reduced relative to Hailo-8, but the power envelope (sub-1W operation possible) opens use cases that no other add-in accelerator can address.
Key specifications:
- Performance: Approximately 10 TOPS (INT8)
- Power: <1W typical; target: sub-milliwatt idle
- Target use cases: Battery-powered AI cameras, wearable AI, IoT edge nodes, drone AI payload nodes
Hailo-15 — Next Generation, TSMC 5nm (20–40 TOPS per chip)
The Hailo-15 family represents Hailo’s move to TSMC 5nm and a significant performance step. Multiple SKUs cover different power/performance points:
- Hailo-15H: ~40 TOPS at <5W (high-performance M.2/PCIe variant)
- Hailo-15M: ~20 TOPS at lower power
- Hailo-15L: Ultra-low power variant
Key specifications:
- Fabrication: TSMC 5nm
- Interface: PCIe Gen4 x4 (higher bandwidth than Hailo-8)
- Memory: Enhanced on-chip SRAM; larger SRAM enables larger model inference
- Notable improvement over Hailo-8: Transformer model support — Hailo-15 is specifically optimized for attention-based models (ViT, YOLO-NAS, RT-DETR) that were less efficient on Hailo-8’s CNN-optimized architecture
- Availability: Production availability Q3/Q4 2025 per company announcements; modules in deployment through 2026
Hailo AI Software Stack
- Hailo Dataflow Compiler (DFC): Compiles neural networks (ONNX, TFLite, Caffe) to Hailo hardware; performs quantization (INT8/INT16), graph optimization, and hardware-specific scheduling
- HailoRT: Real-time runtime library for inference deployment; C/C++/Python APIs; GStreamer plugin integration
- TAPPAS: Vision AI application framework with pre-built pipelines for object detection, classification, tracking, and multi-stream processing
- Model Zoo: Pre-compiled, validated models for common vision tasks (YOLOv8, ResNet, EfficientDet, etc.) — reduces time-to-deployment for common use cases
What It Is / How It Works
Architecture: Structure-Defined Dataflow
Hailo’s chip architecture is based on “Structure-Defined Dataflow” — a hardware design philosophy where the chip’s internal compute graph is a direct mapping of the neural network’s data flow graph, rather than a general-purpose matrix multiply array. In a conventional GPU or general NPU, all operations go through shared memory and execution units; in Hailo’s architecture, each layer of the network has dedicated resources and direct data paths to the next layer.
The on-chip SRAM (32MB on Hailo-8) is large enough to hold entire intermediate activations for typical CNN inference tasks, eliminating the need for external DRAM access during inference. DRAM access is one of the largest power consumers in AI inference; removing it is the primary reason Hailo achieves 26 TOPS at 2.5W when competing architectures require 5–10× more power for similar throughput.
The trade-off: The dataflow architecture is extremely efficient for models whose structure matches the chip’s internal wiring, but less flexible for unusual architectures. Hailo-8 is very efficient for standard CNNs (YOLO, ResNet, EfficientNet) but less so for transformers — a known limitation that Hailo-15’s architecture specifically addresses.
Raspberry Pi AI HAT+ Effect
The Raspberry Pi AI HAT+ (shipping 2024) with the Hailo-8L created a massive developer ecosystem for Hailo hardware — developer boards are how AI chip companies establish software ecosystems and trained engineers. NVIDIA’s Jetson dominance in edge AI is partly built on the developer community that grew around the Jetson Nano (cheap developer module). Hailo’s Raspberry Pi partnership is a strategic move to build similar developer ecosystem density, which then pulls through commercial deployments when those developers build products.
Defense-Adjacent Applications
Hailo hardware appears in defense-adjacent deployments through system integrators and OEMs who embed Hailo modules in:
- Tactical camera and surveillance systems: AI analytics on soldier-worn cameras, fixed perimeter surveillance, and ISR camera payloads — Hailo-8’s 2.5W is compatible with battery-powered or solar-powered deployments
- Drone payload AI: Small UAS/UAV onboard inference for object detection, target classification, and geolocation — Hailo-8M.2 fits in tight payload bays where GPU modules cannot
- Vehicle-mounted sensor processing: Forward-looking ADAS/sensor fusion supplementary inference in military vehicles (supplementing, not replacing, larger GPU compute for lower-priority inference tasks)
- Smart IED detection / ground sensor networks: Persistent sensor nodes with local AI inference to reduce data transmission requirements in bandwidth-constrained tactical environments
Hailo does not publicly identify specific DoD contracts, but its products are available through US defense electronics distributors and are listed in several defense-oriented embedded computing vendors’ hardware catalogs.
Notable Developments
- 2024: Raspberry Pi AI HAT+ launched with Hailo-8L — rapidly expands developer ecosystem; tens of thousands of developer units shipped
- 2025: Hailo-15H production ramp at TSMC 5nm; PCIe Gen4 modules becoming available through distribution
- 2025: TAPPAS 3.x release — expanded model zoo (YOLOv8, RT-DETR, SAM-Edge); improved multi-stream pipeline performance
- Ongoing: Smart camera market penetration through Axis Communications and Hanwha Vision partnerships — Hailo chips embedded in network camera SoCs
- Ongoing: Automotive design-in wins; Hailo chips are embedded in Toyota/Denso advanced ADAS platforms (Level 2+ features)
Key People
Orr Danon — Co-Founder & CEO
- Previously at EZchip Technologies (acquired by Mellanox/NVIDIA for $811M in 2016 — network processor chips)
- Education: Computer Science, Technion – Israel Institute of Technology
Avi Baum — Co-Founder & CTO
- Previously at EZchip Technologies; chip architecture and design
- Education: Electrical Engineering, Technion – Israel Institute of Technology
Claim Verification
Claim: 26 TOPS at 2.5W for Hailo-8
Status: Verified (with standard precision caveat)
Supporting sources:
- Hailo-8 datasheet — company datasheet specifies 26 TOPS at INT8 precision, 2.5W typical power
- Multiple third-party publications and embedded computing review sites have independently measured Hailo-8 power draw in the 2.0–2.8W range under inference load, consistent with specification
- Raspberry Pi AI HAT+ documentation (which uses Hailo-8L at 13 TOPS / ~1.5W) corroborates the power efficiency claims by independent community testing
Refuting / questioning sources:
- TOPS is measured at INT8 precision; FP16 throughput is lower (most inference workloads use INT8 after quantization, so INT8 TOPS is the relevant figure)
- TOPS figure represents peak compute throughput; actual model throughput depends on memory bandwidth, model structure efficiency, and compiler optimization — real-world FPS on specific models may be lower than peak TOPS suggests
Summary: The 26 TOPS / 2.5W specification is well-supported by independent measurement. The efficiency advantage over alternatives is real and documented. Standard caveat: TOPS is peak, not sustained on all model types.
Sources
- Hailo Technologies — Official Website
- Hailo — AI Accelerators Products
- Hailo Developer Zone — SDK, model zoo, TAPPAS documentation
- Raspberry Pi AI HAT+ — consumer ecosystem entry point
- TSMC 5nm Process — fabrication context for Hailo-15