HYMEX: Energy-Efficient HYbrid MEmory based Architecture for NeXt-Generation AI Systems

This page is under preparation. Please visit this page to have an early peak at the project.

Links: [JSPS Grants-in-Aid for Scientific Research] / [University of Aizu] / [PI's Researchmap] / [Project Database] / [Project Page]

Information

Funder: KAKENHI (Grant-in-Aid for Scientific Research B)
Project No: 26K02883
Duration: 2026/04-2029/03
Summary: Artificial Intelligence (AI) has made great progress, but its high energy consumption remains a serious challenge, especially for small devices such as smartphones, IoT sensors, and battery- or solar-powered systems. This project proposes a new energy-efficient neuromorphic AI accelerator using 3D integrated circuits (3D-ICs) that combine Non-Volatile Memory (NVM) and SRAM to reduce power usage both during operation and idle periods. NVM can retain data without power, significantly lowering idle energy consumption, but it is typically slower and more energy-intensive during read/write operations than SRAM. To address this, we introduce a hybrid memory architecture in which stacked NVM layers are placed above the logic layer to shorten data transfer distance, while a small, fast SRAM layer acts as a cache between logic and NVM to improve speed and reduce read/write energy. In addition, adaptive power management techniques dynamically adjust energy usage based on available power with minimal impact on accuracy and performance. With these technologies, the proposed design aims to reduce energy consumption during operation and during idle time, enabling practical low-power AI chips for real-world applications such as smartphones, IoT devices, and smart sensors.

Research Technology & Prototyping

Our research group specializes in the design and validation of high-efficiency AI hardware and next-generation memory-compute systems. We leverage industry-leading AMD (formerly Xilinx) FPGA platforms to bridge the gap between architectural theory and physical silicon realization.

Core Methodologies

Full-System Prototyping: We utilize a rigorous RTL-based design flow (Verilog/VHDL) within the Vivado™ and Vitis™ environments. This ensures cycle-accurate hardware validation and reliable performance benchmarking for complex neural accelerators.

3D-IC & Interconnect Emulation: Recognizing the challenges of iterative 3D-IC fabrication, we use advanced Programmable Network-on-Chip (NoC) features to simulate high-bandwidth vertical interconnects and through-silicon via (TSV) latency characteristics.

Advanced Memory Modeling: Our group develops custom behavioral wrappers to model the timing, energy signatures, and data-retention properties of emerging technologies, such as Non-Volatile Memory (NVM) and high-density 3D-stacked SRAM.

By using adaptive computing platforms as high-fidelity emulation vehicles, we can explore innovative architectural trade-offs—such as Memory–Compute Co-Design and Event-Driven AI—with the precision of hardware-level execution. This methodology allows us to accelerate the development cycle for low-power AI chips targeting edge devices, smartphones, and autonomous IoT sensors.

Open Source

The final program will be released in Github under MIT license in the future.

Team Members

Students

AY2026

HANYU Yuga
SHARMA Atharv
LUYEN Van-Vu
KOMATSUZAKI Aruki

Schedule

Early Prototyping (Prior AY2026)

As a early prototyping, we use HumanDATA Kintex 7 EDX-008-70T to validate the read/write of MRAM by Everspin.

Overall schedule

Meeting

AY2026

Date	Meeting	Minutes
2026-04-13	Project Kickoff	Discussed scope and timeline
2026-04-27	Bi-Weekly Sync

Contact

khanh [at] u-aizu.ac.jp

HYMEX: Energy-Efficient HYbrid MEmory based Architecture for NeXt-Generation AI Systems

Information

Research Technology & Prototyping

Core Methodologies

Team Members

Students

AY2026

Schedule

Early Prototyping (Prior AY2026)

Overall schedule

Meeting

AY2026

Contact

Funder