Abstract: Tiny AI-edge devices use nvCIM for power-off weight storage and active-mode computation, enabling high energy efficiency (EF) and low power-on latency. While tiny Transformer models offer ...