Title here
Summary here
A memory architecture where the CPU and GPU share the same physical RAM pool, eliminating the discrete VRAM limitation. Used in Apple Silicon and AMD Strix Halo (Ryzen AI Max). The entire 128GB is accessible to both CPU and GPU, allowing much larger models to run at GPU speeds.