Intel AMX, Cache Optimization, Large Language Model Inference, NUMA-aware Systems, Performance Analysis.