Inference Speed Is the Key to Unleashing AI’s Potential