From Google to OpenAI, everyone keeps talking about AI inference and how it is a critical part of the infrastructure. Google even released the new Ironwood TPU and called it the chip for the age of inference.