LLM Inference in Production by BentoML; “core concepts and performance metrics, to optimization techniques and operation best practices”