ChatML Inference