LLM performance evaluation