← Back to changelog
October 11, 2024
Langfuse LLM-as-a-judge now supports any (tool-calling) LLM
Hassieb Pakzad
Tool calling makes Langfuse Evals reliable. Previously, only OpenAI models were supported. With this update, you can use any tool-calling LLM when setting up an LLM-as-a-judge evaluator.
Prior to creating an evaluator, you can now select any model that supports tool calls for which you have an LLM API key in Langfuse. On evaluator creation, Langfuse will test the model with a sample run to ensure it works as expected.
Learn more