MEVZU N°12808.05.2026ISTANBULYEAR I — VOL. III

MEVZU N° TAG / VOL. 093

#method

0 blog · 0 news · 2 wiki

§03

Wiki

02

Pairwise Comparison

An eval method that asks which of two models' answers to the same prompt is better.

EN: Pairwise Comparison
TR: İkili Karşılaştırma

LLM-as-Judge

An evaluation method in which an LLM is used to judge another model's output.

EN: LLM-as-Judge
TR: Yargıç Olarak LLM