MEVZU N°124 ISTANBUL
Glossary · Advanced · 2023

DPO — Direct Preference Optimization

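An alternative to RLHF that fine-tunes a language model directly on pairwise preference data (a preferred and a rejected response to the same prompt), using a simple classification-style loss against a frozen reference model. This skips both the separately trained reward model and the explicit RL loop.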
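To make the idea concrete, here is a minimal sketch of the DPO loss for a single preference pair. It assumes the summed log-probabilities of each response under the policy and under the frozen reference model have already been computed. The function name `dpo_loss`, its argument names, and the default `beta=0.1` are illustrative choices, not taken from this entry.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """DPO loss for one (chosen, rejected) pair of responses.

    Each argument is the summed log-probability of a full response.
    beta (assumed default 0.1) scales how strongly the policy is
    pushed away from the reference model.
    """
    # How much more (or less) likely each response is under the policy
    # than under the reference model.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp

    # Implicit reward margin between the chosen and rejected responses.
    margin = beta * (chosen_logratio - rejected_logratio)

    # -log(sigmoid(margin)), written in a numerically stable form.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


# Policy identical to the reference: margin is 0, loss is log(2).
baseline = dpo_loss(-5.0, -7.0, -5.0, -7.0)

# Policy has shifted probability toward the chosen response: loss drops.
improved = dpo_loss(-4.0, -8.0, -5.0, -7.0)
```

In practice the log-probabilities come from a forward pass of both models over a batch, and the loss is averaged and minimised with an ordinary gradient-based optimiser. No sampling or reward model is involved, which is what "skipping the RL loop" amounts to.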

EN — English term
DPO (Direct Preference Optimization)
TR — Turkish term
DPO — Doğrudan Tercih Optimizasyonu

External Links