Skip to content
MEVZU N°124ISTANBUL
Glossary · Advanced · 2022

RLAIF — RL from AI Feedback

An alignment approach that uses another LLM, instead of human labellers, as the source of preference signals.

EN — English term
RLAIF (RL from AI Feedback)
TR — Turkish term
RLAIF — AI Geri Bildirimiyle Pekiştirmeli Öğrenme