Predictive Human Preference: From Model Ranking to Model Routing

Table of contents

Ranking Models Using Human Preference
���. How Preferential Ranking Works
���. Correctness of Chatbot Arena Ranking
������.. Eval data
������.. Results
Predicting Human Preference For Each Prompt
���. Experiment setup
���. Experiment results
������.. Domain-specific and query-specific leaderboards
Conclusion


Human preference has emerged to be both the Northstar and a powerful tool for AI model development. Human preference guides post-training techniques including RLHF an...

 •  0 comments  •  flag
Share on Twitter
Published on February 27, 2024 16:00
No comments have been added yet.