Predictive Human Preference: From Model Ranking to Model Routing

Sampling for Text Generation What I learned from looking at 900 mo...

Predictive Human Preference: From Model Ranking to Model Routing

Table of contents

Ranking Models Using Human Preference
��. How Preferential Ranking Works
��. Correctness of Chatbot Arena Ranking
��.. Eval data
��.. Results
Predicting Human Preference For Each Prompt
��. Experiment setup
��. Experiment results
��.. Domain-specific and query-specific leaderboards
Conclusion

Human preference has emerged to be both the Northstar and a powerful tool for AI model development. Human preference guides post-training techniques including RLHF an...

View more on Chip Huyen's website »

Like • 0 comments • flag

Published on February 27, 2024 16:00

No comments have been added yet.

Chip Huyen's Blog

Chip Huyen's profile
4065 followers