Subscribe
Sign in
Share this post
arXiv Daily
2024년 4월 8일
Copy link
Facebook
Email
Notes
More
2024년 4월 8일
Kim Seonghyeon
Apr 8, 2024
Share this post
arXiv Daily
2024년 4월 8일
Copy link
Facebook
Email
Notes
More
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Read →
Comments
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts
Share this post
2024년 4월 8일
Share this post
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences