Exploring Self-Play Beyond Competition: Language Model Learning in Negotiation Games – Austen Liao (JHU)
Abstract Game-playing agents like AlphaGo have achieved superhuman performance through self-play, which is theoretically guaranteed to yield optimal policies in competitive games. However, most language tasks are partially or fully cooperative, so it is an[…]