about
publications
cv

Announcement_6

Created in June 24, 2025

2025

New! New preprint out on SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning.

© Copyright 2026 Simon Yu. Powered by Jekyll with al-folio theme. Hosted by GitHub Pages. Photos from Unsplash. Last updated: January 26, 2026.