2025

The Effect of State Representation on LLM Agent Behavior in Dynamic Routing Games
Lyle Goodyear, Rachel Guo, and Ramesh Johari. 2025. arXiv.

BoxingGym: Benchmarking Progress in Automated Experimental Design and Model Discovery
Kanishk Gandhi, Michael Y. Li, Lyle Goodyear, and 4 more authors. 2025. arXiv.