📌 Mark Your Calendar: Live Game Arena Event This Monday!
We are releasing two new games, Poker and Werewolf, along with an updated Chess leaderboard next Monday, February 2, running daily from 9:30 AM PT to 11:30 AM PT through February 4
📌 Mark Your Calendar: Live Game Arena Event This Monday!
We are releasing two new games, Poker and Werewolf, along with an updated Chess leaderboard next Monday, February 2, running daily from 9:30 AM PT to 11:30 AM PT through February 4
If ICLR is any indication, LLMs + Game Theory / Multi-Agent is thriving. We'd love to see your research ideas at AAMAS this May in Cyprus! Submission deadline is Feb 4. More details below.
Hello all! 👋
I’m delighted to share a 🚨 new preprint 🚨:
“Active Evaluation of General Agents: Problem Definition and Comparison of Baseline Algorithms”.
A paper thread! 🤩📄🧵 1/N
Have you been using LLMs to play games, to negotiate your salary, or strategize in other cool ways? Whether it succeeded or failed spectacularly, we are interested in seeing your demos at our “Strategic Engineering” workshop at #AAMAS2026 in Cyprus! Starter library @ github.com/google-deepm...!
Unlike board games, real-world strategic interactions are messy. Traditional game theory thus needs a boost for the age of agentic AI. Our #AAMAS2026 workshop "Strategic Engineering"(sites.google.com/view/se-aama...) in Cyprus aims to bridge the gap. Come join us to unlock truly strategic AI!
Looking for a principled evaluation method for ranking of *general* agents or models, i.e. that get evaluated across a myriad of different tasks?
I’m delighted to tell you about our new paper, Soft Condorcet Optimization (SCO) for Ranking of General Agents, to be presented at AAMAS 2025! 🧵 1/N
Now in the big blue world!