- In the first two weeks after its launch, the developers learned that LLMs cannot do math but are more reliable when they output the intermediate steps of solving a problem, and that GPT-4 can be used to "oppress" ChatGPT in a game.
- The developers plan to add more challenging levels to the game and make it more representative of real-world conditions.
















