This is a beta release of SyntaxGym. Please send questions and comments to contact@syntaxgym.org.

Summary results

The radar plots below give a high-level overview of performance across multiple models and test suite tags.