Individual results
View docsView in-depth performance of a single language model on a single test suite.
Region-by-region surprisal
Sample item for Reflexive Number Agreement (masculine; with object relative clause)
The first item of the test suite is shown below for quick reference. Please visit the page for Reflexive Number Agreement (masculine; with object relative clause) to see the full list of items.
Item |
Condition
|
intro | np_subject | that | the | embed_np | embed_vp | matrix_v | reflexive |
---|---|---|---|---|---|---|---|---|---|
Item | Condition | intro | np_subject | that | the | embed_np | embed_vp | matrix_v | reflexive |
1 | match_sing | The | author | that | the | senators | liked | hurt | himself |
1 | mismatch_sing | The | author | that | the | senators | liked | hurt | themselves |
1 | match_plural | The | authors | that | the | senator | liked | hurt | themselves |
1 | mismatch_plural | The | authors | that | the | senator | liked | hurt | himself |
Prediction performance for GPT-2 XL on Reflexive Number Agreement (masculine; with object relative clause)
Accuracy |
Formula
|
Description |
---|---|---|
Accuracy | Prediction | Description |
84.21% | (660,match_sing/8,reflexive) < (662,mismatch_sing/8,reflexive) | No description provided. |
94.74% | (661,match_plural/8,reflexive) < (659,mismatch_plural/8,reflexive) | No description provided. |