Viewing test suite Reflexive Number Agreement (feminine; with prepositional phrase)
Reference
"Marvin R. & Linzen T. (2018). Targeted syntactic evaluation of language models. "
Number of items
19
Tags
Models evaluated
88% (8/9)
Description
The noun phrase that a reflexive pronoun ("herself", "himself", "themselves") corefers with must command it in a sense similar to that relevant for negative-polarity items. In these test suites, the reflexive pronoun ending the sentence can only corefer to the subject of the sentence, with which it must agree in number: a singular subject requires a singular reflexive, and a plural subject requires a plural reflexive.
Items for reflexive_prep_fem
Item |
Condition
|
intro | np_subject | prep | the | prep_np | matrix_v | reflexive |
---|---|---|---|---|---|---|---|---|
Item | Condition | intro | np_subject | prep | the | prep_np | matrix_v | reflexive |
1 | match_sing | The | author | next to | the | senators | hurt | herself |
1 | mismatch_sing | The | author | next to | the | senators | hurt | themselves |
1 | match_plural | The | authors | next to | the | senator | hurt | themselves |
1 | mismatch_plural | The | authors | next to | the | senator | hurt | herself |
2 | match_sing | The | pilot | behind | the | teachers | injured | herself |
2 | mismatch_sing | The | pilot | behind | the | teachers | injured | themselves |
2 | match_plural | The | pilots | behind | the | teacher | injured | themselves |
2 | mismatch_plural | The | pilots | behind | the | teacher | injured | herself |
3 | match_sing | The | doctor | in front of | the | guards | trusted | herself |
3 | mismatch_sing | The | doctor | in front of | the | guards | trusted | themselves |
3 | match_plural | The | doctors | in front of | the | guard | trusted | themselves |
3 | mismatch_plural | The | doctors | in front of | the | guard | trusted | herself |
4 | match_sing | The | farmer | near | the | clerks | embarrassed | herself |
4 | mismatch_sing | The | farmer | near | the | clerks | embarrassed | themselves |
4 | match_plural | The | farmers | near | the | clerk | embarrassed | themselves |
4 | mismatch_plural | The | farmers | near | the | clerk | embarrassed | herself |
5 | match_sing | The | manager | to the side of | the | architects | disguised | herself |
5 | mismatch_sing | The | manager | to the side of | the | architects | disguised | themselves |
5 | match_plural | The | managers | to the side of | the | architect | disguised | themselves |
5 | mismatch_plural | The | managers | to the side of | the | architect | disguised | herself |
6 | match_sing | The | customer | across from | the | athletes | hated | herself |
6 | mismatch_sing | The | customer | across from | the | athletes | hated | themselves |
6 | match_plural | The | customers | across from | the | athlete | hated | themselves |
6 | mismatch_plural | The | customers | across from | the | athlete | hated | herself |
7 | match_sing | The | officer | next to | the | actors | doubted | herself |
7 | mismatch_sing | The | officer | next to | the | actors | doubted | themselves |
7 | match_plural | The | officers | next to | the | actor | doubted | themselves |
7 | mismatch_plural | The | officers | next to | the | actor | doubted | herself |
8 | match_sing | The | teacher | behind | the | ministers | hurt | herself |
8 | mismatch_sing | The | teacher | behind | the | ministers | hurt | themselves |
8 | match_plural | The | teachers | behind | the | minister | hurt | themselves |
8 | mismatch_plural | The | teachers | behind | the | minister | hurt | herself |
9 | match_sing | The | senator | in front of | the | actors | injured | herself |
9 | mismatch_sing | The | senator | in front of | the | actors | injured | themselves |
9 | match_plural | The | senators | in front of | the | actor | injured | themselves |
9 | mismatch_plural | The | senators | in front of | the | actor | injured | herself |
10 | match_sing | The | consultant | near | the | secretaries | suspected | herself |
10 | mismatch_sing | The | consultant | near | the | secretaries | suspected | themselves |
10 | match_plural | The | consultants | near | the | secretary | suspected | themselves |
10 | mismatch_plural | The | consultants | near | the | secretary | suspected | herself |
11 | match_sing | The | guard | to the side of | the | executives | embarrassed | herself |
11 | mismatch_sing | The | guard | to the side of | the | executives | embarrassed | themselves |
11 | match_plural | The | guards | to the side of | the | executive | embarrassed | themselves |
11 | mismatch_plural | The | guards | to the side of | the | executive | embarrassed | herself |
12 | match_sing | The | clerk | across from | the | authors | disguised | herself |
12 | mismatch_sing | The | clerk | across from | the | authors | disguised | themselves |
12 | match_plural | The | clerks | across from | the | author | disguised | themselves |
12 | mismatch_plural | The | clerks | across from | the | author | disguised | herself |
13 | match_sing | The | architect | next to | the | pilots | hated | herself |
13 | mismatch_sing | The | architect | next to | the | pilots | hated | themselves |
13 | match_plural | The | architects | next to | the | pilot | hated | themselves |
13 | mismatch_plural | The | architects | next to | the | pilot | hated | herself |
14 | match_sing | The | athlete | behind | the | doctors | doubted | herself |
14 | mismatch_sing | The | athlete | behind | the | doctors | doubted | themselves |
14 | match_plural | The | athletes | behind | the | doctor | doubted | themselves |
14 | mismatch_plural | The | athletes | behind | the | doctor | doubted | herself |
15 | match_sing | The | actor | in front of | the | farmers | hurt | herself |
15 | mismatch_sing | The | actor | in front of | the | farmers | hurt | themselves |
15 | match_plural | The | actors | in front of | the | farmer | hurt | themselves |
15 | mismatch_plural | The | actors | in front of | the | farmer | hurt | herself |
16 | match_sing | The | minister | near | the | managers | injured | herself |
16 | mismatch_sing | The | minister | near | the | managers | injured | themselves |
16 | match_plural | The | ministers | near | the | manager | injured | themselves |
16 | mismatch_plural | The | ministers | near | the | manager | injured | herself |
17 | match_sing | The | taxi driver | to the side of | the | customers | suspected | herself |
17 | mismatch_sing | The | taxi driver | to the side of | the | customers | suspected | themselves |
17 | match_plural | The | taxi drivers | to the side of | the | customer | suspected | themselves |
17 | mismatch_plural | The | taxi drivers | to the side of | the | customer | suspected | herself |
18 | match_sing | The | secretary | across from | the | officers | embarrassed | herself |
18 | mismatch_sing | The | secretary | across from | the | officers | embarrassed | themselves |
18 | match_plural | The | secretaries | across from | the | officer | embarrassed | themselves |
18 | mismatch_plural | The | secretaries | across from | the | officer | embarrassed | herself |
19 | match_sing | The | executive | next to | the | teachers | disguised | herself |
19 | mismatch_sing | The | executive | next to | the | teachers | disguised | themselves |
19 | match_plural | The | executives | next to | the | teacher | disguised | themselves |
19 | mismatch_plural | The | executives | next to | the | teacher | disguised | herself |
Predictions for reflexive_prep_fem
Formula
|
Description |
---|---|
Formula | Description |
(564,match_sing/7,reflexive) < (566,mismatch_sing/7,reflexive) | No description provided. |
(565,match_plural/7,reflexive) < (563,mismatch_plural/7,reflexive) | No description provided. |
Results for reflexive_prep_fem
Model | Prediction 1 accuracy | Prediction 2 accuracy | |
---|---|---|---|
Model | Prediction 1 accuracy | Prediction 2 accuracy | |
TinyLSTM | 5.26% | 100.00% | Visualize results |
GPT-2 | 21.05% | 100.00% | Visualize results |
GPT-2 XL | 57.89% | 94.74% | Visualize results |
JRNN | 0.00% | 100.00% | Visualize results |
Ordered Neurons | 0.00% | 100.00% | Visualize results |
RNNG | 10.53% | 100.00% | Visualize results |
Transformer XL | 21.05% | 100.00% | Visualize results |
Vanilla LSTM | 5.26% | 100.00% | Visualize results |