Individual results

View docs

View in-depth performance of a single language model on a single test suite.

Region-by-region surprisal
Sample item for Reflexive Number Agreement (masculine; with prepositional phrase)
Item
Condition
intro np_subject prep the prep_np matrix_v reflexive
Item Condition intro np_subject prep the prep_np matrix_v reflexive
1 match_sing The author next to the senators hurt himself
1 mismatch_sing The author next to the senators hurt themselves
1 match_plural The authors next to the senator hurt themselves
1 mismatch_plural The authors next to the senator hurt himself
Prediction performance for Transformer XL on Reflexive Number Agreement (masculine; with prepositional phrase)
Accuracy
Formula
Description
AccuracyPredictionDescription
73.68% (550,match_sing/7,reflexive) < (552,mismatch_sing/7,reflexive) No description provided.
89.47% (551,match_plural/7,reflexive) < (549,mismatch_plural/7,reflexive) No description provided.