Individual results

View docs

View in-depth performance of a single language model on a single test suite.

Region-by-region surprisal
Sample item for Subordination (with subject relative clause)
Item
Condition
Subordinate clause 1 subj_modifier Subordinate clause 2 obj_modifier Main clause
Item Condition Subordinate clause 1 subj_modifier Subordinate clause 2 obj_modifier Main clause
1 sub_no-matrix As the doctor who wore a white lab jacket studied the book that described several advances in cancer therapy .
1 no-sub_no-matrix The doctor who wore a white lab jacket studied the book that described several advances in cancer therapy .
1 sub_matrix As the doctor who wore a white lab jacket studied the book that described several advances in cancer therapy , the nurse walked into the room .
1 no-sub_matrix The doctor who wore a white lab jacket studied the book that described several advances in cancer therapy , the nurse walked into the room .
Prediction performance for GPT-2 on Subordination (with subject relative clause)
Accuracy
Formula
Description
AccuracyPredictionDescription
82.61% (644,sub_no-matrix/5,Main clause) > (646,no-sub_no-matrix/5,Main clause) No description provided.
100.00% (645,sub_matrix/5,Main clause) < (643,no-sub_matrix/5,Main clause) No description provided.