Individual results

View in-depth performance of a single language model on a single test suite.

Region-by-region surprisal
Sample item for Subject-Verb Number Agreement (with object relative clause)
Item  Condition        intro  np_subject  that  the  embed_np  embed_vp  matrix_v  continuation
1     match_sing       The    author      that  the  senators  hurt      is        good
1     mismatch_sing    The    author      that  the  senators  hurt      are       good
1     match_plural     The    authors     that  the  senator   hurt      are       good
1     mismatch_plural  The    authors     that  the  senator   hurt      is        good
Prediction performance for TinyLSTM on Subject-Verb Number Agreement (with object relative clause)
Accuracy  Formula                                           Description
10.53%    match_sing.matrix_v < mismatch_sing.matrix_v      No description provided.
52.63%    match_plural.matrix_v < mismatch_plural.matrix_v  No description provided.
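
A minimal sketch of how an accuracy figure like those above could be computed, assuming each formula compares the surprisal at one region (here matrix_v) across two conditions of the same item, and that accuracy is the fraction of items for which the inequality holds. The function name, the data layout, and the surprisal values are hypothetical and for illustration only; they are not TinyLSTM's actual outputs.

```python
from typing import Dict, List

# Hypothetical layout: one dict per item, mapping condition -> region -> surprisal.
ItemSurprisals = Dict[str, Dict[str, float]]


def prediction_accuracy(
    items: List[ItemSurprisals],
    low_condition: str,
    high_condition: str,
    region: str = "matrix_v",
) -> float:
    """Fraction of items where surprisal at `region` is strictly lower
    in `low_condition` than in `high_condition`."""
    if not items:
        raise ValueError("at least one item is required")
    hits = sum(
        1
        for item in items
        if item[low_condition][region] < item[high_condition][region]
    )
    return hits / len(items)


# Made-up surprisal values for four items (assumption, not real results).
toy_items: List[ItemSurprisals] = [
    {"match_sing": {"matrix_v": 3.0}, "mismatch_sing": {"matrix_v": 5.0}},
    {"match_sing": {"matrix_v": 6.0}, "mismatch_sing": {"matrix_v": 4.0}},
    {"match_sing": {"matrix_v": 2.5}, "mismatch_sing": {"matrix_v": 2.5}},
    {"match_sing": {"matrix_v": 1.0}, "mismatch_sing": {"matrix_v": 7.0}},
]

# Evaluates the formula match_sing.matrix_v < mismatch_sing.matrix_v.
accuracy = prediction_accuracy(toy_items, "match_sing", "mismatch_sing")
print(f"{accuracy:.2%}")  # 50.00%
```

In this sketch a tie counts as a failed prediction because the formula uses a strict inequality, so only two of the four toy items satisfy it.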