Individual results
View docsView in-depth performance of a single language model on a single test suite.
Region-by-region surprisal
Sample item for Cleft Structure (with modifier)
The first item of the test suite is shown below for quick reference. Please visit the page for Cleft Structure (with modifier) to see the full list of items.
Item |
Condition
|
intro | subj | verb | modifier | passive | verb.1 | matrix_v |
---|---|---|---|---|---|---|---|---|
Item | Condition | intro | subj | verb | modifier | passive | verb.1 | matrix_v |
1 | np_mismatch | What | he | did | after the ingredients had been bought from the store | was | the meal | |
1 | np_match | What | he | ate | after the ingredients had been bought from the store | was | the meal | |
1 | vp_match | What | he | did | after the ingredients had been bought from the store | was | prepare | the meal |
1 | vp_mismatch | What | he | ate | after the ingredients had been bought from the store | was | prepare | the meal |
Prediction performance for TinyLSTM on Cleft Structure (with modifier)
Accuracy |
Formula
|
Description |
---|---|---|
Accuracy | Prediction | Description |
55.00% | ((607,np_mismatch/7,matrix_v)-(605,np_match/7,matrix_v))+(((606,vp_mismatch/6,verb.1)+(606,vp_mismatch/7,matrix_v))-((608,vp_match/6,verb.1)+(608,vp_match/7,matrix_v)))>0 | We expect that the Matrix Verb has lower surprisal in the NP Match condition, where we have a lexicalized verb (“ate” instead of “did”). In addition, we expect that the sum of the Verb 1 + Matrix Verb has lower surprisal in the VP Match condition, where it cannot be the object of a lexicalized verb such as “ate.” Together, the differences between these sums should be greater than zero. We add a VP modifier in this test suite. |