News
SCROLLS measures a model's performance on NLP tasks such as natural language understanding (NLU), question-answering, and summarization, evaluated on seven different datasets containing text ...
Results that may be inaccessible to you are currently showing.
Hide inaccessible results