A collection of evaluation benchmarks for the Italian language.
Simone Conia
AI & ML interests
Natural Language Processing, Multilinguality, Knowledge Graphs, Semantics, Large Language Models
Recent Activity
updated a model 8 days ago
principled-intelligence/scope-guard-4B-q-2601
updated a model 8 days ago
principled-intelligence/scope-guard-4B-g-2601
authored a paper 29 days ago
ReTraceQA: Evaluating Reasoning Traces of Small Language Models in Commonsense Question Answering