Evaluation tool to assess the cultural relevance of images for user-defined culture labels
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
POWSM: A Phonetic Open Whisper-Style Speech Foundation Model
Beyond Understanding: Evaluating the Pragmatic Gap in LLMs' Cultural Processing of Figurative Language
-
SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents
Paper • 2403.08715 • Published • 21 -
SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents
Paper • 2310.11667 • Published • 4 -
cmu-lti/sotopia
Updated • 221 • 4 -
cmu-lti/sotopia-pi
Viewer • Updated • 33.4k • 640 • 8
Evaluation tool to assess the cultural relevance of images for user-defined culture labels
-
SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents
Paper • 2403.08715 • Published • 21 -
SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents
Paper • 2310.11667 • Published • 4 -
cmu-lti/sotopia
Updated • 221 • 4 -
cmu-lti/sotopia-pi
Viewer • Updated • 33.4k • 640 • 8
datasets
11
cmu-lti/stateful
Viewer
•
Updated
•
500
•
19
cmu-lti/caire-specific
Viewer
•
Updated
•
68
•
30
cmu-lti/interactive-swe
Viewer
•
Updated
•
500
•
103
cmu-lti/caire-universal
Viewer
•
Updated
•
400
•
19
cmu-lti/caire-index-ckpts
Updated
•
15
cmu-lti/AI-LieDar
Updated
•
32
cmu-lti/agents_vs_script
Viewer
•
Updated
•
20.3k
•
48
•
3
cmu-lti/sotopia
Updated
•
221
•
4
cmu-lti/sotopia-pi
Viewer
•
Updated
•
33.4k
•
640
•
8
cmu-lti/cobracorpus
Viewer
•
Updated
•
32.6k
•
80
•
4