许仔阳
yang198
·
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 months ago
DITING: A Multi-Agent Evaluation Framework for Benchmarking Web Novel
Translation
upvoted
a
paper
about 2 months ago
FinAuditing: A Financial Taxonomy-Structured Multi-Document Benchmark
for Evaluating LLMs