Hypothesis

2 Matching Annotations

May 2026
arxiv.org arxiv.org

Untitled document

1
1. fxp007 15 May 2026
  
  in Public
  
  A Few-shot algorithm can further improve performance using just tens of annotated examples.
  
  少样本算法仅需数十个标注样本即可进一步提升性能，这显著降低了数据标注成本，提高了模型的实用性和可扩展性。
  
  少样本学习数据效率性能提升
Visit annotations in context

Tags

数据效率

性能提升

少样本学习

Annotators

fxp007

URL

arxiv.org/html/2510.18318v4
huggingface.co huggingface.co

https://huggingface.co/papers/2604.20987

1
1. fxp007 01 May 2026
  
  in Public
  
  Experiments across six game environments show that COSPLAY with an 8B base model achieves over 25.1 percent average reward improvement against four frontier LLM baselines on single player game benchmarks while remaining competitive on multi player social reasoning games.
  
  在六个游戏环境中进行的实验表明，COSPLAY框架在单人游戏基准测试中，与四个前沿的LLM基线相比，平均奖励提高了25.1%，同时在多人社交推理游戏中也保持了竞争力。
  
  实验结果性能提升基准测试
Visit annotations in context

Tags

实验结果

基准测试

性能提升

Annotators

fxp007

URL

huggingface.co/papers/2604.20987