PROVING TEST SET CONTAMINATION IN BLACK BOX LANGUAGE MODELS
我复现的源码:https://github.com/Whiffe/test_set_contamination
b站复现视频:https://www.bilibili.com/video/BV14d1CYWE26/
顶会论文复现:PROVING TEST SET CONTAMINATION IN BLACK BOX LANGUAGE MODELS
Oren等人交换了一些基准测试中问题的顺序,并用生成新数据的方式测试模型,作为检测数据泄露的一种方法。 来自:论文翻译:arxiv-2024 Training on the Benchmark Is Not All You Need
https://openreview.net/forum?id=KS8mIvetg2