Stepwise Self-Consistent Mathematical Reasoning with Large Language Models

828 篇文章

已下架不支持订阅

18 篇文章
本文提出了一种名为SSC-CoT的算法,用于解决大型语言模型在多步骤数学推理中的挑战。通过中间步骤的交集选择和知识图谱查询,SSC-CoT提高了复杂三角问题解决的有效性,并在TriMaster100数据集和MATH level 5数据集上展现出优越性能。

本文是LLM系列文章,针对《Stepwise Self-Consistent Mathematical Reasoning with Large Language Models》的翻译。

摘要

使用大型语言模型进行复杂的数学推理是困难的,主要是由于多步骤推理的复杂性。这一过程的主要挑战包括(1)选择关键的中间结果来推进程序,以及(2)对潜在解决方案的有限探索。为了解决这些问题,我们引入了一种新的算法,即逐步自洽思想链(SSC-CoT)。SSCCoT采用了一种基于各种推理链的交集来选择中间步骤的策略。此外,SSC-CoT使模型能够通过查询包括相关领域知识的知识图谱来发现关键的中间步骤。为了验证SSC CoT,我们提出了一个新的数据集TriMaster100,专门用于复杂的三角问题。该数据集包含100个问题,每个解决方案都被分解为评分的中间步骤,有助于对数学推理过程进行全面评估。在TriMaster100上,SSCCoT将最先进方法的有效性提高了三倍。此外,我们在广泛认可的复杂数学问题数据集MATH level 5上对SSCCoT进行了基准测试,其准确率超过了第二好的方法7.2%。代码和TriMaster100数据集可在以下位置找到:https://github.com/zhao-zilong/ssc-cot

1 引言

2 相关工作

3 TriMaster100数据集

4 循序渐进的自洽思维链

5 实验

6 结论

在这

已下架不支持订阅

one@raspberrypi:~ $ v4l2-ctl -d /dev/video0 --list-formats-ext ioctl: VIDIOC_ENUM_FMT Type: Video Capture [0]: 'YUYV' (YUYV 4:2:2) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [1]: 'UYVY' (UYVY 4:2:2) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [2]: 'YVYU' (YVYU 4:2:2) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [3]: 'VYUY' (VYUY 4:2:2) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [4]: 'RGBP' (16-bit RGB 5-6-5) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [5]: 'RGBR' (16-bit RGB 5-6-5 BE) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [6]: 'RGBO' (16-bit A/XRGB 1-5-5-5) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [7]: 'RGBQ' (16-bit A/XRGB 1-5-5-5 BE) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [8]: 'RGB3' (24-bit RGB 8-8-8) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [9]: 'BGR3' (24-bit BGR 8-8-8) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [10]: 'RGB4' (32-bit A/XRGB 8-8-8-8) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [11]: 'BA81' (8-bit Bayer BGBG/GRGR) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [12]: 'GBRG' (8-bit Bayer GBGB/RGRG) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [13]: 'GRBG' (8-bit Bayer GRGR/BGBG) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [14]: 'RGGB' (8-bit Bayer RGRG/GBGB) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [15]: 'pBAA' (10-bit Bayer BGBG/GRGR Packed) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [16]: 'BG10' (10-bit Bayer BGBG/GRGR) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [17]: 'pGAA' (10-bit Bayer GBGB/RGRG Packed) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [18]: 'GB10' (10-bit Bayer GBGB/RGRG) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [19]: 'pgAA' (10-bit Bayer GRGR/BGBG Packed) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [20]: 'BA10' (10-bit Bayer GRGR/BGBG) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [21]: 'pRAA' (10-bit Bayer RGRG/GBGB Packed) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [22]: 'RG10' (10-bit Bayer RGRG/GBGB) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [23]: 'pBCC' (12-bit Bayer BGBG/GRGR Packed) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [24]: 'BG12' (12-bit Bayer BGBG/GRGR) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [25]: 'pGCC' (12-bit Bayer GBGB/RGRG Packed) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [26]: 'GB12' (12-bit Bayer GBGB/RGRG) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [27]: 'pgCC' (12-bit Bayer GRGR/BGBG Packed) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [28]: 'BA12' (12-bit Bayer GRGR/BGBG) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [29]: 'pRCC' (12-bit Bayer RGRG/GBGB Packed) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [30]: 'RG12' (12-bit Bayer RGRG/GBGB) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [31]: 'pBEE' (14-bit Bayer BGBG/GRGR Packed) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [32]: 'BG14' (14-bit Bayer BGBG/GRGR) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [33]: 'pGEE' (14-bit Bayer GBGB/RGRG Packed) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [34]: 'GB14' (14-bit Bayer GBGB/RGRG) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [35]: 'pgEE' (14-bit Bayer GRGR/BGBG Packed) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [36]: 'GR14' (14-bit Bayer GRGR/BGBG) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [37]: 'pREE' (14-bit Bayer RGRG/GBGB Packed) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [38]: 'RG14' (14-bit Bayer RGRG/GBGB) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [39]: 'BYR2' (16-bit Bayer BGBG/GRGR) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [40]: 'GB16' (16-bit Bayer GBGB/RGRG) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [41]: 'GR16' (16-bit Bayer GRGR/BGBG) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [42]: 'RG16' (16-bit Bayer RGRG/GBGB) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [43]: 'GREY' (8-bit Greyscale) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [44]: 'Y10P' (10-bit Greyscale (MIPI Packed)) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [45]: 'Y10 ' (10-bit Greyscale) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [46]: 'Y12P' (12-bit Greyscale (MIPI Packed)) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [47]: 'Y12 ' (12-bit Greyscale) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [48]: 'Y14P' (14-bit Greyscale (MIPI Packed)) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [49]: 'Y14 ' (14-bit Greyscale) Size: Stepwise 16x16 - 16376x16376 with step 1/1 [50]: 'Y16 ' (16-bit Greyscale) Size: Stepwise 16x16 - 16376x16376 with step 1/1
09-25
评论
成就一亿技术人!
拼手气红包6.0元
还能输入1000个字符
 
红包 添加红包
表情包 插入表情
 条评论被折叠 查看
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包

打赏作者

UnknownBody

你的鼓励将是我创作的最大动力

¥1 ¥2 ¥4 ¥6 ¥10 ¥20
扫码支付:¥1
获取中
扫码支付

您的余额不足,请更换扫码支付或充值

打赏作者

实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值