HUIDU.io- Find Resources, Discuss Cooperation, Join HUIDU

研究人員使用錯誤中學習的人類學習法來訓練AI模型

Share

2023-11-06

研究人員提出從錯誤中學習（Learning from Mistake）的LeMA訓練法，以提升開源LLM在數學解題等推理任務上的效能

_arXiv:2310.20689 [cs.CL]

研究人員以2種問題資料集（GSM8K及MATH）實驗LeMa方法對5個開源LLM的效果，並比較只以CoT資料集來微調LLM的效果。結果顯示，以LLaMA-2-70B為例，它在兩種方法下，在GSM8K的準確率分別為83.5%及81.4%，在MATH則分別為25.0%及23.6%。此外，他們也實驗了WizardMath及MetaMath二種專門領域LLM的準確率，在GSM8K資料集測試中，獲致84.2%及85.4% pass@1 準確率，而MATH資料集則達27.1%及26.9%，這個成績超越非執行（non-execution）開源模型在同樣任務中的表現。

此外，他們發現，在同樣資料量的訓練集下，LeMA方法也比純CoT微調來得好。此外，整合CoT資料及修正資料，微調效果更優於單一的微調結果。

研究人員已將LeMA的程式碼、模型、資料公開在GitHub上。

Popular selection

1spin4win grows its Latin American presence by partnering with Fortuna Juegos

B2B Tech Infrastructure Gains Momentum in Philippine Gaming Sector

PropellerAds Positions Itself as a Go-To Traffic Source for iGaming Advertisers Ahead of a High-Demand Season

Manila delivers: Highlights from SiGMA Asia 2026

SBC Summit Canada to Make Player Safety a Key Pillar of 2026 Agenda

New Jersey July Gambling Revenue Hits $606M, Sweeps Casinos Banned

GAT Expo CDMX 2026 Kicks Off Today in Mexico with a Sold-Out Opening Reception at Big Bola Casino Santa Fe

Online gambling, crypto pose ongoing money laundering risks in Philippines, analyst says

JILI Partners with Cricket Legend AB de Villiers (ABD) to Launch Exclusive Branded Game Series 100% 11

Vietnam’s Controlled Gaming Shift Gains Ground, But Domestic Demand Still Lags

GAT Expo Puerto Rico Will Pulse with the New Era of Gaming in the Caribbean

PropellerAds Shared a New iGaming Case Study: 97,674 Installs and 12,701 Deposits in 3 Months

Brazil Proposes Raising Gambling Tax Rate to 24%, With Revenue Allocated to Social Security and Healthcare

Indiana online casino bill stalls in House committee

Vietnam's tightening online gaming policy creates new market opportunities

Disclaimer：

Details

GROWTH DRIVEN GLOBAL PTE. LTD. 202618650K

101 THOMSON ROAD, #28-03A, UNITED SQUARE, SINGAPORE 307591

[email protected]

Copyright 2026 HuiDu

About HUIDU Contact Us Disclaimer Terms Privacy Policy Terms of Use Integrity Reporting Law Enforcement Requests