VoladorLuYu's Collections: LLM+Math
MathScale: Scaling Instruction Tuning for Mathematical Reasoning
Paper
• 2403.02884
• Published
• 17
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Paper
• 2402.03300
• Published
• 141
Improving Small Language Models' Mathematical Reasoning via Mix Thoughts Distillation
Paper
• 2401.11864
• Published
• 2
Common 7B Language Models Already Possess Strong Math Capabilities
Paper
• 2403.04706
• Published
• 18
Boosting of Thoughts: Trial-and-Error Problem Solving with Large Language Models
Paper
• 2402.11140
• Published
SAAS: Solving Ability Amplification Strategy for Enhanced Mathematical Reasoning in Large Language Models
Paper
• 2404.03887
• Published
• 1
Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving
Paper
• 2405.12205
• Published
Improve Mathematical Reasoning in Language Models by Automated Process Supervision
Paper
• 2406.06592
• Published
• 29
Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models
Paper
• 2406.17294
• Published
• 11
DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning
Paper
• 2407.04078
• Published
• 21
We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning?
Paper
• 2407.01284
• Published
• 81
Step-Controlled DPO: Leveraging Stepwise Error for Enhanced Mathematical Reasoning
Paper
• 2407.00782
• Published
• 24
InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning
Paper
• 2409.12568
• Published
• 50
Alchemy: Amplifying Theorem-Proving Capability through Symbolic Mutation
Paper
• 2410.15748
• Published
• 13
Math Neurosurgery: Isolating Language Models' Math Reasoning Abilities Using Only Forward Passes
Paper
• 2410.16930
• Published
• 7
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
Paper
• 2409.12183
• Published
• 39
Flow-DPO: Improving LLM Mathematical Reasoning through Online Multi-Agent Learning
Paper
• 2410.22304
• Published
• 18
O1 Replication Journey: A Strategic Progress Report -- Part 1
Paper
• 2410.18982
• Published
• 3
Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond
Paper
• 2503.10460
• Published
• 30