
LLMs
Average article quality score: 95
Large Language Models
ShadyPi
Writing blog posts as notes
[Microsoft] A Comparative Study of DSL Code Generation
Natural Language to Domain Specific Language · Original · 2025-04-08 04:57:44 · 809 views · 0 comments
Proximal Policy Optimization (PPO) in LLM Training
PPO in LLM Training · Original · 2025-01-28 15:16:22 · 1031 views · 0 comments
[OpenAI Codex] Evaluating Large Language Models Trained on Code
Codex from OpenAI · Original · 2025-01-21 12:17:38 · 1076 views · 0 comments
[DeepMind AlphaCode] Competition-Level Code Generation with AlphaCode
DeepMind AlphaCode · Original · 2025-01-22 14:23:44 · 602 views · 0 comments
[High-Flyer DeepSeek-R1] DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-R1 · Original · 2025-01-30 05:08:40 · 903 views · 0 comments
[GRPO] DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
DeepSeekMath: GRPO · Original · 2025-01-28 15:20:43 · 946 views · 1 comment