arxiv.org/abs/2402.03300

Preview meta tags from the arxiv.org website.

Linked Hostnames

Thumbnail

Search Engine Appearance

Google

https://arxiv.org/abs/2402.03300

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Mathematical reasoning poses a significant challenge for language models due to its complex and structured nature. In this paper, we introduce DeepSeekMath 7B, which continues pre-training DeepSeek-Coder-Base-v1.5 7B with 120B math-related tokens sourced from Common Crawl, together with natural language and code data. DeepSeekMath 7B has achieved an impressive score of 51.7% on the competition-level MATH benchmark without relying on external toolkits and voting techniques, approaching the performance level of Gemini-Ultra and GPT-4. Self-consistency over 64 samples from DeepSeekMath 7B achieves 60.9% on MATH. The mathematical reasoning capability of DeepSeekMath is attributed to two key factors: First, we harness the significant potential of publicly available web data through a meticulously engineered data selection pipeline. Second, we introduce Group Relative Policy Optimization (GRPO), a variant of Proximal Policy Optimization (PPO), that enhances mathematical reasoning abilities while concurrently optimizing the memory usage of PPO.

Bing

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

https://arxiv.org/abs/2402.03300

DuckDuckGo

https://arxiv.org/abs/2402.03300

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

General Meta Tags
25
- title
  [2402.03300] DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
- title
  open search
- title
  open navigation menu
- title
  contact arXiv
- title
  subscribe to arXiv mailings
Open Graph Meta Tags
10
- og:type
  website
- og:site_name
  arXiv.org
- og:title
  DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
- og:url
  https://arxiv.org/abs/2402.03300v3
- og:image
  /static/browse/0.3.4/images/arxiv-logo-fb.png
Twitter Meta Tags
6
- twitter:site
  @arxiv
- twitter:card
  summary
- twitter:title
  DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open...
- twitter:description
  Mathematical reasoning poses a significant challenge for language models due to its complex and structured nature. In this paper, we introduce DeepSeekMath 7B, which continues pre-training...
- twitter:image
  https://static.arxiv.org/icons/twitter/arxiv-logo-twitter-square.png
Link Tags
12
- apple-touch-icon
  /static/browse/0.3.4/images/icons/apple-touch-icon.png
- canonical
  /abs/2402.03300
- icon
  /static/browse/0.3.4/images/icons/favicon-32x32.png
- icon
  /static/browse/0.3.4/images/icons/favicon-16x16.png
- manifest
  /static/browse/0.3.4/images/icons/site.webmanifest

arxiv.org/abs/2402.03300

Linked Hostnames

Thumbnail

Search Engine Appearance

Google

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Bing

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

DuckDuckGo

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

General Meta Tags

Open Graph Meta Tags

Twitter Meta Tags

Link Tags

Links