Yihua Zhang

Post_deepseek_r1

January 20, 2025

My new technical post, "From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning" (Chinese title, translated: "Long Awaited, Finally Here: How DeepSeek-R1 Achieves Complex Reasoning Through Reinforcement Learning"), is now online! Both English and Chinese versions are available.

© Copyright 2026 Yihua Zhang. Powered by Jekyll. Last updated: January 26, 2026.