
Microsoft AI Introduces rStar2-Agent: A 14B Math Reasoning Model Trained with Agentic Reinforcement Learning
Microsoft unveils rStar2-Agent, a 14B math reasoning model trained with agentic reinforcement learning on just 64 MI300X GPUs, outperforming LLMs such as Phi-4 and the much larger DeepSeek-R1.