How a big shift in training LLMs led to a capability explosion Ars Technica 2025-07-07 11:00 Source Original site Reinforcement learning, explained with a minimum of math and jargon.