Raisonnement

Sign in to follow this category

Reasoning models and test-time compute

Why spend compute at inference: CoT, o1/o3, DeepSeek-R1 (GRPO, the "aha moment"), best-of-N, PRMs, search, scaling laws, and the limits.

2026-06-19 18 min read