Latest — Apr 29, 2025 Not for private gain This week: Reinforcement Learning with Verifiable Rewards, naming eras, model welfare, investors welfare
Who controls what these institutions do? This week: The 2025 AI Index Report, Understanding of trade, What failure looks like
When the model resorts to unfaithful reasoning This week: Biology of a LLM, entrepreneurs as theorists, a grammar of hypotheses
If we tell them to maximize profit, they will This week: Economic agents, cost of complexity, complexity comes and goes
A superhuman ability to bullshit This week: Bullshit, a comprehensive survey, labour market disruption, and possibility spaces
Verification, backtracking, subgoal setting This week: State anxiety, SimpleQA, self-improving reasoners (verification, backtracking, subgoal setting and something else…)
Feedback loops with the outside world This week: Catastrophic risks, external validity, trust, machine assisted proofs