DeepSeek R1's recipe to replicate o1 and the future of reasoning LMs

DeepSeek V3 and the cost of frontier AI models

A look at Apple's technical approach to AI, including core model performance, etc.

Q* Hypothesis: Enhancing Reasoning, Rewards, and Synthetic Data

The AI research job market

Llama 2: an open-source LLM

How RLHF Works