Best AI papers explained

A podcast by Enoch H. Kang

550 Episodes

PLAN-AND-ACT: LLM Agent Planning with Synthetic Data
Published: 08.04.2025
SEARCH-R1: LLMs Learn to Reason and Search via Reinforcement Learning
Published: 08.04.2025
The Theory of the Firm: Information, Incentives, and Organization
Published: 08.04.2025
Four Formalizable Theories of the Firm
Published: 08.04.2025
Efficient Tool Use with Chain-of-Abstraction Reasoning
Published: 06.04.2025
CodeTool: Process Supervision for Enhanced LLM Tool Invocation
Published: 06.04.2025
Evaluating LLM Agents in Multi-Turn Conversations: A Survey
Published: 06.04.2025
Epistemic Alignment in User-LLM Knowledge Delivery
Published: 06.04.2025
MCP is (not) all you need
Published: 06.04.2025
AI, Human Skills, and Competitive Advantage in Chess
Published: 05.04.2025
Inference-Time Scaling for Generalist Reward Modeling
Published: 04.04.2025
Optimal Pure Exploration in Linear Bandits via Sampling
Published: 04.04.2025
Presidential Address: The Economist as Designer in the Innovation Process for Socially Impactful Digital Products
Published: 04.04.2025
Emergent Symbolic Mechanisms for Reasoning in Large Language Models
Published: 03.04.2025
Inference-Time Alignment: Coverage, Scaling, and Optimality
Published: 03.04.2025
Sharpe Ratio-Guided Active Learning for Preference Optimization
Published: 03.04.2025
Active Learning for Adaptive In-Context Prompt Design
Published: 03.04.2025
Visual Chain-of-Thought Reasoning for Vision-Language-Action Models
Published: 03.04.2025
On the Biology of a Large Language Model
Published: 01.04.2025
Async-TB: Asynchronous Trajectory Balance for Scalable LLM RL
Published: 01.04.2025

Cut through the noise. We curate and break down the most important AI papers so you don’t have to.
