Best AI papers explained
A podcast by Enoch H. Kang
512 Episoade
-
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models
Publicat: 11.10.2025 -
Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs
Publicat: 09.10.2025 -
Learning dynamics of LLM finetuning
Publicat: 09.10.2025 -
Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF
Publicat: 09.10.2025 -
OpenAI Agent Builder and n8n: Orchestrating Reasoning Versus Automating Process
Publicat: 08.10.2025 -
Training Agents Inside of Scalable World Models
Publicat: 08.10.2025 -
Small Language Models are the Future of Agentic AI
Publicat: 07.10.2025 -
Activation Steering in Generative Settings via Contrastive Causal Mediation Analysis
Publicat: 06.10.2025 -
Eliciting Secret Knowledge from Language Models
Publicat: 06.10.2025 -
Temporal difference flow
Publicat: 06.10.2025 -
Personalized reasoning: just-in-time personalization and why LLMs fail at it
Publicat: 05.10.2025 -
Prompt Curriculum Learning for Efficient LLM Post-Training
Publicat: 05.10.2025 -
Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning
Publicat: 04.10.2025 -
Enhancing Personalized Multi-Turn Dialogue with Curiosity Reward
Publicat: 04.10.2025 -
Learning to summarize user information for personalized reinforcement learning from human feedback
Publicat: 04.10.2025 -
Distributional Preference Learning: Understanding and Accounting for Hidden Context in RLHF
Publicat: 03.10.2025 -
LIMI: Less is More for Agency
Publicat: 01.10.2025 -
LoRA Without Regret
Publicat: 01.10.2025 -
Actor-Critic without Actor: Critic-Guided Denoising for RL
Publicat: 29.09.2025 -
DELTA-Code: How Does RL Unlock and Transfer New Programming Algorithms in LLMs?
Publicat: 29.09.2025
Cut through the noise. We curate and break down the most important AI papers so you don’t have to.
