Picture for Benjamin Van Roy

Benjamin Van Roy

Stanford University Department of Electrical Engineering

Granular feedback merits sophisticated aggregation

Add code
Jul 16, 2025
Viaarxiv icon

Choice between Partial Trajectories

Add code
Oct 30, 2024
Figure 1 for Choice between Partial Trajectories
Figure 2 for Choice between Partial Trajectories
Figure 3 for Choice between Partial Trajectories
Figure 4 for Choice between Partial Trajectories
Viaarxiv icon

Aligning AI Agents via Information-Directed Sampling

Add code
Oct 18, 2024
Figure 1 for Aligning AI Agents via Information-Directed Sampling
Figure 2 for Aligning AI Agents via Information-Directed Sampling
Figure 3 for Aligning AI Agents via Information-Directed Sampling
Figure 4 for Aligning AI Agents via Information-Directed Sampling
Viaarxiv icon

The Need for a Big World Simulator: A Scientific Challenge for Continual Learning

Add code
Aug 06, 2024
Figure 1 for The Need for a Big World Simulator: A Scientific Challenge for Continual Learning
Figure 2 for The Need for a Big World Simulator: A Scientific Challenge for Continual Learning
Figure 3 for The Need for a Big World Simulator: A Scientific Challenge for Continual Learning
Figure 4 for The Need for a Big World Simulator: A Scientific Challenge for Continual Learning
Viaarxiv icon

Information-Theoretic Foundations for Machine Learning

Add code
Jul 18, 2024
Figure 1 for Information-Theoretic Foundations for Machine Learning
Figure 2 for Information-Theoretic Foundations for Machine Learning
Figure 3 for Information-Theoretic Foundations for Machine Learning
Figure 4 for Information-Theoretic Foundations for Machine Learning
Viaarxiv icon

Exploration Unbound

Add code
Jul 16, 2024
Figure 1 for Exploration Unbound
Viaarxiv icon

Satisficing Exploration for Deep Reinforcement Learning

Add code
Jul 16, 2024
Figure 1 for Satisficing Exploration for Deep Reinforcement Learning
Figure 2 for Satisficing Exploration for Deep Reinforcement Learning
Figure 3 for Satisficing Exploration for Deep Reinforcement Learning
Figure 4 for Satisficing Exploration for Deep Reinforcement Learning
Viaarxiv icon

Information-Theoretic Foundations for Neural Scaling Laws

Add code
Jun 28, 2024
Figure 1 for Information-Theoretic Foundations for Neural Scaling Laws
Viaarxiv icon

Adaptive Crowdsourcing Via Self-Supervised Learning

Add code
Feb 02, 2024
Viaarxiv icon

Efficient Exploration for LLMs

Add code
Feb 01, 2024
Figure 1 for Efficient Exploration for LLMs
Figure 2 for Efficient Exploration for LLMs
Figure 3 for Efficient Exploration for LLMs
Figure 4 for Efficient Exploration for LLMs
Viaarxiv icon