Simon Guo

Kevin: Multi-Turn RL for Generating CUDA Kernels

Jul 16, 2025
Via arXiv

Think, Prune, Train, Improve: Scaling Reasoning without Scaling Models

Apr 25, 2025
Via arXiv

BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts

Aug 15, 2024
Via arXiv