This video covers a work on long-context LLM reasoning: Wang S, Zhang G, Zhang L L, et al. Loongrl: Reinforcement learning for advanced reasoning over long contexts[J]. arXiv preprint arXiv:2510.19363, 2025. Slides: https://gamma.app/docs/LoongRL-efb44m2qoum4m28 Code: https://github.com