Rank-GRPO: Training LLM-based Conversational Recommender Systems with Reinforcement Learning Paper • 2510.20150 • Published Oct 23, 2025 • 6
SemCoT: Accelerating Chain-of-Thought Reasoning through Semantically-Aligned Implicit Tokens Paper • 2510.24940 • Published Oct 28, 2025 • 18
PhysicsAgentABM: Physics-Guided Generative Agent-Based Modeling Paper • 2602.06030 • Published 29 days ago