2026
论文总结FACT-AUDIT:An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models 论文总结-AGENTIF:Benchmarking Instruction Following of Large Language Models in Agentic Scenarios 论文总结-Evolving Deception:When Agents Evolve, Deception Wins 论文总结-From Self-Evolving Synthetic Data to Verifiable-Reward RL:Post-Training Multi-turn Interactive Tool-Using Agents 论文总结-Agent World Model:Infinity Synthetic Environments for Agentic Reinforcement Learning 论文总结-EnvScaler:Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis 论文总结-Simulating Environments with Reasoning Models for Agent Training 论文总结-Close the Loop:Synthesizing Infinite Tool-Use Data via Multi-Agent Role-Playing 论文总结-Scaling Agent Learning via Experience Synthesis