SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models
Paper
• 2506.04180
• Published
• 34
AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation
Paper
• 2506.10540
• Published
• 37
AutoMind: Adaptive Knowledgeable Agent for Automated Data Science
Paper
• 2506.10974
• Published
• 19
SPAR: Scholar Paper Retrieval with LLM-based Agents for Enhanced Academic Search
Paper
• 2507.15245
• Published
• 11
GUI-G^2: Gaussian Reward Modeling for GUI Grounding
Paper
• 2507.15846
• Published
• 133
ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents
Paper
• 2507.22827
• Published
• 100
Phi-Ground Tech Report: Advancing Perception in GUI Grounding
Paper
• 2507.23779
• Published
• 45
SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution
Paper
• 2507.23348
• Published
• 12
agentica-org/DeepSWE-Preview
Text Generation
• Updated
• 437
• 192
AWorld: Dynamic Multi-Agent System with Stable Maneuvering for Robust GAIA Problem Solving
Paper
• 2508.09889
• Published
• 32
Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory
Paper
• 2508.09736
• Published
• 58
A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems
Paper
• 2508.07407
• Published
• 98
Efficient Agents: Building Effective Agents While Reducing Cost
Paper
• 2508.02694
• Published
• 86
SSRL: Self-Search Reinforcement Learning
Paper
• 2508.10874
• Published
• 97
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL
Paper
• 2508.13167
• Published
• 129
Mobile-Agent-v3: Foundamental Agents for GUI Automation
Paper
• 2508.15144
• Published
• 64
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs
Paper
• 2508.16153
• Published
• 160
AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications
Paper
• 2508.16279
• Published
• 53
CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning
Paper
• 2508.20096
• Published
• 37
rStar2-Agent: Agentic Reasoning Technical Report
Paper
• 2508.20722
• Published
• 117
AWorld: Orchestrating the Training Recipe for Agentic AI
Paper
• 2508.20404
• Published
• 38
UItron: Foundational GUI Agent with Advanced Perception and Planning
Paper
• 2508.21767
• Published
• 12
GTA1: GUI Test-time Scaling Agent
Paper
• 2507.05791
• Published
• 27
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning
Paper
• 2509.02544
• Published
• 125
Morae: Proactively Pausing UI Agents for User Choices
Paper
• 2508.21456
• Published
• 5
DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks
Paper
• 2509.01396
• Published
• 58
Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents
Paper
• 2509.06917
• Published
• 43
Scaling up Multi-Turn Off-Policy RL and Multi-Agent Tree Search for LLM Step-Provers
Paper
• 2509.06493
• Published
• 12
F1: A Vision-Language-Action Model Bridging Understanding and Generation to Actions
Paper
• 2509.06951
• Published
• 32
EnvX: Agentize Everything with Agentic AI
Paper
• 2509.08088
• Published
• 8
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning
Paper
• 2509.08755
• Published
• 57
Alibaba-NLP/Tongyi-DeepResearch-30B-A3B
Text Generation
• 31B • Updated
• 41.5k
• 801
WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents
Paper
• 2509.13309
• Published
• 67
Paper
• 2509.10147
• Published
• 27
QuantAgent: Price-Driven Multi-Agent LLMs for High-Frequency Trading
Paper
• 2509.09995
• Published
• 16
Image-Text-to-Text
• 8B • Updated
• 251
• 12
VoiceAssistant-Eval: Benchmarking AI Assistants across Listening, Speaking, and Viewing
Paper
• 2509.22651
• Published
• 23
ACON: Optimizing Context Compression for Long-horizon LLM Agents
Paper
• 2510.00615
• Published
• 34
StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?
Paper
• 2510.02209
• Published
• 56
CoDA: Agentic Systems for Collaborative Data Visualization
Paper
• 2510.03194
• Published
• 30
Agent Learning via Early Experience
Paper
• 2510.08558
• Published
• 273
Training-Free Group Relative Policy Optimization
Paper
• 2510.08191
• Published
• 45
CoDA: Coding LM via Diffusion Adaptation
Paper
• 2510.03270
• Published
• 43
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use
Paper
• 2510.05592
• Published
• 107
Don't Just Fine-tune the Agent, Tune the Environment
Paper
• 2510.10197
• Published
• 30
Demystifying Reinforcement Learning in Agentic Reasoning
Paper
• 2510.11701
• Published
• 32
Agentic Entropy-Balanced Policy Optimization
Paper
• 2510.14545
• Published
• 106
PokeeAI/pokee_research_7b
Text Generation
• 8B • Updated
• 350
• 100
Text Generation
• Updated
• 512k
• 1.49k
moonshotai/Kimi-Linear-48B-A3B-Instruct
Text Generation
• 49B • Updated
• 27.9k
• 543
HyperClick: Advancing Reliable GUI Grounding via Uncertainty Calibration
Paper
• 2510.27266
• Published
• 21
IterResearch: Rethinking Long-Horizon Agents via Markovian State Reconstruction
Paper
• 2511.07327
• Published
• 78
AIonopedia: an LLM agent orchestrating multimodal learning for ionic liquid discovery
Paper
• 2511.11257
• Published
• 25
CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards
Paper
• 2510.08529
• Published
• 19
MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism
Paper
• 2511.11373
• Published
• 14
UI2Code^N: A Visual Language Model for Test-Time Scalable Interactive UI-to-Code Generation
Paper
• 2511.08195
• Published
• 34
cerebras/MiniMax-M2-REAP-162B-A10B
Text Generation
• Updated
• 71
• 77
Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning
Paper
• 2511.16043
• Published
• 109
What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity
Paper
• 2511.15593
• Published
• 58
Tongyi DeepResearch Technical Report
Paper
• 2510.24701
• Published
• 101
AgentFold: Long-Horizon Web Agents with Proactive Context Management
Paper
• 2510.24699
• Published
• 71
Search Self-play: Pushing the Frontier of Agent Capability without Supervision
Paper
• 2510.18821
• Published
• 18
Multi-Agent Deep Research: Training Multi-Agent Systems with M-GRPO
Paper
• 2511.13288
• Published
• 19
DRAFT-RL: Multi-Agent Chain-of-Draft Reasoning for Reinforcement Learning-Enhanced LLMs
Paper
• 2511.20468
• Published
Unlocking the Power of Multi-Agent LLM for Reasoning: From Lazy Agents to Deliberation
Paper
• 2511.02303
• Published
• 1
AgentRL: Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework
Paper
• 2510.04206
• Published
• 3
MARS: Reinforcing Multi-Agent Reasoning of LLMs through Self-Play in Strategic Games
Paper
• 2510.15414
• Published
• 1
Multi-Agent Tool-Integrated Policy Optimization
Paper
• 2510.04678
• Published
• 31
Agentic Learner with Grow-and-Refine Multimodal Semantic Memory
Paper
• 2511.21678
• Published
• 12
Latent Collaboration in Multi-Agent Systems
Paper
• 2511.20639
• Published
• 121
Guided Self-Evolving LLMs with Minimal Human Supervision
Paper
• 2512.02472
• Published
• 55
open-thoughts/OpenThinker-Agent-v1
Text Generation
• Updated
• 421
• 94
PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing
Paper
• 2512.02589
• Published
• 72
DeepCode: Open Agentic Coding
Paper
• 2512.07921
• Published
• 33
nvidia/Nemotron-Orchestrator-8B
Text Generation
• Updated
• 14.7k
• 555
WebOperator: Action-Aware Tree Search for Autonomous Agents in Web Environment
Paper
• 2512.12692
• Published
• 14
A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning
Paper
• 2512.14442
• Published
• 11
Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning
Paper
• 2508.19828
• Published
• 8
Step-DeepResearch Technical Report
Paper
• 2512.20491
• Published
• 86
Paper
• 2512.16301
• Published
• 106
Nested Browser-Use Learning for Agentic Information Seeking
Paper
• 2512.23647
• Published
• 19
Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem
Paper
• 2512.24873
• Published
• 105
Close the Loop: Synthesizing Infinite Tool-Use Data via Multi-Agent Role-Playing
Paper
• 2512.23611
• Published
• 3
Klear-AgentForge: Forging Agentic Intelligence through Posttraining Scaling
Paper
• 2511.05951
• Published
AgentMath: Empowering Mathematical Reasoning for Large Language Models via Tool-Augmented Agent
Paper
• 2512.20745
• Published
Can We Predict Before Executing Machine Learning Agents?
Paper
• 2601.05930
• Published
• 27
SmartSearch: Process Reward-Guided Query Refinement for Search Agents
Paper
• 2601.04888
• Published
• 10
Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards
Paper
• 2601.06021
• Published
• 47
AT^2PO: Agentic Turn-based Policy Optimization via Tree Search
Paper
• 2601.04767
• Published
• 28
DocDancer: Towards Agentic Document-Grounded Information Seeking
Paper
• 2601.05163
• Published
• 5
Agentic Rubrics as Contextual Verifiers for SWE Agents
Paper
• 2601.04171
• Published
• 12
Agentic Reasoning for Large Language Models
Paper
• 2601.12538
• Published
• 197
LLM-in-Sandbox Elicits General Agentic Intelligence
Paper
• 2601.16206
• Published
• 84
Behavior Knowledge Merge in Reinforced Agentic Models
Paper
• 2601.13572
• Published
• 24
Multi-agent cooperation through in-context co-player inference
Paper
• 2602.16301
• Published
• 22
Towards a Science of AI Agent Reliability
Paper
• 2602.16666
• Published
• 12