Agent

JuanRafap 's Collections

Fondation model

RAG

World models

Bim

updated 2 days ago

Upvote

SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models

Paper • 2506.04180 • Published Jun 4, 2025 • 34
AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation

Paper • 2506.10540 • Published Jun 12, 2025 • 37
AutoMind: Adaptive Knowledgeable Agent for Automated Data Science

Paper • 2506.10974 • Published Jun 12, 2025 • 19
SPAR: Scholar Paper Retrieval with LLM-based Agents for Enhanced Academic Search

Paper • 2507.15245 • Published Jul 21, 2025 • 11
GUI-G^2: Gaussian Reward Modeling for GUI Grounding

Paper • 2507.15846 • Published Jul 21, 2025 • 135
ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents

Paper • 2507.22827 • Published Jul 30, 2025 • 101
Phi-Ground Tech Report: Advancing Perception in GUI Grounding

Paper • 2507.23779 • Published Jul 31, 2025 • 46
SWE-Debate: Competitive Multi-Agent Debate for Software Issue Resolution

Paper • 2507.23348 • Published Jul 31, 2025 • 12
agentica-org/DeepSWE-Preview

Text Generation • Updated Jul 3, 2025 • 648 • • 195
AWorld: Dynamic Multi-Agent System with Stable Maneuvering for Robust GAIA Problem Solving

Paper • 2508.09889 • Published Aug 13, 2025 • 32
Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory

Paper • 2508.09736 • Published Aug 13, 2025 • 58
A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems

Paper • 2508.07407 • Published Aug 10, 2025 • 99
Efficient Agents: Building Effective Agents While Reducing Cost

Paper • 2508.02694 • Published Jul 24, 2025 • 86
SSRL: Self-Search Reinforcement Learning

Paper • 2508.10874 • Published Aug 14, 2025 • 97
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6, 2025 • 129
Mobile-Agent-v3: Foundamental Agents for GUI Automation

Paper • 2508.15144 • Published Aug 21, 2025 • 65
AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published Aug 22, 2025 • 162
AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications

Paper • 2508.16279 • Published Aug 22, 2025 • 61
CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning

Paper • 2508.20096 • Published Aug 27, 2025 • 37
rStar2-Agent: Agentic Reasoning Technical Report

Paper • 2508.20722 • Published Aug 28, 2025 • 118
AWorld: Orchestrating the Training Recipe for Agentic AI

Paper • 2508.20404 • Published Aug 28, 2025 • 38
UItron: Foundational GUI Agent with Advanced Perception and Planning

Paper • 2508.21767 • Published Aug 29, 2025 • 12
GTA1: GUI Test-time Scaling Agent

Paper • 2507.05791 • Published Jul 8, 2025 • 27
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2, 2025 • 127
Morae: Proactively Pausing UI Agents for User Choices

Paper • 2508.21456 • Published Aug 29, 2025 • 5
DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks

Paper • 2509.01396 • Published Sep 1, 2025 • 58
Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents

Paper • 2509.06917 • Published Sep 8, 2025 • 44
Scaling up Multi-Turn Off-Policy RL and Multi-Agent Tree Search for LLM Step-Provers

Paper • 2509.06493 • Published Sep 8, 2025 • 13
F1: A Vision-Language-Action Model Bridging Understanding and Generation to Actions

Paper • 2509.06951 • Published Sep 8, 2025 • 33
EnvX: Agentize Everything with Agentic AI

Paper • 2509.08088 • Published Sep 9, 2025 • 8
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Paper • 2509.08755 • Published Sep 10, 2025 • 56
Alibaba-NLP/Tongyi-DeepResearch-30B-A3B

Text Generation • 31B • Updated Oct 10, 2025 • 94.4k • 811
WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents

Paper • 2509.13309 • Published Sep 16, 2025 • 67
Virtual Agent Economies

Paper • 2509.10147 • Published Sep 12, 2025 • 27
QuantAgent: Price-Driven Multi-Agent LLMs for High-Frequency Trading

Paper • 2509.09995 • Published Sep 12, 2025 • 16
hkust-nlp/WebExplorer-8B

Image-Text-to-Text • 8B • Updated Sep 11, 2025 • 139 • 14
VoiceAssistant-Eval: Benchmarking AI Assistants across Listening, Speaking, and Viewing

Paper • 2509.22651 • Published Sep 26, 2025 • 23
ACON: Optimizing Context Compression for Long-horizon LLM Agents

Paper • 2510.00615 • Published Oct 1, 2025 • 35
StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

Paper • 2510.02209 • Published Oct 2, 2025 • 57
CoDA: Agentic Systems for Collaborative Data Visualization

Paper • 2510.03194 • Published Oct 3, 2025 • 30
Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 276
Training-Free Group Relative Policy Optimization

Paper • 2510.08191 • Published Oct 9, 2025 • 46
CoDA: Coding LM via Diffusion Adaptation

Paper • 2510.03270 • Published Sep 27, 2025 • 43
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

Paper • 2510.05592 • Published Oct 7, 2025 • 110
Don't Just Fine-tune the Agent, Tune the Environment

Paper • 2510.10197 • Published Oct 11, 2025 • 30
Demystifying Reinforcement Learning in Agentic Reasoning

Paper • 2510.11701 • Published Oct 13, 2025 • 33
Agentic Entropy-Balanced Policy Optimization

Paper • 2510.14545 • Published Oct 16, 2025 • 108
PokeeAI/pokee_research_7b

Text Generation • 8B • Updated Oct 23, 2025 • 146 • • 100
MiniMaxAI/MiniMax-M2

Text Generation • 229B • Updated Dec 23, 2025 • 72.4k • • 1.49k
moonshotai/Kimi-Linear-48B-A3B-Instruct

Text Generation • 49B • Updated Dec 16, 2025 • 58.4k • • 559
HyperClick: Advancing Reliable GUI Grounding via Uncertainty Calibration

Paper • 2510.27266 • Published Oct 31, 2025 • 21
IterResearch: Rethinking Long-Horizon Agents via Markovian State Reconstruction

Paper • 2511.07327 • Published Nov 10, 2025 • 80
AIonopedia: an LLM agent orchestrating multimodal learning for ionic liquid discovery

Paper • 2511.11257 • Published Nov 14, 2025 • 25
CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards

Paper • 2510.08529 • Published Oct 9, 2025 • 19
MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism

Paper • 2511.11373 • Published Nov 14, 2025 • 14
UI2Code^N: A Visual Language Model for Test-Time Scalable Interactive UI-to-Code Generation

Paper • 2511.08195 • Published Nov 11, 2025 • 34
cerebras/MiniMax-M2-REAP-162B-A10B

Text Generation • 162B • Updated Nov 15, 2025 • 58 • 79
Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published Nov 20, 2025 • 110
What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity

Paper • 2511.15593 • Published Nov 19, 2025 • 59
Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published Oct 28, 2025 • 103
AgentFold: Long-Horizon Web Agents with Proactive Context Management

Paper • 2510.24699 • Published Oct 28, 2025 • 72
Search Self-play: Pushing the Frontier of Agent Capability without Supervision

Paper • 2510.18821 • Published Oct 21, 2025 • 19
Multi-Agent Deep Research: Training Multi-Agent Systems with M-GRPO

Paper • 2511.13288 • Published Nov 17, 2025 • 19
DRAFT-RL: Multi-Agent Chain-of-Draft Reasoning for Reinforcement Learning-Enhanced LLMs

Paper • 2511.20468 • Published Nov 25, 2025
Unlocking the Power of Multi-Agent LLM for Reasoning: From Lazy Agents to Deliberation

Paper • 2511.02303 • Published Nov 4, 2025 • 1
AgentRL: Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework

Paper • 2510.04206 • Published Oct 5, 2025 • 3
MARS: Reinforcing Multi-Agent Reasoning of LLMs through Self-Play in Strategic Games

Paper • 2510.15414 • Published Oct 17, 2025 • 1
Multi-Agent Tool-Integrated Policy Optimization

Paper • 2510.04678 • Published Oct 6, 2025 • 31
Agentic Learner with Grow-and-Refine Multimodal Semantic Memory

Paper • 2511.21678 • Published Nov 26, 2025 • 12
Latent Collaboration in Multi-Agent Systems

Paper • 2511.20639 • Published Nov 25, 2025 • 127
Guided Self-Evolving LLMs with Minimal Human Supervision

Paper • 2512.02472 • Published Dec 2, 2025 • 55
open-thoughts/OpenThinker-Agent-v1

Text Generation • Updated Jan 27 • 1.43k • 98
PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing

Paper • 2512.02589 • Published Dec 2, 2025 • 73
DeepCode: Open Agentic Coding

Paper • 2512.07921 • Published Dec 8, 2025 • 33
nvidia/Nemotron-Orchestrator-8B

Text Generation • Updated Dec 2, 2025 • 5.52k • • 564
WebOperator: Action-Aware Tree Search for Autonomous Agents in Web Environment

Paper • 2512.12692 • Published Dec 14, 2025 • 14
A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning

Paper • 2512.14442 • Published Dec 16, 2025 • 11
Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning

Paper • 2508.19828 • Published Aug 27, 2025 • 8
Step-DeepResearch Technical Report

Paper • 2512.20491 • Published Dec 23, 2025 • 87
Adaptation of Agentic AI

Paper • 2512.16301 • Published Dec 18, 2025 • 108
Nested Browser-Use Learning for Agentic Information Seeking

Paper • 2512.23647 • Published Dec 29, 2025 • 19
Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

Paper • 2512.24873 • Published Dec 31, 2025 • 108
Close the Loop: Synthesizing Infinite Tool-Use Data via Multi-Agent Role-Playing

Paper • 2512.23611 • Published Dec 29, 2025 • 6
Klear-AgentForge: Forging Agentic Intelligence through Posttraining Scaling

Paper • 2511.05951 • Published Nov 8, 2025 • 1
AgentMath: Empowering Mathematical Reasoning for Large Language Models via Tool-Augmented Agent

Paper • 2512.20745 • Published Dec 23, 2025
Can We Predict Before Executing Machine Learning Agents?

Paper • 2601.05930 • Published Jan 9 • 28
SmartSearch: Process Reward-Guided Query Refinement for Search Agents

Paper • 2601.04888 • Published Jan 8 • 10
Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Paper • 2601.06021 • Published Jan 9 • 48
AT^2PO: Agentic Turn-based Policy Optimization via Tree Search

Paper • 2601.04767 • Published Jan 8 • 28
DocDancer: Towards Agentic Document-Grounded Information Seeking

Paper • 2601.05163 • Published Jan 8 • 7
Agentic Rubrics as Contextual Verifiers for SWE Agents

Paper • 2601.04171 • Published Jan 7 • 13
Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published Jan 18 • 204
LLM-in-Sandbox Elicits General Agentic Intelligence

Paper • 2601.16206 • Published Jan 22 • 86
Behavior Knowledge Merge in Reinforced Agentic Models

Paper • 2601.13572 • Published Jan 20 • 27
Multi-agent cooperation through in-context co-player inference

Paper • 2602.16301 • Published Feb 18 • 24
Towards a Science of AI Agent Reliability

Paper • 2602.16666 • Published Feb 18 • 16
Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization

Paper • 2512.24615 • Published Dec 31, 2025 • 119
ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning

Paper • 2602.21534 • Published Feb 25 • 25
Agentic Code Reasoning

Paper • 2603.01896 • Published Mar 2 • 10
Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 194
DREAM: Deep Research Evaluation with Agentic Metrics

Paper • 2602.18940 • Published Feb 21 • 14
ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents

Paper • 2603.18815 • Published Mar 19 • 14
Test-Driven AI Agent Definition (TDAD): Compiling Tool-Using Agents from Behavioral Specifications

Paper • 2603.08806 • Published Mar 9 • 7
Hyperagents

Paper • 2603.19461 • Published Mar 19 • 50
Terminal Agents Suffice for Enterprise Automation

Paper • 2604.00073 • Published 28 days ago • 96
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published 25 days ago • 374
ClawArena: Benchmarking AI Agents in Evolving Information Environments

Paper • 2604.04202 • Published 23 days ago • 37
FileGram: Grounding Agent Personalization in File-System Behavioral Traces

Paper • 2604.04901 • Published 22 days ago • 40
AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents

Paper • 2603.27490 • Published about 1 month ago • 18
SEVerA: Verified Synthesis of Self-Evolving Agents

Paper • 2603.25111 • Published Mar 26 • 31
AgentGL: Towards Agentic Graph Learning with LLMs via Reinforcement Learning

Paper • 2604.05846 • Published 21 days ago • 10
Qualixar OS: A Universal Operating System for AI Agent Orchestration

Paper • 2604.06392 • Published 21 days ago • 16
CodeTracer: Towards Traceable Agent States

Paper • 2604.11641 • Published 15 days ago • 38
CocoaBench: Evaluating Unified Digital Agents in the Wild

Paper • 2604.11201 • Published 15 days ago • 35
Tracing the Roots: A Multi-Agent Framework for Uncovering Data Lineage in Post-Training LLMs

Paper • 2604.10480 • Published 16 days ago • 20
SWE-AGILE: A Software Agent Framework for Efficiently Managing Dynamic Reasoning Context

Paper • 2604.11716 • Published 15 days ago • 4
Toward Autonomous Long-Horizon Engineering for ML Research

Paper • 2604.13018 • Published 14 days ago • 34
OpenGame: Open Agentic Coding for Games

Paper • 2604.18394 • Published 8 days ago • 74
Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Paper • 2604.18292 • Published 8 days ago • 80
SkillFlow:Benchmarking Lifelong Skill Discovery and Evolution for Autonomous Agents

Paper • 2604.17308 • Published 9 days ago • 22
Training LLM Agents for Spontaneous, Reward-Free Self-Evolution via World Knowledge Exploration

Paper • 2604.18131 • Published 8 days ago • 9

Upvote

Collection guide
Browse collections