Daily Papers

by AK and the research community

Jul 24

Submitted by

lz1001

AREX: Towards a Recursively Self-Improving Agent for Deep Research

BAAI

Beijing Academy of Artificial Intelligence

Submitted by

tianlezeng

ReferTrack: Referring Then Tracking for Embodied Visual Tracking

tencent

Submitted by

lhpku20010120

K12-KGraph: A Curriculum-Aligned Knowledge Graph for Benchmarking and Training Educational LLMs

PekingUniversity

Peking University

Submitted by

joliang17

Visual Contrastive Self-Distillation

UMCP

University of Maryland College Park

Submitted by

zwq2018

Show, Don't Tell: Evaluating Spatial Cognition in Generative Pixels Rather Than LLM Text

OmniAI-ZJU

Submitted by

taesiri

NVIDIA-labs OO Agents: Native Python Object-Oriented Agents

nvidia

Submitted by

taesiri

Tencent WorkBuddy Bench: A Multi-Domain Coding-Agent Benchmark with Contamination-Resistant Task Construction

tencent

Submitted by

Lyricccco

Color Pass-Through via Camera-Display Coupling

MMLab-CUHK

Submitted by

Lawrence-cj

SANA-Video 2.0: Hybrid Linear Attention with Attention Residuals for Efficient Video Generation

nvidia

Submitted by

jihoontack

LLMs Get Lost in Evolving User Intent

MicrosoftResearch

Microsoft Research

Submitted by

Lukas431

Self-Supervised Learning of Structured Dynamics from Videos

FunAILab

Fundamental AI Lab at UTN

Submitted by

taesiri

Streaming Multi-Agent Autoregressive Diffusion Model with World State Registers

5 authors

Submitted by

gouc

Sample-Efficient Learning from Agent Experience

5 authors

Submitted by

taesiri

Robostral Navigate

mistralai

Submitted by

baohao

Multi-Turn On-Policy Distillation with Prefix Replay

MicrosoftResearch

Microsoft Research

Submitted by

FlippyDora

Predictive Divergence Masks for LLM RL

7 authors

Submitted by

hyeoncho01

Recurrent Sinusoidal INRs for Efficient High-Fidelity Representation

3 authors

Submitted by

taesiri

TableVerse: A Large-scale Tabletop Dataset with Real-world Grounded Layouts for Generalizable Manipulation

ByteDance

Submitted by

Baolin

OpenForgeRL: Train Harness-native Agents in Any Environment

microsoft

Submitted by

BuaaCXF

FinanceComplexQA: Benchmarking Agentic Reasoning on Industrial-grade Financial Documents

Beihang

Beihang University

Submitted by

isminoula

GraphVid: Interactive Graph-Controllable Video Generation

PLAN-Lab

PLAN Lab @University of Illinois Urbana-Champaign

Submitted by

thrshr

Dataset Distillation by Influence Matching

TheHKU

Hong Kong University