Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
peterlee6706 's Collections
WeekDaily

WeekDaily

updated Aug 31
Upvote
-

  • rStar2-Agent: Agentic Reasoning Technical Report

    Paper • 2508.20722 • Published Aug 28 • 113

  • AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

    Paper • 2508.16153 • Published Aug 22 • 154

  • Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

    Paper • 2508.14029 • Published Aug 19 • 118

  • InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

    Paper • 2508.18265 • Published Aug 25 • 201

  • AWorld: Orchestrating the Training Recipe for Agentic AI

    Paper • 2508.20404 • Published Aug 28 • 38

  • Beyond Memorization: Extending Reasoning Depth with Recurrence, Memory and Test-Time Compute Scaling

    Paper • 2508.16745 • Published Aug 22 • 28

  • Persuasion Dynamics in LLMs: Investigating Robustness and Adaptability in Knowledge and Safety with DuET-PD

    Paper • 2508.17450 • Published Aug 24 • 9

  • Jailbreaking Commercial Black-Box LLMs with Explicitly Harmful Prompts

    Paper • 2508.10390 • Published Aug 14 • 1

  • InMind: Evaluating LLMs in Capturing and Applying Individual Human Reasoning Styles

    Paper • 2508.16072 • Published Aug 22 • 4
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs