
Xin Xu

XinXuNLPer

https://xxupiano.github.io/

AI & ML interests

NLP, Music AI

Recent Activity

upvoted a paper 3 days ago

mHC: Manifold-Constrained Hyper-Connections

Paper • 2512.24880 • Published 4 days ago • 181

upvoted a paper about 1 month ago

InnoGym: Benchmarking the Innovation Potential of AI Agents

Paper • 2512.01822 • Published Dec 1, 2025 • 35

upvoted a paper about 2 months ago

MIDI-LLM: Adapting Large Language Models for Text-to-MIDI Music Generation

Paper • 2511.03942 • Published Nov 6, 2025 • 2

upvoted a collection 2 months ago

AI Evals

Collection

1 item • Updated Oct 3, 2025 • 1

upvoted a paper 2 months ago

LightMem: Lightweight and Efficient Memory-Augmented Generation

Paper • 2510.18866 • Published Oct 21, 2025 • 111

upvoted 4 papers 3 months ago

Executable Knowledge Graphs for Replicating AI Research

Paper • 2510.17795 • Published Oct 20, 2025 • 14

When Benchmarks Age: Temporal Misalignment through Large Language Model Factuality Evaluation

Paper • 2510.07238 • Published Oct 8, 2025 • 14

OceanGym: A Benchmark Environment for Underwater Embodied Agents

Paper • 2509.26536 • Published Sep 30, 2025 • 34

BiasFreeBench: a Benchmark for Mitigating Bias in Large Language Model Responses

Paper • 2510.00232 • Published Sep 30, 2025 • 15

upvoted 2 papers 4 months ago

SteeringControl: Holistic Evaluation of Alignment Steering in LLMs

Paper • 2509.13450 • Published Sep 16, 2025 • 7

WildScore: Benchmarking MLLMs in-the-Wild Symbolic Music Reasoning

Paper • 2509.04744 • Published Sep 5, 2025 • 11

upvoted an article 4 months ago

Decoding Strategies in Large Language Models

Article • Oct 29, 2024 • 102

upvoted a paper 4 months ago

Deep Think with Confidence

Paper • 2508.15260 • Published Aug 21, 2025 • 90

upvoted an article 5 months ago

Mastering Tensor Dimensions in Transformers

Article • Jan 12, 2025 • 128

upvoted a paper 5 months ago

Persona Vectors: Monitoring and Controlling Character Traits in Language Models

Paper • 2507.21509 • Published Jul 29, 2025 • 32

upvoted 2 papers 6 months ago

Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions

Paper • 2507.05257 • Published Jul 7, 2025 • 14

M+: Extending MemoryLLM with Scalable Long-Term Memory

Paper • 2502.00592 • Published Feb 1, 2025 • 2

upvoted 2 papers 7 months ago

ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark

Paper • 2506.10960 • Published Jun 12, 2025 • 12

Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering Target Atoms

Paper • 2505.20322 • Published May 23, 2025 • 14

upvoted an article 8 months ago

What is test-time compute and how to scale it?

Article • Feb 6, 2025 • 110
