From Data to Behavior: Predicting Unintended Model Behaviors Before Training Paper • 2602.04735 • Published about 14 hours ago • 10
Why Steering Works: Toward a Unified View of Language Model Parameter Dynamics Paper • 2602.02343 • Published 3 days ago • 12
Aligning Agentic World Models via Knowledgeable Experience Learning Paper • 2601.13247 • Published 17 days ago • 15
Illusions of Confidence? Diagnosing LLM Truthfulness via Neighborhood Consistency Paper • 2601.05905 • Published 27 days ago • 18
How Do Large Language Models Learn Concepts During Continual Pre-Training? Paper • 2601.03570 • Published 29 days ago • 4
Can We Predict Before Executing Machine Learning Agents? Paper • 2601.05930 • Published 27 days ago • 26
InnoGym: Benchmarking the Innovation Potential of AI Agents Paper • 2512.01822 • Published Dec 1, 2025 • 36
Unveiling the Pitfalls of Knowledge Editing for Large Language Models Paper • 2310.02129 • Published Oct 3, 2023
Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity Paper • 2310.07521 • Published Oct 11, 2023
LLMs for Knowledge Graph Construction and Reasoning: Recent Capabilities and Future Opportunities Paper • 2305.13168 • Published May 22, 2023
Editing Large Language Models: Problems, Methods, and Opportunities Paper • 2305.13172 • Published May 22, 2023 • 1
KnowPrompt: Knowledge-aware Prompt-tuning with Synergistic Optimization for Relation Extraction Paper • 2104.07650 • Published Apr 15, 2021 • 2