Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States Paper • 2603.19987 • Published 26 days ago • 9
POLCA: Stochastic Generative Optimization with LLM Paper • 2603.14769 • Published about 1 month ago • 23