What Generative Search Engines Like and How to Optimize Web Content Cooperatively
Abstract
AutoGEO, a framework for optimizing generative engines, learns and applies preference rules to enhance content traction and search utility using large language models.
By employing large language models (LLMs) to retrieve documents and generate natural language responses, Generative Engines, such as Google AI overview and ChatGPT, provide significantly enhanced user experiences and have rapidly become the new form of search. Their rapid adoption also drives the needs of Generative Engine Optimization (GEO), as content providers are eager to gain more traction from them. In this paper, we introduce AutoGEO, a framework to automatically learn generative engine preferences when using retrieved contents for response generation, and rewrite web contents for more such traction. AutoGEO first prompts frontier LLMs to explain generative engine preferences and extract meaningful preference rules from these explanations. Then it uses preference rules as context engineering for AutoGEO_API, a prompt-based GEO system, and as rule-based rewards to train AutoGEO_Mini, a cost-effective GEO model. Experiments on the standard GEO-Bench and two newly constructed benchmarks using real user queries demonstrate the effectiveness of AutoGEO in enhancing content traction while preserving search utility. Analyses confirm the learned rules' robustness and abilities to capture unique preferences in variant domains, and AutoGEO systems' ability to embed them in content optimization. The code is released at https://github.com/cxcscmu/AutoGEO.
Community
TL;DR: AutoGEO is a framework to automatically learn generative engine preferences and rewrite web content for more traction.
Paper: https://arxiv.org/pdf/2510.11438
Code: https://github.com/cxcscmu/AutoGEO
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- Can Synthetic Query Rewrites Capture User Intent Better than Humans in Retrieval-Augmented Generation? (2025)
- Learning Contextual Retrieval for Robust Conversational Search (2025)
- LESER: Learning to Expand via Search Engine-feedback Reinforcement in e-Commerce (2025)
- QAgent: A modular Search Agent with Interactive Query Understanding (2025)
- CardRewriter: Leveraging Knowledge Cards for Long-Tail Query Rewriting on Short-Video Platforms (2025)
- DeepMMSearch-R1: Empowering Multimodal LLMs in Multimodal Web Search (2025)
- ERank: Fusing Supervised Fine-Tuning and Reinforcement Learning for Effective and Efficient Text Reranking (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper