Papers
arxiv:2604.20857

DiagramBank: A Large-scale Dataset of Diagram Design Exemplars with Paper Metadata for Retrieval-Augmented Generation

Published on Feb 28
· Submitted by
Leo Y
on Apr 27
Authors:
,
,
,

Abstract

A large-scale dataset of schematic diagrams called DiagramBank is introduced for multimodal retrieval and exemplar-driven scientific figure generation, addressing the gap in automated publication-grade diagram creation by existing AI scientist systems.

AI-generated summary

Recent advances in autonomous ``AI scientist'' systems have demonstrated the ability to automatically write scientific manuscripts and codes with execution. However, producing a publication-grade scientific diagram (e.g., teaser figure) is still a major bottleneck in the ``end-to-end'' paper generation process. For example, a teaser figure acts as a strategic visual interface and serves a different purpose than derivative data plots. It demands conceptual synthesis and planning to translate complex logic workflow into a compelling graphic that guides intuition and sparks curiosity. Existing AI scientist systems usually omit this component or fall back to an inferior alternative. To bridge this gap, we present DiagramBank, a large-scale dataset consisting of 89,422 schematic diagrams curated from existing top-tier scientific publications, designed for multimodal retrieval and exemplar-driven scientific figure generation. DiagramBank is developed through our automated curation pipeline that extracts figures and corresponding in-text references, and uses a CLIP-based filter to differentiate schematic diagrams from standard plots or natural images. Each instance is paired with rich context from abstract, caption, to figure-reference pairs, enabling information retrieval under different query granularities. We release DiagramBank in a ready-to-index format and provide a retrieval-augmented generation codebase to demonstrate exemplar-conditioned synthesis of teaser figures. DiagramBank is publicly available at https://huggingface.co/datasets/zhangt20/DiagramBank with code at https://github.com/csml-rpi/DiagramBank.

Community

Paper submitter

DiagramBank introduces a large-scale dataset of 89,422 schematic diagrams curated from top-tier scientific publications, designed to address a key bottleneck in autonomous "AI scientist" systems: the generation of publication-grade scientific diagrams such as teaser figures.

Unlike derivative data plots, teaser figures require conceptual synthesis and planning to translate complex logic workflows into compelling visuals, a capability that current AI scientist pipelines typically omit or handle poorly. DiagramBank tackles this through an automated curation pipeline that extracts figures along with their in-text references, and uses a CLIP-based filter to distinguish schematic diagrams from standard plots and natural images.

Each instance is paired with rich contextual metadata (abstract, caption, and figure-reference pairs), enabling multimodal information retrieval at different query granularities. The authors release the dataset in a ready-to-index format along with a retrieval-augmented generation (RAG) codebase that demonstrates exemplar-conditioned synthesis of teaser figures, paving the way for end-to-end automated scientific paper generation with high-quality visual components.

This is an automated message from the Librarian Bot. I found the following papers similar to this paper.

The following papers were recommended by the Semantic Scholar API

Please give a thumbs up to this comment if you found it helpful!

If you want recommendations for any Paper on Hugging Face checkout this Space

You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend

Sign up or log in to comment

Get this paper in your agent:

hf papers read 2604.20857
Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2604.20857 in a model README.md to link it from this page.

Datasets citing this paper 1

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2604.20857 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.