Linke Ouyang

ouyanglinke

AI & ML interests

None yet

Recent Activity

updated a dataset 27 days ago

ouyanglinke/OmniDocBench_tsv

Viewer • Updated 27 days ago • 981 • 120

authored 9 papers 2 months ago

InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition

Paper • 2309.15112 • Published Sep 26, 2023 • 2

Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization

Paper • 2311.16839 • Published Nov 28, 2023 • 1

InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model

Paper • 2401.16420 • Published Jan 29, 2024 • 55

MLLM-DataEngine: An Iterative Refinement Approach for MLLM

Paper • 2308.13566 • Published Aug 25, 2023 • 1

InternLM2 Technical Report

Paper • 2403.17297 • Published Mar 26, 2024 • 34

InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD

Paper • 2404.06512 • Published Apr 9, 2024 • 30

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Paper • 2407.03320 • Published Jul 3, 2024 • 95

MinerU: An Open-Source Solution for Precise Document Content Extraction

Paper • 2409.18839 • Published Sep 27, 2024 • 35

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26, 2025 • 136

liked a Space 2 months ago

MinerU OCR

📚

502

A data extraction tool to convert PDF to Markdown and JSON

liked a model 2 months ago

opendatalab/MinerU2.5-2509-1.2B

Image-Text-to-Text • 1B • Updated Sep 29 • 1.39M • 292

upvoted a paper 2 months ago

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26, 2025 • 136

updated a dataset 2 months ago

opendatalab/OmniDocBench

Viewer • Updated Sep 26 • 1.36k • 20.9k • 58

upvoted a paper 7 months ago

Shifting AI Efficiency From Model-Centric to Data-Centric Compression

Paper • 2505.19147 • Published May 25, 2025 • 144

published a dataset 10 months ago

ouyanglinke/OmniDocBench_tsv

Viewer • Updated 27 days ago • 981 • 120

liked a dataset 12 months ago

opendatalab/OmniDocBench

Viewer • Updated Sep 26 • 1.36k • 20.9k • 58

authored a paper 12 months ago

OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations

Paper • 2412.07626 • Published Dec 10, 2024 • 28

updated a dataset 12 months ago

opendatalab/OmniDocBench

Viewer • Updated Sep 26 • 1.36k • 20.9k • 58

upvoted a paper 12 months ago

OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations

Paper • 2412.07626 • Published Dec 10, 2024 • 28
