13 14 274

r4dm

radm

r4dm

AI & ML interests

data science

Recent Activity

upvoted an article 27 days ago

SOTA OCR on-device with Core ML and dots.ocr

liked a model about 1 month ago

Qwen/Qwen3-Omni-30B-A3B-Thinking

liked a model about 1 month ago

Qwen/Qwen3-VL-235B-A22B-Thinking

View all activity

Organizations

upvoted an article 27 days ago

Article

SOTA OCR on-device with Core ML and dots.ocr

Oct 2

• 56

upvoted a paper about 2 months ago

Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic

Paper • 2509.01363 • Published Sep 1 • 58

upvoted a paper 5 months ago

SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

Paper • 2505.20411 • Published May 26 • 89

upvoted a collection 6 months ago

late interaction retrievers

Collection

This collection list our ColBERT like late interaction retriever models • 4 items • Updated Jul 20 • 2

upvoted 2 articles about 1 year ago

Article

ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models

•

Jul 27, 2024

• 34

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Sep 18, 2024

• 271

upvoted a paper about 1 year ago

PingPong: A Benchmark for Role-Playing Language Models with User Emulation and Multi-Model Evaluation

Paper • 2409.06820 • Published Sep 10, 2024 • 68

upvoted a collection over 1 year ago

SimPO

Collection

This collections contains a list of SimPO and baseline models. • 49 items • Updated Mar 16 • 23

upvoted an article over 1 year ago

Article

Google Search with LLM

•

May 1, 2024

• 10

upvoted a collection over 1 year ago

abliterated-v3

Collection

Latest gen of the abliterated models I've produced • 17 items • Updated Jun 3, 2024 • 131

upvoted an article over 1 year ago

Article

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 707

upvoted 2 papers over 1 year ago

Weak-to-Strong Extrapolation Expedites Alignment

Paper • 2404.16792 • Published Apr 25, 2024 • 11

Vikhr: The Family of Open-Source Instruction-Tuned Large Language Models for Russian

Paper • 2405.13929 • Published May 22, 2024 • 54

upvoted a paper about 2 years ago

Microscaling Data Formats for Deep Learning

Paper • 2310.10537 • Published Oct 16, 2023 • 8

r4dm

AI & ML interests

Recent Activity

Organizations

radm's activity

SOTA OCR on-device with Core ML and dots.ocr

ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Google Search with LLM

Uncensor any LLM with abliteration