Eugene Klimov's picture

Eugene Klimov

Slach

·

Slach

AI & ML interests

None yet

Recent Activity

updated a collection about 12 hours ago

usefull opensource models

liked a model 2 days ago

cerebras/GLM-4.6-REAP-252B-A32B-FP8

new activity 2 days ago

cerebras/GLM-4.6-REAP-252B-A32B-FP8:any plan about GLM-4.7-REAP-139B-FP8?

View all activity

Organizations

None yet

upvoted a collection 4 days ago

GigaAM

Foundational Model for Speech Recognition Tasks • 1 item • Updated Nov 26 • 2

upvoted a paper 4 days ago

Wikontic: Constructing Wikidata-Aligned, Ontology-Aware Knowledge Graphs with Large Language Models

Paper • 2512.00590 • Published about 1 month ago • 45

upvoted a paper 7 days ago

T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground

Paper • 2512.10430 • Published 19 days ago • 112

upvoted an article 15 days ago

Article

CPU Optimized Embeddings with 🤗 Optimum Intel and fastRAG

+4

Mar 15, 2024

•

14

upvoted a collection 18 days ago

Cerebras REAP

Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 19 items • Updated 10 days ago • 70

upvoted an article 24 days ago

Article

We Got Claude to Fine-Tune an Open Source LLM

26 days ago

•

544

upvoted a collection 27 days ago

Ministral 3

Mistral Ministral 3: new multimodal models in Base, Instruct, and Reasoning variants, available in 3B, 8B, and 14B sizes. • 36 items • Updated 5 days ago • 25

upvoted a collection 3 months ago

usefull opensource models

90 items • Updated about 12 hours ago • 1

upvoted a collection 4 months ago

VibeVoice

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 8 items • Updated 26 days ago • 183

upvoted a paper 4 months ago

Deep Think with Confidence

Paper • 2508.15260 • Published Aug 21 • 90

upvoted a paper 5 months ago

nablaNABLA: Neighborhood Adaptive Block-Level Attention

Paper • 2507.13546 • Published Jul 17 • 124

upvoted a paper 6 months ago

T-LoRA: Single Image Diffusion Model Customization Without Overfitting

Paper • 2507.05964 • Published Jul 8 • 119

upvoted a collection 8 months ago

Qwen3

84 items • Updated Aug 6 • 1.53k

upvoted an article 8 months ago

Article

CircleGuardBench: New Standard for Evaluating AI Moderation Models

May 7

•

59

upvoted 2 collections 8 months ago

Qwen3

Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 79 items • Updated 5 days ago • 250

Qwen 2.5 Coder

Complete collection of Code-specific model series for Qwen2.5 in bnb 4bit, 16bit and GGUF formats. • 35 items • Updated 5 days ago • 36

upvoted a paper 9 months ago

When Less is Enough: Adaptive Token Reduction for Efficient Image Representation

Paper • 2503.16660 • Published Mar 20 • 72

upvoted 3 papers 10 months ago

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published Mar 5 • 232

GHOST 2.0: generative high-fidelity one shot transfer of heads

Paper • 2502.18417 • Published Feb 25 • 67

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published Feb 20 • 174