Mohammad Shojaei's picture

Mohammad Shojaei

mshojaei77

·

AI & ML interests

Post-training smol models for Edge devices

Recent Activity

updated a model about 1 hour ago

mshojaei77/RuEstateAgent-Gemma3N-4B

published a model about 2 hours ago

mshojaei77/RuEstateAgent-Gemma3N-4B

updated a dataset about 14 hours ago

mshojaei77/outreach-agent-sft-dataset

View all activity

Organizations

upvoted a collection 3 months ago

Persian SLMs

6 items • Updated Aug 15 • 1

upvoted an article 3 months ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

Aug 5

• 503

upvoted a paper 3 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 308

upvoted a paper 4 months ago

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 257

upvoted an article 5 months ago

Article

Easily Train Models with H100 GPUs on NVIDIA DGX Cloud

Mar 18, 2024

• 11

upvoted a collection 8 months ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 173

upvoted an article 8 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.31k

upvoted a collection 8 months ago

Leaderboards and benchmarks ✨

Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... • 91 items • Updated Feb 28 • 114

upvoted 2 articles 8 months ago

Article

The Large Language Model Course

By

•

Jan 16

• 209

Article

Merge Large Language Models with mergekit

By

•

Jan 9, 2024

• 144

upvoted 4 collections 8 months ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated Jul 21 • 544

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Jul 21 • 650

DeepSeek-R1

10 items • Updated May 29 • 807

Persian-Datasets

دیتاست‌های متنوع برای آموزش و ارزیابی مدل‌های فارسی؛ اعضا می‌توانند دیتاست‌های خود را به اشتراک بگذارند یا از منابع موجود بهره ببرند • 59 items • Updated Jan 27 • 7

upvoted an article 9 months ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 186

upvoted a collection 9 months ago

rag-chat-models

8 items • Updated Jul 12, 2024 • 1

upvoted 2 papers 9 months ago

ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning

Paper • 2501.06590 • Published Jan 11 • 11

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Paper • 2501.12380 • Published Jan 21 • 85

upvoted a collection over 1 year ago

Product Catalog Generator

Product Catalog Generator for Persian products which is hosted by Basalam • 7 items • Updated Sep 7, 2024 • 8

upvoted a collection almost 2 years ago

Open LLM Leaderboard best models ❤️‍🔥

A daily uploaded list of models with best evaluations on the LLM leaderboard: • 65 items • Updated Mar 20 • 647