SAKSRI PRASERTSANG

OPEN-GPT-OSS

http://microsoft.ai/

iNTERTECHNUMBERLOGYTHAiLAND

AI & ML interests

https://intertechnumberlogythailand.blogspot.com/2025/10/microsoftai.html

Recent Activity

upvoted an article 20 days ago

Welcome GPT OSS, the new open-source model family from OpenAI!

upvoted a paper 20 days ago

Scaling Granite Code Models to 128K Context

upvoted a paper 20 days ago

Granite Code Models: A Family of Open Foundation Models for Code Intelligence

Organizations

upvoted an article 20 days ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

Aug 5 • 503

upvoted 19 papers 20 days ago

GoldFinch: High Performance RWKV/Transformer Hybrid with Linear Pre-Fill and Extreme KV-Cache Compression

Paper • 2407.12077 • Published Jul 16, 2024 • 57

H2O Open Ecosystem for State-of-the-art Large Language Models

Paper • 2310.13012 • Published Oct 17, 2023 • 9

Voxtral

Paper • 2507.13264 • Published Jul 17 • 29

Mistral 7B

Paper • 2310.06825 • Published Oct 10, 2023 • 55

Pixtral 12B

Paper • 2410.07073 • Published Oct 9, 2024 • 68

H2OVL-Mississippi Vision Language Models Technical Report

Paper • 2410.13611 • Published Oct 17, 2024 • 1

Scaling Context, Not Parameters: Training a Compact 7B Language Model for Efficient Long-Context Processing

Paper • 2505.08651 • Published May 13 • 1

Mellum: Production-Grade in-IDE Contextual Code Completion with Multi-File Project Understanding

Paper • 2510.05788 • Published 28 days ago • 1

FaceChain: A Playground for Human-centric Artificial Intelligence Generated Content

Paper • 2308.14256 • Published Aug 28, 2023 • 2

Granite Embedding R2 Models

Paper • 2508.21085 • Published Aug 26 • 2

Salamandra Technical Report

Paper • 2502.08489 • Published Feb 12 • 3

TeenyTinyLlama: open-source tiny language models trained in Brazilian Portuguese

Paper • 2401.16640 • Published Jan 30, 2024 • 10

BTLM-3B-8K: 7B Parameter Performance in a 3B Parameter Model

Paper • 2309.11568 • Published Sep 20, 2023 • 11

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Paper • 2502.12982 • Published Feb 18 • 19

H2O-Danube3 Technical Report

Paper • 2407.09276 • Published Jul 12, 2024 • 20

NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment

Paper • 2405.01481 • Published May 2, 2024 • 31
