Marc Kovka

GPT007

AI & ML interests

None yet

Organizations

None yet

upvoted 2 articles over 1 year ago

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

Article • Jul 31, 2024 • 60

XetHub is joining Hugging Face!

Article • Aug 8, 2024 • 111

upvoted 4 collections over 1 year ago

ShieldGemma Release

A series of safety classifiers, trained on top of Gemma 2, for developers to filter inputs and outputs of their applications. • 3 items • Updated Jul 10 • 14

Gemma Scope Release

A comprehensive, open suite of sparse autoencoders for Gemma 2 2B and 9B. • 10 items • Updated Jul 10 • 18

Gemma 2 2B Release

The 2.6B parameter version of Gemma 2. • 6 items • Updated Jul 10 • 81

Playground v2.5

2 items • Updated Feb 27, 2024 • 24

upvoted a paper over 1 year ago

Qwen2-Audio Technical Report

Paper • 2407.10759 • Published Jul 15, 2024 • 62

upvoted a collection over 1 year ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 695

upvoted a paper over 1 year ago

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19, 2024 • 39

upvoted a collection over 1 year ago

DynMoE Family

DynMoE model checkpoints and paper on huggingface • 4 items • Updated Aug 19, 2024 • 4

upvoted a paper over 1 year ago

Scaling Diffusion Transformers to 16 Billion Parameters

Paper • 2407.11633 • Published Jul 16, 2024 • 26

upvoted a collection over 1 year ago

DCLM

DCLM Models + Datasets • 6 items • Updated Aug 25 • 27

upvoted a paper over 1 year ago

Training language models to follow instructions with human feedback

Paper • 2203.02155 • Published Mar 4, 2022 • 24

upvoted a collection over 1 year ago

Qwen2

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Jul 21 • 373

upvoted a paper over 1 year ago

Video Diffusion Alignment via Reward Gradients

Paper • 2407.08737 • Published Jul 11, 2024 • 49

upvoted an article over 1 year ago

Introducing Ghost 8B Beta: A Game-Changing Language Model

Article • Jul 17, 2024 • 7

upvoted a paper over 1 year ago

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 151

upvoted an article over 1 year ago

Train a Llama model from scratch

Article • Jul 29, 2024 • 56

upvoted 2 papers over 1 year ago

Vision language models are blind

Paper • 2407.06581 • Published Jul 9, 2024 • 84

Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting Region Captions

Paper • 2407.06723 • Published Jul 9, 2024 • 11