Sharath Turuvekere Sreenivas

sharathts

AI & ML interests

Learning algorithms and LLM efficiency: knowledge distillation and compression.

Recent Activity

upvoted a paper 10 days ago

Nemotron Elastic: Towards Efficient Many-in-One Reasoning LLMs

Paper • 2511.16664 • Published 11 days ago • 24

upvoted a paper 3 months ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published Aug 20 • 37

published an article 3 months ago

Supercharge Edge AI With High‑Accuracy Reasoning Using NVIDIA Nemotron Nano 2 9B

Article • Aug 18

updated 3 models 3 months ago

published 2 models 3 months ago

nvidia/NVIDIA-Nemotron-Nano-12B-v2-Base

Text Generation • 12B • Updated 27 days ago • 5.93k • 83

nvidia/NVIDIA-Nemotron-Nano-9B-v2-Base

Text Generation • 9B • Updated 27 days ago • 120k • 41

upvoted a collection 8 months ago

Nemotron-H

Collection

Mamba-Transformer hybrid models • 10 items • Updated 7 days ago • 30

authored 2 papers 8 months ago

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21, 2024 • 57

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Paper • 2504.03624 • Published Apr 4 • 15

New activity in nvidia/Llama-3.1-Minitron-4B-Width-Base about 1 year ago

Teacher correction training hyperparameters

#13 opened about 1 year ago by hjlee1371

upvoted a paper over 1 year ago

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21, 2024 • 57

authored a paper over 1 year ago

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19, 2024 • 39

upvoted a paper over 1 year ago

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19, 2024 • 39
