Timex Peachtree

TimexPeachtree

AI & ML interests

None yet

Organizations

None yet

upvoted a collection about 5 hours ago

Nemotron-Labs-Diffusion

Collection

A Tri-Mode Language Model Family Unifying Autoregressive, Diffusion, and Self-Speculation Decoding • 7 items • Updated 16 days ago • 50

liked a model about 5 hours ago

nvidia/Nemotron-Labs-Diffusion-3B

Text Generation • 4B • Updated 24 days ago • 60.1k • 32

upvoted a collection about 5 hours ago

Nemotron OCR and Object Detection

Collection

4 items • Updated 16 days ago • 19

upvoted an article about 5 hours ago

Article

Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent Agentic Models

nvidia • Dec 15, 2025 • 113

upvoted a paper about 5 hours ago

Fast LeWorldModel

Paper • 2606.26217 • Published 4 days ago • 22

liked a model about 5 hours ago

deepreinforce-ai/Ornith-1.0-9B

Text Generation • 1.47M • Updated 2 days ago • 1.5k • 149

liked a model about 6 hours ago

deepreinforce-ai/Ornith-1.0-9B-GGUF

Text Generation • 9B • Updated 2 days ago • 11k • 209

liked a model 5 days ago

baidu/Unlimited-OCR

Image-Text-to-Text • 3B • Updated 3 days ago • 213k • 1.11k

liked a model 6 days ago

Boogu/Boogu-Image-0.1-Base

Updated 2 days ago • 671 • 56

upvoted a paper 6 days ago

FastContext: Training Efficient Repository Explorer for Coding Agents

Paper • 2606.14066 • Published 16 days ago • 93

liked 2 datasets 8 days ago

seeklhy/SynSQL-2.5M

Updated Mar 17, 2025 • 480 • 30

xiaobing11/ACE-SQL

Viewer • Updated 12 days ago • 22.1k • 116 • 1

liked a model 8 days ago

MiniMaxAI/MiniMax-M3

Image-Text-to-Text • 427B • Updated 4 days ago • 183k • 1.25k

reacted to Jaward's post with 🔥 8 days ago

Post

9122

Our preprint is out!
We attempt to model human teaching behaviors in agents, yielding a unified framework that enables adaptive, personalized learning experiences.
LectūraAgents addresses prevailing limitations of current AI learning systems with three essential capabilities:
(1) A hierarchical multi-agent architecture modeled on academic standards. We observe that agents collaborating across hierarchies yield better personalized learning outcomes.
(2) An adaptive embodied teaching mechanism, in which the instructor agent executes visible, pedagogically motivated teaching actions (e.g. handwriting, highlighting, circling) on content in a teaching environment while speaking.
(3) A novel teaching action-speech alignment algorithm (TASA) that dynamically aligns speech with visual teaching actions. Specifically, TASA splits speech segments into word-level tokens, performs salience-heuristic analysis on the learning content (text, images, etc.), and then identifies relevant regions in which to apply pedagogical teaching actions that guide attention and augment understanding.
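
To make the alignment idea in (3) concrete, here is a toy sketch in Python. It is not the authors' TASA implementation, which the post does not specify. The names `WordToken`, `Region`, `split_segment`, and `align_actions` are hypothetical, word timings are assumed to be evenly spaced within a segment, and "salience" is reduced to simple keyword overlap between spoken words and a content region.

```python
from dataclasses import dataclass

# Assumed stopword list for the toy salience heuristic.
STOPWORDS = {"the", "a", "an", "of", "to", "and", "is", "in", "this", "we"}


@dataclass
class WordToken:
    word: str
    start: float  # seconds
    end: float    # seconds


@dataclass
class Region:
    region_id: str
    text: str


def normalize(word: str) -> str:
    """Lowercase a word and strip basic trailing/leading punctuation."""
    return word.strip(".,!?").lower()


def split_segment(text: str, start: float, end: float) -> list[WordToken]:
    """Split one speech segment into word tokens with evenly interpolated times."""
    words = text.split()
    if not words:
        return []
    step = (end - start) / len(words)
    return [
        WordToken(normalize(w), start + i * step, start + (i + 1) * step)
        for i, w in enumerate(words)
    ]


def keywords(text: str) -> set[str]:
    """Content words of a region: normalized words minus stopwords."""
    return {normalize(w) for w in text.split()} - STOPWORDS - {""}


def align_actions(tokens, regions, action="highlight"):
    """Schedule one action per region at the first spoken word that matches it.

    Returns tuples of (time, action, region_id, match_count), sorted by time.
    Regions with no matching spoken word get no action.
    """
    schedule = []
    for region in regions:
        keys = keywords(region.text)
        hits = [t for t in tokens if t.word in keys]
        if hits:
            schedule.append((hits[0].start, action, region.region_id, len(hits)))
    return sorted(schedule)


if __name__ == "__main__":
    tokens = split_segment("Gradient descent updates the weights", 0.0, 5.0)
    regions = [Region("eq1", "weights update rule"), Region("fig1", "loss surface")]
    for entry in align_actions(tokens, regions):
        print(entry)
```

A real system would presumably take word timings from a forced aligner or the TTS engine and use a richer salience model; the sketch only shows the shape of the token-to-region mapping.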

We conducted several experiments to assess these capabilities: a pedagogical evaluation of the individual components under frontier models, a comparative analysis against existing frameworks, and an efficacy study with real students.

Results show consistent gains in standard instructional metrics (curated by expert educators) spanning lecture content quality, embodied teaching quality, assessment, and personalization over baseline systems, positioning LectūraAgents as a pedagogically grounded framework for personalized learning at scale.

Paper: LectūraAgents: A Multi-Agent Framework for Adaptive Personalized AI-Assisted Learning and Embodied Teaching (2606.16428)
Data: Jaward/lectura-agents-data

liked a model 8 days ago

owensong/Inflect-Nano-v1

Text-to-Speech • Updated 4 days ago • 205

reacted to owensong's post with 🔥 8 days ago

Post

6448

I just released Inflect-Nano-v1, an ultra-small 4.63M-parameter text-to-speech model.

The main idea is simple: instead of only making the acoustic model tiny and relying on a larger external vocoder, Inflect-Nano-v1 keeps the complete text-to-waveform stack under 5M parameters.

Quick facts:
- 4.63M total inference parameters
- 3.46M acoustic model
- 1.17M vocoder
- 24 kHz audio
- English-only
- Single male voice
- Runs locally with a simple PyTorch inference script
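
As a quick check on the figures above, the two component counts sum to the stated total, and a rough weight-storage footprint follows from bytes per parameter. The sketch below is only illustrative: the precision options and the assumption that storage equals parameters times bytes per parameter (ignoring buffers and file overhead) are not from the post.

```python
# Parameter counts as stated in the post, in millions.
ACOUSTIC_M = 3.46
VOCODER_M = 1.17

# Assumed bytes per parameter for common precisions (not stated in the post).
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "int8": 1}


def total_params_m(acoustic_m: float = ACOUSTIC_M, vocoder_m: float = VOCODER_M) -> float:
    """Total inference parameters in millions, rounded to two decimals."""
    return round(acoustic_m + vocoder_m, 2)


def weight_size_mb(params_m: float, precision: str) -> float:
    """Approximate weight storage in megabytes (10**6 bytes)."""
    return round(params_m * BYTES_PER_PARAM[precision], 2)


if __name__ == "__main__":
    total = total_params_m()
    print(f"total: {total}M parameters")
    for precision in BYTES_PER_PARAM:
        print(f"{precision}: ~{weight_size_mb(total, precision)} MB")
```

Under these assumptions the full stack would need roughly 18.5 MB of weights at fp32, which is consistent with the post's framing of the model as a candidate for embedded or browser experiments.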

Why I made it:
Most modern TTS models are much larger, and even many “small TTS” projects depend on a separate vocoder. I wanted to see how far a complete tiny TTS stack could be pushed while still producing usable speech.

It is not SOTA, and I am not trying to claim it competes with large TTS systems. The interesting part is the size-to-functionality ratio.

What works:
It can generate arbitrary English speech locally, and the model is small enough to be interesting for:

- local voice assistants
- embedded/edge experiments
- browser or WASM-style TTS exploration
- efficient inference research
- tiny-model baselines

Limitations:
The quality is still limited. It can sound robotic, stumble on difficult unseen text, and the vocoder is still a clear bottleneck. Long or unusual prompts are less reliable.

So I would frame this as a research/demo release, not a production TTS engine.

I’d love feedback from people interested in:
- tiny speech models
- vocoders
- local TTS
- efficient inference
- embedded speech synthesis
- improving small-model generalization

If people find it useful, I’m interested in putting more training budget into a stronger v2.

Model page:
owensong/Inflect-Nano-v1