Why I think local, open-source models will eventually win.
The most useful AI applications are moving toward multi-turn agentic behavior: systems that take hundreds or even thousands of iterative steps to complete a task, e.g., Claude Code, or computer-control agents that click, type, and test repeatedly.
In these cases, what matters is not how smart the model is per token, but how quickly it can interact with its environment and tools across many steps. In that regime, model quality becomes secondary to latency.
An open-source model that can call tools quickly, check that the right thing was clicked, or verify that a code change actually passes tests can easily outperform a slightly "smarter" closed model that has to make remote API calls for every move.
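To make the tradeoff concrete, here's a back-of-envelope sketch. The step counts and per-call latencies below are illustrative assumptions, not measurements:

```python
# Back-of-envelope: total wall-clock time for an agent taking many small steps.
# All numbers here are illustrative assumptions, not benchmarks.

STEPS = 1_000                # iterative actions in one agentic task
REMOTE_LATENCY_S = 0.50      # assumed network round trip + queueing per call
LOCAL_LATENCY_S = 0.05       # assumed on-device inference per call

remote_total = STEPS * REMOTE_LATENCY_S   # 500 s, roughly 8.3 minutes
local_total = STEPS * LOCAL_LATENCY_S     # 50 s, under a minute

print(f"remote: {remote_total / 60:.1f} min, local: {local_total / 60:.1f} min")

# Even if the local model is weaker and needs ~2x the steps to finish,
# it still completes in ~1.7 minutes -- far ahead of the remote agent.
```

Under these assumptions, the local agent could afford to be several times less sample-efficient per step and still finish first.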
Eventually, the balance tips: it becomes impractical for an agent to rely on remote inference for every micro-action. Just as no one would tolerate a keyboard that required a network request per keystroke, users won't accept agent workflows bottlenecked by latency. All devices will ship with local, open-source models that are "good enough," and the expectation will shift toward everything running locally. It'll happen sooner than most people think.
Ever dreamed of training your own Large Language Model from scratch? What if I told you it doesn't require a supercomputer or a PhD in ML?
Introducing LLM Trainer - the educational framework that makes LLM training accessible to EVERYONE! Whether you're on a CPU-only laptop or scaling to distributed GPUs, we've got you covered.
Why LLM Trainer? Because existing tools are either too simplistic (hiding the magic) or too complex (requiring expert knowledge). We bridge the gap with:
- Educational transparency - every component built from scratch with clear code
- CPU-first approach - start training immediately, no GPU needed
- Full customization - modify anything you want
- Seamless scaling - from laptop to cluster without code changes
- HuggingFace integration - works with existing models & tokenizers
Key highlights:
- Built-in tokenizers (BPE, WordPiece, HF wrappers)
- Complete Transformer implementation from scratch
- Optimized for CPU training
- Advanced features: mixed precision, gradient checkpointing, multiple generation strategies
- Comprehensive monitoring & metrics
Perfect for:
- Students learning transformers
- Researchers prototyping new ideas
- Developers building domain-specific models
Ready to train your first LLM? It's easier than you think!
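A quickstart might look something like the sketch below. The module, class, and parameter names are hypothetical stand-ins for illustration, not the framework's confirmed API - check the repo for the real entry points:

```python
# Hypothetical quickstart -- names below (llm_trainer, BPETokenizer,
# TransformerLM, Trainer) are illustrative assumptions, not the real API.
from llm_trainer import BPETokenizer, TransformerLM, Trainer

# Train a small byte-pair-encoding tokenizer on your own corpus.
tokenizer = BPETokenizer(vocab_size=8_000)
tokenizer.train(files=["data/corpus.txt"])

# A deliberately tiny model so training fits on a CPU-only laptop.
model = TransformerLM(
    vocab_size=tokenizer.vocab_size,
    n_layers=4,
    n_heads=4,
    d_model=256,
    max_seq_len=512,
)

# The trainer would handle batching, checkpointing, and metrics.
trainer = Trainer(model=model, tokenizer=tokenizer, dataset="data/corpus.txt")
trainer.train(epochs=1, batch_size=8, lr=3e-4)

print(trainer.generate("Once upon a time", max_new_tokens=50))
```

The point of a config like this is that the same script should scale from a laptop-sized model to a multi-GPU run by changing the model and trainer arguments, not the code.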