All HF Hub posts

Kseniase 
posted an update 1 day ago
6 Comprehensive Resources on AI Coding

AI coding is moving fast, and it’s getting harder to tell what actually works. Agents, workflows, context management and many other aspects are reshaping how software gets built.

We’ve collected a set of resources to help you understand how AI coding is evolving today and what building strategies work best:

1. AI Agentic Programming: A Survey of Techniques, Challenges, and Opportunities (2508.11126)
Provides a clear taxonomy, compares agent architectures, and exposes practical gaps in tools, benchmarks, and reliability that AI coding agents now struggle with

2. Does AI-Assisted Coding Deliver? A Difference-in-Differences Study of Cursor's Impact on Software Projects (2511.04427)
This study from Carnegie Mellon University provides causal evidence that LLM agent assistants deliver short-term productivity gains but carry lasting quality costs that can slow development over time

3. A Survey of Vibe Coding with Large Language Models (2510.12399)
Turns Vibe Coding from hype into a structured field, categorizing real development workflows. It shows which models, infrastructure, tool requirements, context, and collaboration setups affect real software development outcomes

4. From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence (2511.18538) (from Chinese institutes and companies like ByteDance and Alibaba)
Compares real code LLMs, shows how training and alignment choices affect code quality and security, and connects academic benchmarks to everyday software development

5. Build Your Own Coding Agent via a Step-by-Step Workshop ⟶ https://github.com/ghuntley/how-to-build-a-coding-agent
A great guide that covers the basics of building an AI-powered coding assistant – from a chatbot to a file reader/explorer/editor and code search

6. State of AI Coding: Context, Trust, and Subagents ⟶ https://www.turingpost.com/p/aisoftwarestack
Here is our in-depth analysis of where AI coding is heading and the new directions we see today – like agent swarms and context management importance – offering an emerging playbook beyond the IDE
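
For readers unfamiliar with the difference-in-differences design named in item 2, here is a minimal sketch of the basic two-group, two-period estimator. The numbers are invented purely for illustration and are not taken from the paper, and the function name `did_estimate` is hypothetical.

```python
from statistics import mean

def did_estimate(treated_pre, treated_post, control_pre, control_post):
    """Basic 2x2 difference-in-differences:
    change in the treated group minus change in the control group."""
    treated_change = mean(treated_post) - mean(treated_pre)
    control_change = mean(control_post) - mean(control_pre)
    return treated_change - control_change

# Made-up monthly commit counts, for illustration only.
effect = did_estimate(
    treated_pre=[10, 12], treated_post=[18, 20],
    control_pre=[9, 11], control_post=[11, 13],
)
print(effect)
```

The control group's change stands in for what would have happened to the treated projects without the tool, which is the assumption the whole design rests on.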

If you like it, also subscribe to the Turing Post: https://www.turingpost.com/subscribe
daqc 
posted an update 2 days ago
Check out your 2025 Hugging Face Wrapped, a small experimental recap
hf-wrapped/2025
sanaka87 
posted an update 2 days ago
🚀 Introducing VideoCoF: Unified Video Editing with a Temporal Reasoner (Chain-of-Frames)!

We’re excited to introduce VideoCoF, a unified framework for instruction-based video editing that enables temporal reasoning and ~4× video length extrapolation, trained with only 50k video pairs. 🔥

🔍 What makes VideoCoF different?
🧠 Chain-of-Frames reasoning: mimics the human thinking process (Seeing → Reasoning → Editing) to apply edits accurately over time without external masks, ensuring physically plausible results.
📈 Strong length generalization — trained on 33-frame clips, yet supports multi-shot editing and long-video extrapolation (~4×).
🎯 Unified fine-grained editing — Object Removal, Addition, Swap, and Local Style Transfer, with instance-level & part-level, spatial-aware control.

⚡ Fast inference update
🚀 H100: ~20s / video with 4-step inference, making high-quality video editing far more practical for real-world use.

🔗 Links
📄 Paper: https://arxiv.org/abs/2512.07469
💻 Code: https://github.com/knightyxp/VideoCoF
🤗 Demo: XiangpengYang/VideoCoF
🧩 Models: XiangpengYang/VideoCoF
🌐 Project Page: https://videocof.github.io/

#VideoEditing #DiffusionModels #GenerativeAI #ComputerVision #AI
danielhanchen 
posted an update about 14 hours ago
DawnC 
posted an update about 14 hours ago
Intelligent Inpainting for Precise Creative Control 🎨✨

Transform your images with AI-powered precision! SceneWeaver delivers professional-quality image composition with intelligent background replacement and advanced object manipulation.
What's New in This Update?

🖌️ Object Replacement — Select and transform any element in your scene with natural language prompts while maintaining perfect visual consistency with surrounding content

🗑️ Object Removal — Intelligently remove unwanted objects with context-aware generation that preserves natural lighting, shadows, and scene coherence

🎯 Context-Aware Processing — Advanced inpainting technology ensures seamless integration across all regenerated regions

Core Capabilities
⚡ One-click transformation with smart subject detection, 24 curated professional backgrounds, custom scene generation through text prompts, and studio-quality results powered by BiRefNet, Stable Diffusion XL, and ControlNet Inpainting.

Current Infrastructure & Future Vision
SceneWeaver operates on ZeroGPU with dynamic resource allocation, resulting in extended processing times during peak usage. Based on community demand, I am exploring cloud deployment with dedicated GPU resources for enhanced speed and batch processing capabilities.

Active development focuses on expanding background variety, refining edge quality, and advancing toward intelligent object addition with automatic shadows and reflections—making professional image composition accessible to everyone without technical expertise.

👉 Try it here: DawnC/SceneWeaver

If SceneWeaver helps bring your creative vision to life, please give it a ❤️ — your support influences future development and infrastructure investments!

#AI #Inpainting #DeepLearning #ComputerVision #StableDiffusion #Photography
martinsu 
posted an update 3 days ago
https://huggingface.co/blog/martinsu/potus-broke-my-pipeline

How POTUS Completely Broke My Flash 2.5-Based Guardrail

Did quite a bit of deep research on this one, since it IMHO matters. At first I used this story to amuse fellow MLOps guys, but then I went deeper and was surprised.

To those who don't want to read too much, in plain English: when you give the model a high-stakes statement that clashes with what it "knows" about the world, it gets more brittle. Sometimes to a point of being unusable.

Or an even shorter version: do not clash with the model's given worldview—it will degrade to some extent.

And in practice, it means that in lower-resource languages like Latvian and Finnish (and probably others), Flash 2.5 is an unreliable guardrail model when something clashes with the model's general "worldview".

However, I'm sure this degradation applies to other languages and models as well to varying extents.

In one totally normal week of MLOps, my news summarization pipeline started failing intermittently. Nothing was changed. No deploys. No prompt edits. No model version bump (as far as I could tell). Yet the guardrail would suddenly turn into a grumpy judge and reject outputs for reasons that felt random, sometimes even contradicting itself between runs. It was the worst kind of failure: silent, flaky, and impossible to reproduce on demand.

Then I noticed the pattern: it started when one specific named entity appeared in the text: Donald Trump (and later, in tests, Bernie Sanders too).

And then down the rabbit hole I went.
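
One way to make this kind of flakiness measurable is to run the guardrail several times on the same input and record how often the verdicts agree. The following is a minimal sketch under stated assumptions: `judge` is a hypothetical callable returning "pass" or "reject", and the stub below only simulates brittleness around a placeholder trigger string. It is not the pipeline from the blog post and makes no real model call.

```python
import random
from collections import Counter
from typing import Callable

def verdict_agreement(judge: Callable[[str], str], text: str, runs: int = 20) -> float:
    """Fraction of runs that return the most common verdict (1.0 = fully stable)."""
    verdicts = Counter(judge(text) for _ in range(runs))
    return verdicts.most_common(1)[0][1] / runs

def make_stub_judge(trigger: str, seed: int = 0) -> Callable[[str], str]:
    """Hypothetical stand-in for a model call: stable unless `trigger` appears."""
    rng = random.Random(seed)

    def judge(text: str) -> str:
        if trigger in text:
            return rng.choice(["pass", "reject"])  # simulated brittleness
        return "pass"

    return judge

judge = make_stub_judge(trigger="ENTITY_X")
stable = verdict_agreement(judge, "A summary about the weather.")
flaky = verdict_agreement(judge, "A summary mentioning ENTITY_X.")
print(stable, flaky)
```

Tracking this agreement score per input over time would turn a "silent, flaky" failure into something that can be alerted on, at the cost of extra guardrail calls.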
MikeDoes 
posted an update about 23 hours ago
Making LLMs fast with KV-cache sharing is great. A new paper reports it's also a huge privacy risk.

That's why we're excited to see the "SafeKV" paper from researchers at the University of Connecticut, Peking University, and others. Their solution-oriented framework selectively shares non-sensitive data while isolating PII. To validate the "Safe" part of their system, they needed a robust, multilingual privacy benchmark.

We're proud that the Ai4Privacy pii-masking dataset was used for this critical evaluation related to privacy.

This is a perfect win-win. Our open-source data enables researchers to build and validate more effective security solutions for core AI infrastructure. Their work, in turn, helps make the entire LLM ecosystem safer, showing that performance and privacy don't have to be mutually exclusive.

Kudos to Kexin Chu, Zecheng Lin, Dawei Xiang, 沈子旭, Jianchang Su, cheng chu, Yiwei Yang, Wenhui Zhang, Wenfei Wu, and Wei Zhang on this beautiful work.

🔗 Check out their paper to see the future of secure, high-performance LLM inference: https://arxiv.org/pdf/2508.08438
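
To make the idea of selective sharing concrete, here is a toy sketch of a cache that shares entries for non-sensitive prompts and isolates sensitive ones per user. It is not the SafeKV implementation: real KV caches hold attention tensors keyed by token prefixes, whereas this example uses plain strings, and the e-mail regex is an assumed stand-in for a proper PII detector. All names are hypothetical.

```python
import re
from typing import Dict, Optional, Tuple

# Toy PII detector (assumption): flags e-mail addresses only.
EMAIL = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")

class SelectiveCache:
    """Shares entries for non-sensitive prompts; isolates sensitive ones per user."""

    def __init__(self) -> None:
        self.shared: Dict[str, str] = {}
        self.private: Dict[Tuple[str, str], str] = {}

    def put(self, user: str, prompt: str, kv_state: str) -> None:
        if EMAIL.search(prompt):
            self.private[(user, prompt)] = kv_state  # visible to this user only
        else:
            self.shared[prompt] = kv_state  # reusable across users

    def get(self, user: str, prompt: str) -> Optional[str]:
        if (user, prompt) in self.private:
            return self.private[(user, prompt)]
        return self.shared.get(prompt)

cache = SelectiveCache()
cache.put("alice", "Summarize this article", "kv-1")
cache.put("alice", "Email bob@example.com about the invoice", "kv-2")

print(cache.get("bob", "Summarize this article"))
print(cache.get("bob", "Email bob@example.com about the invoice"))
print(cache.get("alice", "Email bob@example.com about the invoice"))
```

The second lookup misses on purpose: another user cannot hit (or time) a cache entry derived from a prompt that contained PII.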

#OpenSource
#DataPrivacy
#LLM
#Anonymization
#AIsecurity
#HuggingFace
#Ai4Privacy
#Worldslargestopensourceprivacymaskingdataset
unmodeled-tyler 
posted an update about 24 hours ago
New Preview Model: unmodeled-tyler/vanta-research-loux-preview

VANTA Research is excited to announce a small lab preview of our new 675B fine tune, Loux-Large. Loux is an AI model with a sophisticated, rebellious edge designed to assist and collaborate with engineers, builders, and people working on technical projects.

If you enjoy working with Loux and would like full access, let us know by liking the space or opening a discussion in the community!
nicolay-r 
posted an update 1 day ago
📢 For those who are interested in applying LLMs to infer over iterators of data with CoT / prompts, this update might be relevant. Delighted to share the new release of bulk-chain, a framework that contributes to efficient AI querying in synthetic data generation scenarios.

🌟 bulk-chain: https://github.com/nicolay-r/bulk-chain

🔑 It features a no-strings framework for querying LLMs in various modes: sync, async, and with optional support for output streaming.
📦️ The latest 1.2.0 release updates how API parameters are outlined for inference modes.

🌟 Integration into web: https://github.com/nicolay-r/bulk-chain-web-integration
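
To illustrate the sync, async, and streaming modes mentioned above, here is a generic sketch built only on Python's `asyncio`. It does not use bulk-chain's API; `fake_llm` and the other function names are hypothetical stand-ins for a real model call.

```python
import asyncio
from typing import AsyncIterator, List

async def fake_llm(prompt: str) -> str:
    """Hypothetical stand-in for a model call."""
    await asyncio.sleep(0)  # yield control, as a network call would
    return f"answer to: {prompt}"

def query_sync(prompts: List[str]) -> List[str]:
    """Sync mode: one prompt at a time, blocking until each finishes."""
    return [asyncio.run(fake_llm(p)) for p in prompts]

async def query_async(prompts: List[str]) -> List[str]:
    """Async mode: all prompts in flight concurrently; order is preserved."""
    return list(await asyncio.gather(*(fake_llm(p) for p in prompts)))

async def query_stream(prompt: str) -> AsyncIterator[str]:
    """Streaming mode: yield the answer in word-sized chunks."""
    for word in (await fake_llm(prompt)).split():
        yield word

async def collect_stream(prompt: str) -> List[str]:
    """Gather streamed chunks into a list (a real consumer would act on each)."""
    return [chunk async for chunk in query_stream(prompt)]

prompts = ["a", "b"]
sync_out = query_sync(prompts)
async_out = asyncio.run(query_async(prompts))
chunks = asyncio.run(collect_stream("a"))
print(sync_out, async_out, chunks)
```

For bulk data generation the async mode usually matters most, since throughput is bounded by how many requests are in flight rather than by any single response time.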
YatharthS 
posted an update 3 days ago
I just released LayaCodec, a highly efficient neural audio tokenizer/codec for TTS models, far better than most previous audio tokenizers.

🤯 Next-gen TTS models that use this could run at several hundred times real-time speed while producing clearer audio!! 🤯

GitHub repo: https://github.com/ysharma3501/LayaCodec
Model: YatharthS/LayaCodec