Nwankwo samuel's picture

Nwankwo samuel

Samexplorer

·

AI & ML interests

Multi modal

Recent Activity

upvoted a collection 11 days ago

MiroThinker-v1.0

liked a model 12 days ago

maya-research/maya1

upvoted a paper about 1 month ago

From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning

View all activity

Organizations

None yet

upvoted a collection 11 days ago

MiroThinker-v1.0

Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling • 7 items • Updated 6 days ago • 36

upvoted a paper about 1 month ago

From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning

Paper • 2509.23768 • Published Sep 28 • 48

upvoted a collection 3 months ago

MolmoAct

All models for the MolmoAct (Multimodal Open Language Model for Action) release. • 10 items • Updated 1 day ago • 30

upvoted a paper 5 months ago

VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory

Paper • 2506.18903 • Published Jun 23 • 22

upvoted a paper 7 months ago

PixelHacker: Image Inpainting with Structural and Semantic Consistency

Paper • 2504.20438 • Published Apr 29 • 45

upvoted a collection 7 months ago

Perception LM

7 items • Updated Apr 17 • 62

upvoted a paper 7 months ago

Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Paper • 2504.17192 • Published Apr 24 • 120

upvoted 2 papers 8 months ago

DreamActor-M1: Holistic, Expressive and Robust Human Image Animation with Hybrid Guidance

Paper • 2504.01724 • Published Apr 2 • 68

FirePlace: Geometric Refinements of LLM Common Sense Reasoning for 3D Object Placement

Paper • 2503.04919 • Published Mar 6 • 8

upvoted a collection 8 months ago

💫StarVector Models

StarVector is a multimodal LLM for Scalable Vector Graphics (SVG) generation, producing structured SVG code directly from images and text. • 2 items • Updated Mar 20 • 96

upvoted 3 papers 10 months ago

Emilia: A Large-Scale, Extensive, Multilingual, and Diverse Dataset for Speech Generation

Paper • 2501.15907 • Published Jan 27 • 17

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Paper • 2501.12326 • Published Jan 21 • 65

Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise

Paper • 2501.08331 • Published Jan 14 • 20

upvoted 5 papers 12 months ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published Dec 12, 2024 • 98

Agent-as-a-Judge: Evaluate Agents with Agents

Paper • 2410.10934 • Published Oct 14, 2024 • 23

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 84

SwiftEdit: Lightning Fast Text-Guided Image Editing via One-Step Diffusion

Paper • 2412.04301 • Published Dec 5, 2024 • 41

One Shot, One Talk: Whole-body Talking Avatar from a Single Image

Paper • 2412.01106 • Published Dec 2, 2024 • 24

upvoted a collection about 1 year ago

LipSync and Face Operations

22 items • Updated Aug 25 • 59

upvoted a paper about 1 year ago

LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning

Paper • 2410.02884 • Published Oct 3, 2024 • 54