2 18 12

Zhaoye Fei

ngc7293

https://ngc7292.github.io/

AI & ML interests

NLP & Ro.

Recent Activity

authored a paper 1 day ago

InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning

commented on a paper 3 days ago

MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization

commented on a paper 3 days ago

MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization

View all activity

Organizations

authored a paper 1 day ago

InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning

Paper • 2402.06332 • Published Feb 9, 2024 • 19

commented 2 papers 3 days ago

MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization

Paper • 2601.01554 • Published 9 days ago • 52 •

MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization

Paper • 2601.01554 • Published 9 days ago • 52 •

reacted to Reality123b's post with 🤗 3 days ago

Post

2048

Happy birthday to me!!!

2 replies

authored a paper 6 days ago

MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization

Paper • 2601.01554 • Published 9 days ago • 52

upvoted a paper 6 days ago

MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization

Paper • 2601.01554 • Published 9 days ago • 52

updated a collection 6 days ago

MOSS Transcribe Diarize

Collection

A unified multimodal large language model for end-to-end speaker-attributed, time-stamped transcription. • 2 items • Updated 6 days ago • 1

submitted a paper to Daily Papers 6 days ago

MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization

Paper • 2601.01554 • Published 9 days ago • 52

upvoted 2 papers 14 days ago

DiRL: An Efficient Post-Training Framework for Diffusion Language Models

Paper • 2512.22234 • Published 21 days ago • 19

LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation

Paper • 2512.23576 • Published 15 days ago • 64

liked a Space 15 days ago

MOSS Transcribe Diarize

🏢

Transcribe audio/video files with speaker identification

upvoted 2 papers about 2 months ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published Nov 6, 2025 • 211

SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models

Paper • 2511.15605 • Published Nov 19, 2025 • 23

liked a model 2 months ago

OpenMOSS-Team/MOSS-TTSD-v0.7

Text-to-Speech • 2B • Updated Nov 11, 2025 • 839 • 15

upvoted a paper 2 months ago

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published Oct 30, 2025 • 108

upvoted a paper 3 months ago

RoboOmni: Proactive Robot Manipulation in Omni-modal Context

Paper • 2510.23763 • Published Oct 27, 2025 • 53

liked 2 datasets 3 months ago

Sylvest/libero_plus_rlds

Updated Oct 17, 2025 • 348 • 5

Sylvest/LIBERO-plus

Updated Oct 17, 2025 • 496 • 15

upvoted a paper 3 months ago

PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning

Paper • 2510.13809 • Published Oct 15, 2025 • 37

Zhaoye Fei

AI & ML interests

Recent Activity

Organizations

ngc7293's activity

MOSS Transcribe Diarize