weichenfan's picture

weichenfan

weepiess2383

·

AI & ML interests

None yet

Recent Activity

authored a paper 3 days ago

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

authored a paper 3 days ago

From Pixels to Words -- Towards Native One-Vision Models at Scale

upvoted a paper 3 days ago

From Pixels to Words -- Towards Native One-Vision Models at Scale

View all activity

Organizations

upvoted 2 papers 3 days ago

From Pixels to Words -- Towards Native One-Vision Models at Scale

Paper • 2605.28820 • Published 5 days ago • 68

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Paper • 2605.12500 • Published 20 days ago • 191

upvoted a collection 3 days ago

NEO1_5

From Pixels to Words -- Towards Native One-Vision Models at Scale • 3 items • Updated 4 days ago • 6

upvoted an article 3 months ago

Article

NEO-unify: Building Native Multimodal Unified Models End to End

sensenova

•

Mar 5

• 163

upvoted a paper 4 months ago

Demo-ICL: In-Context Learning for Procedural Video Knowledge Acquisition

Paper • 2602.08439 • Published Feb 9 • 28

upvoted a paper 5 months ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published Dec 22, 2025 • 68

upvoted 3 papers about 1 year ago

VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness

Paper • 2503.21755 • Published Mar 27, 2025 • 33

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Paper • 2501.08453 • Published Jan 14, 2025 • 1

CFG-Zero*: Improved Classifier-Free Guidance for Flow Matching Models

Paper • 2503.18886 • Published Mar 24, 2025 • 24

upvoted a paper over 2 years ago

FreeU: Free Lunch in Diffusion U-Net

Paper • 2309.11497 • Published Sep 20, 2023 • 66

upvoted a paper almost 3 years ago

Link-Context Learning for Multimodal LLMs

Paper • 2308.07891 • Published Aug 15, 2023 • 17