3 7 13

Guangkai Xu

guangkaixu

AI & ML interests

Monocular Depth Estimation, 3D Reconstruction, Visual-Language Models

Recent Activity

upvoted a paper 25 days ago

Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality

upvoted a paper about 1 month ago

Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation

upvoted a paper 2 months ago

Emu3.5: Native Multimodal Models are World Learners

View all activity

Organizations

upvoted a paper 25 days ago

Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality

Paper • 2512.07951 • Published 26 days ago • 48

upvoted a paper about 1 month ago

Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation

Paper • 2511.20714 • Published Nov 25, 2025 • 47

upvoted a paper 2 months ago

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published Oct 30, 2025 • 108

upvoted 2 papers 4 months ago

OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling

Paper • 2509.12201 • Published Sep 15, 2025 • 105

ODYSSEY: Open-World Quadrupeds Exploration and Manipulation for Long-Horizon Tasks

Paper • 2508.08240 • Published Aug 11, 2025 • 45

upvoted a paper 6 months ago

π^3: Scalable Permutation-Equivariant Visual Geometry Learning

Paper • 2507.13347 • Published Jul 17, 2025 • 65

upvoted a paper 10 months ago

SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories

Paper • 2503.08625 • Published Mar 11, 2025 • 27

liked a Space 11 months ago

TRELLIS

🏢

4.78k

Scalable and Versatile 3D Generation from images

liked a dataset about 1 year ago

sayakpaul/nyu_depth_v2

Viewer • Updated Dec 12, 2022 • 3.75k • 1.63k • 34

updated a Space about 1 year ago

GenPercept

🏆

A Diffusion-free One-Step Visual Perception Generalist Model

updated a model about 1 year ago

guangkaixu/genpercept-models

Updated Oct 24, 2024 • 11 • 4

updated a dataset about 1 year ago

guangkaixu/genpercept-input-demo

Viewer • Updated Oct 24, 2024 • 43 • 47

updated 7 models about 1 year ago