Jie Zhou

Streaming 4D Visual Geometry Transformer

Jul 15, 2025
Via arXiv

LaCo: Efficient Layer-wise Compression of Visual Tokens for Multimodal Large Language Models

Jul 03, 2025
Via arXiv

Point3R: Streaming 3D Reconstruction with Explicit Spatial Pointer Memory

Jul 03, 2025
Via arXiv

SpectralAR: Spectral Autoregressive Visual Generation

Jun 12, 2025
Via arXiv

GenWorld: Towards Detecting AI-generated Real-world Simulation Videos

Jun 12, 2025
Via arXiv

UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting

Jun 11, 2025
Via arXiv

Vision Generalist Model: A Survey

Jun 11, 2025
Via arXiv

Dense Retrievers Can Fail on Simple Queries: Revealing The Granularity Dilemma of Embeddings

Jun 10, 2025
Via arXiv

MiniCPM4: Ultra-Efficient LLMs on End Devices

Jun 09, 2025
Via arXiv

FADE: Frequency-Aware Diffusion Model Factorization for Video Editing

Jun 06, 2025
Via arXiv