1 7 1

Gengze Zhou

ZGZzz

https://gengzezhou.github.io/

AI & ML interests

Embodied Ai, Vision-and-Language Navigation, Computer vision, Multimodality Learning, LLM

Recent Activity

upvoted a paper about 17 hours ago

Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation

upvoted a paper 3 days ago

Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion

upvoted a paper 3 days ago

SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts

View all activity

Organizations

None yet

ZGZzz's activity

upvoted a paper about 17 hours ago

Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation

Paper • 2412.06781 • Published 6 days ago • 17

upvoted 2 papers 3 days ago

Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion

Paper • 2412.09593 • Published 3 days ago • 15

SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts

Paper • 2412.05552 • Published 9 days ago • 3

commented a paper 3 days ago

SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts

Paper • 2412.05552 • Published 9 days ago • 3 •

upvoted a paper 3 days ago

Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel

Paper • 2412.08467 • Published 4 days ago • 3

updated a dataset about 1 month ago

ZGZzz/VersNav

Preview • Updated Nov 1 • 6

updated a model 3 months ago

ZGZzz/NavGPT2-FlanT5-XXL

Updated Sep 10

updated 2 models 4 months ago

ZGZzz/albef-text

Feature Extraction • Updated Sep 1 • 3

ZGZzz/NavGPT2-FlanT5-XL

Updated Aug 26

upvoted a paper 4 months ago

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Paper • 2408.08152 • Published Aug 15 • 52

upvoted an article 4 months ago

Article

DS-MoE: Making MoE Models More Efficient and Less Memory-Intensive

•

Apr 9

• 29

updated 2 datasets 4 months ago

ZGZzz/NavGPT-R2R

Updated Aug 12 • 153

ZGZzz/NavGPT-Instruct

Preview • Updated Aug 11 • 86

liked a dataset 4 months ago

LooksJuicy/ruozhiba

Viewer • Updated Apr 9 • 1.5k • 516 • 239

authored 4 papers 5 months ago

WebVLN: Vision-and-Language Navigation on Websites

Paper • 2312.15820 • Published Dec 25, 2023

NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation

Paper • 2402.15852 • Published Feb 24

NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models

Paper • 2305.16986 • Published May 26, 2023

NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models

Paper • 2407.12366 • Published Jul 17 • 4

upvoted a paper 5 months ago

NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models

Paper • 2407.12366 • Published Jul 17 • 4