| Xiangxiang Chu - AI Leader | Spatial & Generative Intelligence |
Xiangxiang Chu(初祥祥)
Senior Director & Head of AMAP-ML, Alibaba Group
I lead AMAP-ML at Alibaba AMAP, a 100+ member product-facing AI team building foundation systems for spatial intelligence and generative intelligence. My research traces an arc from efficient neural architecture design to multimodal foundation models, LLM reasoning, world models, agent systems, and large-scale AI products serving hundreds of millions of users. The thread that connects all of it: making AI systems more efficient, more capable, and more broadly useful.
Featured Projects
Research Journey
Recognition
- Top 100 AI Scholars, AMiner 2023 — selected from hundreds of thousands of AI researchers worldwide
- National Science and Technology Progress First Prize, 2018 — contributed 20 invention patents
- 3 first-authored papers on PaperDigest's Most Influential Paper List: FairNAS, Twins, CPVT
- Area Chair: ICLR, NeurIPS | Senior Program Committee: AAAI, IJCAI
- 40+ domestic and 7 international invention patents
Core Technical Directions
Spatial Intelligence — Route-planning agents (MobilityBench), map-augmented geolocalization (Thinking-with-Map), autonomous-driving VLA reasoning (AutoDrive-R2), urban scene understanding, and industrial mobility systems
Generative Intelligence — Scene-text editing (FluxText), diffusion-model optimization (DCW, S2-Guidance), video virtual try-on (Eevee), 3D editing (RL3DEdit), and controllable visual effects (Omni-Effects)
Reasoning Agents — Reinforcement learning for LLM reasoning (GPG, MathForge), tree-search agent training (Tree-GRPO), agent-data co-evolution (CoEvolve), and collective skill evolution (SkillClaw)
World Models & Interactive AI — Interactive world simulation (DreamX-World), GUI world models (Code2World), and benchmarks for dynamic 4D response capabilities (Omni-WorldBench)
Multimodal Understanding — Vision-language reasoning, visual policy optimization, spatial intelligence evaluation (SpatialGenEval), multimodal in-context learning (STV)
Foundation Architectures — Frequency-aware sparse attention (FASA), unified pretraining for generation and understanding (USP), end-to-end pixel generation without VAE (EPG), and diffusion LLMs (AR-MAP)
Team & Opportunities
I lead the AMAP-ML team at Alibaba Group, a 100+ member product-facing AI team with over half recruited from top AI labs globally, including multiple Google PhD Fellowship recipients.
Our philosophy: We build systems where research quality, engineering discipline, open-source reproducibility, and product deployment reinforce each other. Every core paper ships with reproducible code, and our work directly powers products serving hundreds of millions of users.
Open Source
We maintain 30+ projects on GitHub spanning spatial intelligence, generative intelligence, reasoning agents, world models, and multimodal AI, with 10,000+ cumulative stars.
Hiring
We are always looking for talented interns, full-time researchers, and AI engineers in LLM agents, reinforcement learning, world models, multimodal learning, spatial intelligence, and generative AI. Drop me an email if interested.
Education
- M.S. in Electrical Engineering, Tsinghua University, 2012
- B.S. in Electrical Engineering, Southeast University, 2010
