Foundation & Generative Models
We have used foundation models for open-vocabulary tasks (IROS23, ICRA24) and developed a vision-language model (VLM) for robotic applications (Robotic-CLIP). We have also proposed techniques for generative tasks such as scene synthesis (NeurIPS23) and dance generation (CVPR23, SIGGRAPH ASIA, ECCV24).