MotionLLaMA: A Unified Framework for Motion Synthesis and Comprehension
Zeyu Ling*, Bo Han*, Shiyang Li, Hongdeng Shen, Jikang Cheng, Changqing Zou
Zhejiang University, Zhejiang Lab

💻 Project Page

📖 Introduction

This project introduces:

  • MMotion: A public library for motion-related tasks, built on MMEngine, which includes PyTorch implementations of MotionLLaMA and various other motion models.

  • MotionHub: Currently the largest open-source multimodal, multi-task motion dataset.

📜 What's New

  • 2024-12-27: Released MotionHub V2, which includes the following updates compared to the original version:
      1. Manually corrected the captions in the Fit3D, HumanSC3D, and Hi4D subsets.
      1. Manually filtered and corrected the InterHuman dataset; low-quality motion clips were removed.
      1. Removed the Chi3D dataset because its motion quality is poor.
      1. Used PoseScript to generate frame-level captions for the AIST++ and BEATV2 datasets, then used ChatGPT-4o-mini to process the frame-level captions into sentence-level captions.
      1. Used ChatGPT-4o-mini to correct the captions in the MotionX dataset with respect to the frame-level captions, since some original captions were incorrect.
      1. Defined the granularity of all captions as Macro, Meso, or Micro. Macro is the coarsest granularity and Micro is the finest.
      1. Segmented the BEATV2 dataset into clips shorter than 12 seconds and used Whisper to generate the spoken text for each clip. Each clip contains only complete sentences; a single sentence is never split across multiple clips.
      1. Removed the prelude clips in the FineDance dataset, in which the dancer holds the same pose instead of dancing, then segmented the remaining clips into clips shorter than 12 seconds. We hope this version can be more useful for the community.
  • Release the MMotion Library.
  • Release the MotionHub dataset.
  • Release the demo video.
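The V2 notes say that BEATV2 is cut into clips shorter than 12 seconds without splitting a sentence. The following is a minimal sketch of one way such grouping could work, not the authors' actual pipeline. It assumes sentence-level timestamps are already available (for example, from a speech recognizer's output); the `Sentence` type, the `group_sentences` function, and the choice to skip any single sentence that is itself too long are all hypothetical.

```python
from dataclasses import dataclass
from typing import List

MAX_CLIP_SECONDS = 12.0  # duration limit stated in the release notes


@dataclass
class Sentence:
    """Hypothetical container for one transcribed sentence."""
    start: float  # seconds from the start of the recording
    end: float    # seconds from the start of the recording
    text: str


def group_sentences(
    sentences: List[Sentence],
    max_seconds: float = MAX_CLIP_SECONDS,
) -> List[List[Sentence]]:
    """Greedily pack consecutive sentences into clips shorter than max_seconds.

    A sentence is never split. Assumption: a single sentence that is
    itself max_seconds or longer is skipped, since the source does not
    say how such cases are handled.
    """
    clips: List[List[Sentence]] = []
    current: List[Sentence] = []
    for s in sentences:
        if s.end - s.start >= max_seconds:
            # Too long to fit in any clip: close the open clip and skip it.
            if current:
                clips.append(current)
                current = []
            continue
        # Close the open clip if adding this sentence would reach the limit.
        if current and s.end - current[0].start >= max_seconds:
            clips.append(current)
            current = []
        current.append(s)
    if current:
        clips.append(current)
    return clips


example = [
    Sentence(0.0, 4.0, "a"),
    Sentence(4.5, 9.0, "b"),
    Sentence(9.5, 13.0, "c"),
    Sentence(13.5, 20.0, "d"),
]
for clip in group_sentences(example):
    print(clip[0].start, clip[-1].end, [s.text for s in clip])
```

Greedy packing keeps clips as long as the limit allows while keeping each sentence whole; a real pipeline would also need to cut the motion and audio streams at the same boundaries.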

📥 Dataset Download

| Dataset | Clip Number | Caption Number | Google Drive | Baidu Disk |
| --- | --- | --- | --- | --- |
| MotionHub V1 | 131512 | 269873 | Coming Soon | https://pan.baidu.com/s/1vuewGrtVF9PjhEIiv153pw?pwd=AIXM |
| MotionHub V2 | 142350 | 259998 | Coming Soon | https://pan.baidu.com/s/1KNc31GrwBhuqTzopqu_U7Q?pwd=AIXM |