Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Any plan for dataset release? #3

Open
yinkangning0124 opened this issue Jun 4, 2024 · 21 comments
Open

Any plan for dataset release? #3

yinkangning0124 opened this issue Jun 4, 2024 · 21 comments
Labels
question Further information is requested

Comments

@yinkangning0124
Copy link

Hi Linghao,
What an excellent work! I wonder is there any plan to release the dataset in this paper?
Thanks a lot,
Kangning.

@LinghaoChan
Copy link
Collaborator

Hi Linghao, What an excellent work! I wonder is there any plan to release the dataset in this paper? Thanks a lot, Kangning.

Yes. We are working on this.

@yinkangning0124
Copy link
Author

Hi Linghao, What an excellent work! I wonder is there any plan to release the dataset in this paper? Thanks a lot, Kangning.

Yes. We are working on this.

Looking forward to this ! Thanks again for your great effort.

@LinghaoChan
Copy link
Collaborator

Hi Linghao, What an excellent work! I wonder is there any plan to release the dataset in this paper? Thanks a lot, Kangning.

Yes. We are working on this.

Looking forward to this ! Thanks again for your great effort.

@yinkangning0124 Do you have any suggestions on the schedule (like ''before xxx''). Or which part of the dataset will benefit you most? Because of our schedule on releasing training codes, evaluation, benchmark, and datasets are a bit conflict, your suggestion might be helpful to arrange our time.

@yinkangning0124
Copy link
Author

Hi Linghao, What an excellent work! I wonder is there any plan to release the dataset in this paper? Thanks a lot, Kangning.

Yes. We are working on this.

Looking forward to this ! Thanks again for your great effort.

@yinkangning0124 Do you have any suggestions on the schedule (like ''before xxx''). Or which part of the dataset will benefit you most? Because of our schedule on releasing training codes, evaluation, benchmark, and datasets are a bit conflict, your suggestion might be helpful to arrange our time.

Yes, I'm quite interested in the annotated Motion-X dataset, which I think is the motion-video-text pair right? May be you can release the corresponding dataset before 20th June if possible.
Thanks again for your great effort !

@LinghaoChan
Copy link
Collaborator

Hi Linghao, What an excellent work! I wonder is there any plan to release the dataset in this paper? Thanks a lot, Kangning.

Yes. We are working on this.

Looking forward to this ! Thanks again for your great effort.

@yinkangning0124 Do you have any suggestions on the schedule (like ''before xxx''). Or which part of the dataset will benefit you most? Because of our schedule on releasing training codes, evaluation, benchmark, and datasets are a bit conflict, your suggestion might be helpful to arrange our time.

Yes, I'm quite interested in the annotated Motion-X dataset, which I think is the motion-video-text pair right? May be you can release the corresponding dataset before 20th June if possible. Thanks again for your great effort !

ok, ic! Thanks for your suggestion.

@LinghaoChan
Copy link
Collaborator

@yinkangning0124 Could you please provide more details about your demands? Because some of the data has some copyright issues, we cannot directly redistribute the data. We are considering this. If you can provide your detailed request, I will act accordingly. If it is not very convenient to detail it in public, please reach out to me via my public email.

@AQFU
Copy link

AQFU commented Jun 6, 2024

Hoping "the motion demo of MotionLLM"

@JiaweiMorris
Copy link

Hi Linghao, What an excellent work! I wonder is there any plan to release the dataset in this paper? Thanks a lot, Kangning.

Yes. We are working on this.

Looking forward to this ! Thanks again for your great effort.

@yinkangning0124 Do you have any suggestions on the schedule (like ''before xxx''). Or which part of the dataset will benefit you most? Because of our schedule on releasing training codes, evaluation, benchmark, and datasets are a bit conflict, your suggestion might be helpful to arrange our time.

Could you public the H3DQA please?

@LinghaoChan
Copy link
Collaborator

Hi Linghao, What an excellent work! I wonder is there any plan to release the dataset in this paper? Thanks a lot, Kangning.

Yes. We are working on this.

Looking forward to this ! Thanks again for your great effort.

@yinkangning0124 Do you have any suggestions on the schedule (like ''before xxx''). Or which part of the dataset will benefit you most? Because of our schedule on releasing training codes, evaluation, benchmark, and datasets are a bit conflict, your suggestion might be helpful to arrange our time.

Could you public the H3DQA please?

On the way. Thx. There are a bag of work for us to do. The priority of H3DQA might be a month later.

@LinghaoChan
Copy link
Collaborator

@yinkangning0124 We plan to release the MoVid video QA part in the coming week.

@yinkangning0124
Copy link
Author

@yinkangning0124 We plan to release the MoVid video QA part in the coming week.

Much appreciate to it !!!

@LinghaoChan LinghaoChan added the question Further information is requested label Jun 10, 2024
@LinghaoChan LinghaoChan added this to the video dataset release milestone Jun 10, 2024
@LinghaoChan
Copy link
Collaborator

We plan to announce the video part data release next Monday officially. If any other further situations, we may also release data before that time.

@LinghaoChan
Copy link
Collaborator

See dataset preview here.

@Chatonz
Copy link

Chatonz commented Jun 20, 2024

此处查看数据集预览。

The dataset looks really great. Do you have any plans to release all of it?

@LinghaoChan
Copy link
Collaborator

此处查看数据集预览。

The dataset looks really great. Do you have any plans to release all of it?

The dataset is already prepared for release. We are resolving some legal reviewing issues. Perhaps two weeks.

@AshkanTaghipour
Copy link

Hi, thank you for the novel task.
is there any plan for releasing annotated Motion-X dataset?

@LinghaoChan
Copy link
Collaborator

@AshkanTaghipour Please refer to the readme. The link is https://huggingface.co/datasets/EvanTHU/MoVid

@AshkanTaghipour
Copy link

Thank you for your prompt reply. Upon reviewing the dataset, I noticed it includes only videos without any annotated motion, correct?

@LinghaoChan
Copy link
Collaborator

Thank you for your prompt reply. Upon reviewing the dataset, I noticed it includes only videos without any annotated motion, correct?

The motion comes from Motion-X. You can download them accordingly.

@qhFang
Copy link

qhFang commented Sep 18, 2024

Thank you for your efforts. However, I noticed that the amount of data currently released differs significantly from what was claimed in the paper. The paper claims that 200k QA pairs were generated based on the Motion-X dataset, but there are only 24K QA pairs in video-QA.json. Are there any plans to release the remaining data in the future?

@LinghaoChan
Copy link
Collaborator

Thank you for your efforts. However, I noticed that the amount of data currently released differs significantly from what was claimed in the paper. The paper claims that 200k QA pairs were generated based on the Motion-X dataset, but there are only 24K QA pairs in video-QA.json. Are there any plans to release the remaining data in the future?

@qhFang The released dataset part is the captioning part. We plan to release the QA data in next month.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

7 participants