Custom Dataset Format #68

Open
nadimra opened this issue May 31, 2022 · 0 comments

Hi,

I'm planning to create a custom dataset for a specific domain and train this network on it, and I have a few questions about how to go about it. I plan to structure the dataset as follows:

customDataset
├── valid
│   ├── HR
│   │   ├── Vid1
│   │   │   ├── 0.png
│   │   │   ├── ...
│   │   │   └── ***.png
│   │   ├── Vid2
│   │   ├── Vid3
│   │   └── Vid4
│   └── LR
│       ├── Vid1
│       │   ├── 0.png
│       │   ├── ...
│       │   └── ***.png
│       ├── Vid2
│       ├── Vid3
│       └── Vid4
└── test
    ├── HR
    │   ├── Vid5
    │   │   ├── 0.png
    │   │   ├── ...
    │   │   └── ***.png
    │   ├── Vid6
    │   ├── Vid7
    │   └── Vid8
    └── LR
        ├── Vid5
        │   ├── 0.png
        │   ├── ...
        │   └── ***.png
        ├── Vid6
        ├── Vid7
        └── Vid8
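A layout like this can be sanity-checked before building LMDB files. The sketch below is only an illustration: it assumes each split holds `HR` and `LR` folders with identically named video subfolders and `.png` frames, as in the tree above. The helper name `check_split` is hypothetical and is not part of the repository, and whether the frame counts must actually match is one of the open questions below.

```python
from pathlib import Path


def check_split(split_dir):
    """Return a list of layout problems for one split, e.g. customDataset/valid.

    Assumes split_dir contains HR/ and LR/, each holding one folder per video.
    """
    split_dir = Path(split_dir)
    hr_root, lr_root = split_dir / "HR", split_dir / "LR"
    hr_videos = {p.name for p in hr_root.iterdir() if p.is_dir()}
    lr_videos = {p.name for p in lr_root.iterdir() if p.is_dir()}

    problems = []
    # Videos that exist on only one side.
    for name in sorted(hr_videos ^ lr_videos):
        problems.append(f"{name}: present in only one of HR/LR")
    # Videos on both sides must have the same set of frame file names.
    for name in sorted(hr_videos & lr_videos):
        hr_frames = {p.name for p in (hr_root / name).glob("*.png")}
        lr_frames = {p.name for p in (lr_root / name).glob("*.png")}
        if hr_frames != lr_frames:
            problems.append(
                f"{name}: frame names differ "
                f"({len(hr_frames)} HR vs {len(lr_frames)} LR)"
            )
    return problems
```

Calling `check_split("customDataset/valid")` returns an empty list when the two sides mirror each other. It raises `FileNotFoundError` if `HR` or `LR` is missing.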

So my plan is:

  1. Obtain the videos, extract their frames, and place the frames in the HR folders.
  2. Use generate_mod_LR_bic.py to generate the corresponding LR frames.
  3. Use a version of create_lmdb_mp.py adjusted to my dataset to generate the LMDB files.
  4. Edit train_zsm.yml for my dataset and run train.py.
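For step 2, the arithmetic relating HR and LR frame sizes can be sketched as follows. This assumes, without confirmation from the repository, that generate_mod_LR_bic.py crops each HR frame to a multiple of the scale factor before bicubic downsampling, as its name suggests. The function names `modcrop_size` and `lr_size` are hypothetical.

```python
def modcrop_size(height, width, scale):
    """Largest size not exceeding (height, width) with both sides divisible by scale."""
    if scale < 1:
        raise ValueError("scale must be a positive integer")
    return height - height % scale, width - width % scale


def lr_size(height, width, scale):
    """LR frame size after cropping HR to a multiple of scale and downsampling."""
    cropped_h, cropped_w = modcrop_size(height, width, scale)
    return cropped_h // scale, cropped_w // scale


# Example: a 721x1283 HR frame with scale 4 is cropped to 720x1280,
# giving a 180x320 LR frame.
print(modcrop_size(721, 1283, 4))  # (720, 1280)
print(lr_size(721, 1283, 4))       # (180, 320)
```

Under that assumption, HR frames of different sizes would each yield a matching LR size. Any fixed patch or crop size in the training configuration would then need to fit within the smallest frame.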

Questions:

  • Do the HR images need to be a specific size? If they are different sizes, what would I need to change in the configuration files?
  • Since this network addresses the STVSR task, I would have expected GT (HR) frames for the intermediate frames that ZSM generates, with the LR input frames at a lower frame rate. That doesn't seem to be the case. Is this correct? Should the LR and HR folders contain the same number of files?
  • How do I train for a specific interpolation factor, if that is possible at all? Is the default interpolation factor 2?
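To make the second and third questions concrete, here is one hypothetical way a data loader could work when the HR and LR folders hold the same number of frames: it subsamples the LR frames in time at load time and keeps every HR frame as ground truth. This is a sketch of the idea only, not confirmed behavior of this repository. The function `split_clip_indices` and its `factor` parameter are assumptions made for illustration.

```python
def split_clip_indices(num_frames, factor=2):
    """Return (input_indices, target_indices) for one clip.

    input_indices: frames whose LR versions would be fed to the network.
    target_indices: all frames, whose HR versions would serve as ground truth.
    """
    if num_frames < 1 or factor < 1:
        raise ValueError("num_frames and factor must be positive integers")
    target_indices = list(range(num_frames))
    # Keep every factor-th frame as network input; the rest must be interpolated.
    input_indices = target_indices[::factor]
    return input_indices, target_indices


inputs, targets = split_clip_indices(7, factor=2)
print(inputs)   # [0, 2, 4, 6]
print(targets)  # [0, 1, 2, 3, 4, 5, 6]
```

With this scheme, a clip of `k * factor + 1` frames starts and ends on an input frame, so every generated frame lies between two real inputs.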