Thie Repository is an implementation of the paper https://arxiv.org/pdf/1804.10660.pdf
Download the data VRD dataset from here.
Download pretrained embeddings from here. Put it in the data/wordvectors folder. Folder structure is shown below.
+ data
+ VRD
- json_dataset
- sg_dataset
+ wordvectors
- GoogleNews-vectors-negative300.bin
./train.sh
- All the weights will be saved to the snapshots folder
- To resume Training from any checkpoint, Use
--weight_path <path-to-model>
python inference.py --input_image <path-to-image>
python inference_batch.py --images_dir <path-to-dir>
Visual Genome Dataset, VGG BackBone, Resnet 152
This project is licensed under the MIT License