Hi, can you provide the code for only implementing transformer of video caption?
Hi, can you provide the code for only implementing transformer of video caption?