TRCaptionNet: A novel and accurate deep Turkish image captioning model with vision transformer based image encoders and deep linguistic text decoders
Paper | GitHub Repo