Using the extensive PlantVillage dataset, we systematically analyzed the effects of patch sizes, image resolutions, embedding dimensions, the number of transformer blocks (depth), the number of heads ...
Predicting Categories and Ingredients of Traditional Dishes Using Deep Learning and Cross-Attention Mechanism. Open Access Library Journal, 12, 1-12. doi: 10.4236/oalib.1112846 . Image recognition and ...