Conducted 5+ experiments benchmarking 8 VLMs using Egyptian culture-related datasets
• Evaluated models performance with 3 syntactic metrics such as BLEU, ROUGE, METEOR
• Replicated VQ2A paper method to create a 1700+ VQA dataset on Egyptian culture
• Built an efficient Model Context Protocol (MCP) image search tool using serpapi search API