Neural module networks
Visualquestionansweringisfundamentallycompositionalinnature—aquestionlikewhereisthedog?sharessubstructurewithquestionslikewhatcoloristhedog?andwhereisthecat?Thispaperseekstosimultaneouslyexploittherepresentationalcapacityofdeepnetworksandthecompositionallinguisticstructureofquestions.W