There are visual clues in the way people assemble images that you can use to guide the questions you ask. In practice, you look at the visuals in the context of what people are saying. I am isolating them here so the visual part stands on its own and you can practice seeing the elements faster.
There are no rights or wrongs here, just possible clues to identify. I am not trying to be comprehensive; I am only pointing out what popped out to me in the moment.