Exploring Visual Attention Mechanism for Scene Understanding in Image Captioning