Visual and Spatial Understanding of Human-Object Interactions