Action can be seen as a sequence of coordinated changes in limp positions. Each limb has its own geometry of possible movements. I simplify the geometry of limb positions in the following way:
Geometry is different for vision and action. For vision it makes sense to think of a 3D Cartesian space, with a viewpoint that depends on the line of sight. Action at the joints is in polar coordinates, with each joint in its own coordinate space which depends on the limb more directly connected to the waist or centre of gravity. Gravity depends neither on the line of sight nor on the current pose.
Skills as stored instructions for an information-processing system: in general I don't care whether skills are hard-wired into the brain and neural system, whether they are software, or whether they are a combination of the two. However, since my background and experience is primarily with software, I shall use that as primary analogy. I shall focus on the system design, looking at architecture and feasibility. The hardware / software question then becomes an implementation issue.
Perception of gravity is 'always on', and does not appear to require any action. It provides a vector from approximately the eyes to the centre of the earth. (For modeling convenience, we shall also assume such a vector from the waist (centre of gravity) to the centre of the earth.