This paper presents a framework proposal aiming at analysing 3D spontaneous referring gestures which occur in multimodal interactions. Our approach is co mposed by two main steps: a structural segmentation of 3D continuous gestural signal and a perception or iented interpretation stage. Both modules which already exist have now to be merged, and the whole process validated with actual trajectories.