We present a theory for the detection and identification of pitch and voicing based on a comprehensive physiological model of auditory signal processing. Our approach builds detectors of spatially and temporally local patterns of response phase from a number of parallel channels. Using these detectors, we have designed computationally simple, physiologically reasonable algorithms for pitch and voicing detection that are robust in noise and selective for speech.