In this paper, we propose a computational auditory scene analysis (CASA)based frontend for twomicrophone speech recognition in a car environment. One of the important issues associated with CASA is the accurate estimation of mask information for target speech separation within multiple microphone noisy speech. For such a task, the timefrequency mask information is compensated through the signaltonoise ratio resulted from a beamformer to adjust the noise quantity included in noisy speech. We evaluate the performance of an automatic speech recognition system employing a CASAbased frontend with the proposed mask compensation method. Then, we compare its performance with those employing a CASAbased frontend without mask compensation and the beamformingbased frontend. As a result, the CASAbased frontend with the proposed method achieves relative WER reductions of 26.52% and 8.57%, compared that the beamformer and a CASAbased frontend alone, respectively.