Both longitudinal and cross-sectional speech databases are used in research on the development of spoken language. However, previous longitudinal speech databases (e.g., the Hamasaki and Miyata databases in the CHILDES project) were limited in recording period or number of utterances. To promote developmental research, a large-scale longitudinal infant speech database has been developed from longitudinal recordings.