Non-human monsters in massively multiplayer online role-playing games (MMORPGs) are essential for immersive game environments and significantly enhance player engagement. However, producing high-quality monster sounds demands considerable time and financial resources. AI-driven voice conversion offers a potential solution, but existing models rely on standard human voice training and human-centric audio features, which limits their ability to generate realistic non-human voices. This study presents a human-to-non-human (H2NH) voice conversion model designed to address these challenges. By recording participants' voices and converting them in real time into selected monster vocalizations, the model effectively generated high-quality non-human sounds.