KazParC: A Comprehensive Parallel Corpus for Multilingual Machine Translation
KazParC is a large-scale parallel corpus designed to facilitate machine translation across Kazakh, English, Russian, and Turkish. The corpus was developed with the assistance of human translators and contains over 371,000 parallel sentences spanning diverse domains. The research also introduces Tilmash, a neural machine translation model that performs competitively with industry-leading translation services.