Parallel Corpora Datasets for Machine Translation

Enhance your language AI model's capabilities with our Parallel Corpora datasets. These datasets provide aligned text pairs in multiple languages, making them ideal for training machine translation models, multilingual language models, and cross-lingual embeddings.

They are well suited to improving text alignment, generating accurate translations, and developing robust multilingual AI solutions. Download them to improve your model’s performance in understanding and generating text across different languages.
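To make the idea of aligned text pairs concrete, here is a minimal sketch of reading such pairs in Python. It assumes one pair per line, with source and target separated by a tab. The catalog does not state a delivery format, so this layout, the `SentencePair` type, the `read_pairs` helper, and the sample sentences are all hypothetical.

```python
import csv
import io
from typing import Iterable, Iterator, NamedTuple


class SentencePair(NamedTuple):
    """One aligned example: a source sentence and its translation."""
    source: str
    target: str


def read_pairs(lines: Iterable[str]) -> Iterator[SentencePair]:
    """Yield aligned pairs from tab-separated lines, skipping malformed rows.

    Assumed layout (not confirmed by the catalog): source<TAB>target.
    """
    # QUOTE_NONE keeps quotation marks inside sentences as literal text.
    reader = csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if len(row) != 2:
            continue  # missing or extra column: the row is not a clean pair
        source, target = (field.strip() for field in row)
        if source and target:
            yield SentencePair(source, target)


# Hypothetical English-French sample, including two unusable rows.
SAMPLE_TSV = (
    "Good morning.\tBonjour.\n"
    "Where is the station?\tOù est la gare ?\n"
    "This row has no translation\n"
    "\tLigne sans source\n"
)

pairs = list(read_pairs(io.StringIO(SAMPLE_TSV)))
for pair in pairs:
    print(f"{pair.source!r} -> {pair.target!r}")
```

Dropping rows with a missing side before training is a common precaution, because a pair with an empty source or target carries no alignment information.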

Arabic Parallel Datasets

Explore ready-to-deploy text datasets in Arabic.

15+ Datasets

Bahasa Parallel Datasets

Explore ready-to-deploy text datasets in Bahasa.

15+ Datasets

Bengali Parallel Datasets

Explore ready-to-deploy text datasets in Bengali.

15+ Datasets

Bulgarian Parallel Datasets

Explore ready-to-deploy text datasets in Bulgarian.

15+ Datasets

Chinese Parallel Datasets

Explore ready-to-deploy text datasets in Chinese.

15+ Datasets

Czech Parallel Datasets

Explore ready-to-deploy text datasets in Czech.

15+ Datasets

Danish Parallel Datasets

Explore ready-to-deploy text datasets in Danish.

15+ Datasets

Dutch Parallel Datasets

Explore ready-to-deploy text datasets in Dutch.

15+ Datasets

English Parallel Datasets

Explore ready-to-deploy text datasets in English.

15+ Datasets

Finnish Parallel Datasets

Explore ready-to-deploy text datasets in Finnish.

15+ Datasets

French Parallel Datasets

Explore ready-to-deploy text datasets in French.

15+ Datasets

German Parallel Datasets

Explore ready-to-deploy text datasets in German.

15+ Datasets

Gujarati Parallel Datasets

Explore ready-to-deploy text datasets in Gujarati.

15+ Datasets

Hindi Parallel Datasets

Explore ready-to-deploy text datasets in Hindi.

15+ Datasets

Italian Parallel Datasets

Explore ready-to-deploy text datasets in Italian.

15+ Datasets

Japanese Parallel Datasets

Explore ready-to-deploy text datasets in Japanese.

15+ Datasets

Kannada Parallel Datasets

Explore ready-to-deploy text datasets in Kannada.

15+ Datasets

Korean Parallel Datasets

Explore ready-to-deploy text datasets in Korean.

15+ Datasets

Malay Parallel Datasets

Explore ready-to-deploy text datasets in Malay.

15+ Datasets

Malayalam Parallel Datasets

Explore ready-to-deploy text datasets in Malayalam.

15+ Datasets

Marathi Parallel Datasets

Explore ready-to-deploy text datasets in Marathi.

15+ Datasets

Norwegian Parallel Datasets

Explore ready-to-deploy text datasets in Norwegian.

15+ Datasets

Odia Parallel Datasets

Explore ready-to-deploy text datasets in Odia.

15+ Datasets

Polish Parallel Datasets

Explore ready-to-deploy text datasets in Polish.

15+ Datasets

Portuguese Parallel Datasets

Explore ready-to-deploy text datasets in Portuguese.

15+ Datasets

Punjabi Parallel Datasets

Explore ready-to-deploy text datasets in Punjabi.

15+ Datasets

Romanian Parallel Datasets

Explore ready-to-deploy text datasets in Romanian.

15+ Datasets

Russian Parallel Datasets

Explore ready-to-deploy text datasets in Russian.

15+ Datasets

Spanish Parallel Datasets

Explore ready-to-deploy text datasets in Spanish.

15+ Datasets

Swedish Parallel Datasets

Explore ready-to-deploy text datasets in Swedish.

15+ Datasets

Filipino Parallel Datasets

Explore ready-to-deploy text datasets in Filipino.

15+ Datasets

Tamil Parallel Datasets

Explore ready-to-deploy text datasets in Tamil.

15+ Datasets

Telugu Parallel Datasets

Explore ready-to-deploy text datasets in Telugu.

15+ Datasets

Thai Parallel Datasets

Explore ready-to-deploy text datasets in Thai.

15+ Datasets

Turkish Parallel Datasets

Explore ready-to-deploy text datasets in Turkish.

15+ Datasets

Ukrainian Parallel Datasets

Explore ready-to-deploy text datasets in Ukrainian.

15+ Datasets

Urdu Parallel Datasets

Explore ready-to-deploy text datasets in Urdu.

15+ Datasets

Vietnamese Parallel Datasets

Explore ready-to-deploy text datasets in Vietnamese.

15+ Datasets

Train and fine-tune neural machine translation models with a multilingual parallel corpus!

Collect a custom dataset with our crowd community