Basic template for translating comments between Greek ↔ English, English ↔ Italian, and English ↔ German. Translations are generated using prompts sent to LLMs hosted on AWS Bedrock.
- Claude 3.5
- Llama 3.3 70b
- Deepseek R1
- We fetch data from the Helsinki-NLP/europarl dataset on HuggingFace Dataset.
- Each comment (a paragraph of conversation) is sent to the LLM for translation.
- For each comment, we create a dictionary with the following keys:
from_language: The source language sent to the modelto_language: The target language to be translated intoai_response: The translated output generated by the model
- This dictionary is then sent for evaluation.
We evaluate the translations using:
- BertScorer
- Meteor_Score
- Rougue