This dataset contains English sentences of the form "I am [demonym]", with corresponding translations into German (deu), Spanish (spa), French (fra), and Italian (it). The dataset is organized by language in the `data/` folder.
The translations were sourced from the following references:
- English https://github.com/mledoze/countries/tree/master
- French https://github.com/mledoze/countries/tree/master
- German https://deutsch.lingolia.com/en/vocabulary/laender-nationalitaeten
- Italian https://www.theintrepidguide.com/nationalities-in-italian/
- Spanish https://espanol.lingolia.com/en/vocabulary/countries
Each file contains the following columns:
| Column Name | Description |
|---|---|
| `eng` | The source sentence in English |
| `<lang>_m` | The masculine form of the translation (if applicable) |
| `<lang>_f` | The feminine form of the translation (if applicable) |
| `<lang>_n` | The neuter form of the translation (if applicable) |

Example rows (Italian):
| eng | it_m | it_f | it_n |
|---|---|---|---|
| I am Austrian. | Sono austriaco. | Sono austriaca. | |
| I am Belgian. | Sono belga. |
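
The column layout above can be read with a few lines of Python. This is a minimal sketch, not the dataset's official loader: it assumes the data is comma-separated text, and the inline `SAMPLE` string and the `gendered_forms` helper are hypothetical names introduced only for illustration. The actual files under `data/` may use a different format or delimiter.

```python
import csv
import io

# Hypothetical inline sample mirroring the Italian example row shown above.
# Assumption: comma-separated text; the real files may be stored differently.
SAMPLE = """eng,it_m,it_f,it_n
I am Austrian.,Sono austriaco.,Sono austriaca.,
"""


def gendered_forms(row, lang):
    """Return the non-empty gendered translations of one row, keyed by suffix (m, f, n)."""
    forms = {}
    for suffix in ("m", "f", "n"):
        # Missing or empty cells mean the form does not apply to this entry.
        value = (row.get(f"{lang}_{suffix}") or "").strip()
        if value:
            forms[suffix] = value
    return forms


rows = list(csv.DictReader(io.StringIO(SAMPLE)))
for row in rows:
    print(row["eng"], gendered_forms(row, "it"))
```

Skipping empty cells keeps the "if applicable" semantics of the `<lang>_m`, `<lang>_f`, and `<lang>_n` columns explicit: a form that is absent from a row is simply not returned.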