Multi Lang Text
Data fields that can contain text in multiple languages.
Structure
Fields with this type are structured as follows:
[
  {
    "lang": "IETF BCP 47 language tag",
    "value": ""
  },
  ...
]
Each field with this type can have multiple entries, allowing for text in different languages.
- The "lang" field specifies the language code (e.g., "en" for English, "de-DE" for German).
- The "value" field contains the actual text in that language. The field can contain text and HTML markup.
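As a minimal sketch of how a consumer might read such a field, the following Python helper returns the text for a requested language tag. The helper name `get_text` and the sample data are hypothetical and not part of the format itself. The case-insensitive comparison is an assumption based on BCP 47 tags being case-insensitive.

```python
from typing import Optional

# Hypothetical sample field; the shape follows the structure described above.
name_field = [
    {"lang": "de-DE", "value": "Beispieltext"},
    {"lang": "en", "value": "Example Text"},
]


def get_text(entries: list[dict], lang: str) -> Optional[str]:
    """Return the "value" of the entry whose "lang" matches, or None if absent.

    Tags are compared case-insensitively (assumption: BCP 47 tags are
    case-insensitive, so "de-de" and "de-DE" name the same language).
    """
    wanted = lang.lower()
    for entry in entries:
        if entry.get("lang", "").lower() == wanted:
            return entry.get("value")
    return None


print(get_text(name_field, "en"))
print(get_text(name_field, "fr"))
```

Returning None for a missing language leaves the fallback policy (for example, falling back to another language) to the caller.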
Language Code
We use the IETF BCP 47 language tag format to specify languages in a structured and standardized way. This format allows for precise identification of linguistic and regional variations.
A typical language tag consists of the following components:
- ISO 639-1 Language Code: Represents the base language, e.g., "de" for German.
- ISO 3166-1 alpha-2 Country Code (Optional): Specifies the country or region variant of the language, e.g., "de-DE" for Standard German as spoken in Germany.
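The two components listed above can be separated with a few lines of Python. This is a simplified sketch, not a full BCP 47 parser: it looks only at the first two subtags and ignores scripts, variants, and any other subtags. The names `LangTag` and `split_tag` are hypothetical.

```python
from typing import NamedTuple, Optional


class LangTag(NamedTuple):
    language: str            # ISO 639-1 code, e.g. "de"
    region: Optional[str]    # ISO 3166-1 alpha-2 code, e.g. "DE", or None


def split_tag(tag: str) -> LangTag:
    """Split a tag of the form "language[-REGION]" into its components.

    Only the first two subtags are inspected; the second one is treated as
    a region only if it consists of exactly two letters.
    """
    parts = tag.split("-")
    language = parts[0].lower()
    region = None
    if len(parts) > 1 and len(parts[1]) == 2 and parts[1].isalpha():
        region = parts[1].upper()
    return LangTag(language, region)


print(split_tag("de-DE"))
print(split_tag("en"))
```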
AI-Generated Simple Language Extension
For AI-generated simplified language variants, we introduce an additional private-use extension:
"-x-simple-ai": This suffix uses the BCP 47 private-use mechanism (subtags introduced by the singleton "x"). It indicates that the content is an AI-generated simplified version of the language.
For example:
- "de-DE-x-simple-ai" refers to an AI-generated simplified variant of German.
- "en-x-simple-ai" refers to an AI-generated simplified variant of English.
This extension ensures that AI-generated simple language content is distinguishable from standard human-authored text while remaining compatible with existing BCP 47 language tag structures.
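A consumer that needs to tell AI-generated simplified text apart from standard text could check for the suffix directly. The sketch below assumes the suffix always appears at the end of the tag, as in the examples above. The function names are hypothetical.

```python
SIMPLE_AI_SUFFIX = "-x-simple-ai"


def is_simple_ai(tag: str) -> bool:
    """Return True if the tag ends with the AI-generated simple language suffix."""
    return tag.lower().endswith(SIMPLE_AI_SUFFIX)


def base_tag(tag: str) -> str:
    """Return the tag without the suffix, e.g. "de-DE-x-simple-ai" -> "de-DE"."""
    if is_simple_ai(tag):
        return tag[: -len(SIMPLE_AI_SUFFIX)]
    return tag


for tag in ("de-DE", "de-DE-x-simple-ai", "en-x-simple-ai"):
    print(tag, is_simple_ai(tag), base_tag(tag))
```

Stripping the suffix yields the tag of the underlying language, which makes it possible to pair a simplified entry with its standard counterpart.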
Example
{
  "types": [
    "schema:Thing"
  ],
  "content": {
    "schema:name": [
      {
        "lang": "de-DE",
        "value": "Beispieltext"
      },
      {
        "lang": "en",
        "value": "Example Text"
      },
      {
        "lang": "fr",
        "value": "Texte d'exemple"
      },
      {
        "lang": "de-DE-x-simple-ai",
        "value": "Einfacher Text auf Deutsch (AI-generiert)"
      }
    ]
  }
}