Multi Lang Text

Data fields that can contain text in multiple languages.

Structure

Fields with this type are structured as follows:

[
  {
    "lang": "IETF BCP 47 language tag",
    "value": ""
  },
  ...
]

Each field with this type can have multiple entries, allowing for text in different languages.

  • The lang field specifies the language tag (e.g., en for English, de-DE for German as used in Germany).
  • The value field contains the actual text in that language. The field can contain text and HTML markup.
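
The structure described above can be checked with a few lines of Python. This is only a minimal sketch: the function name is_multi_lang_text is hypothetical, and it assumes that an empty list and an empty value string are both acceptable, which the structure suggests but does not state.

```python
from typing import Any


def is_multi_lang_text(field: Any) -> bool:
    """Return True if `field` looks like a list of {"lang", "value"} entries."""
    if not isinstance(field, list):
        return False
    for entry in field:
        if not isinstance(entry, dict):
            return False
        lang = entry.get("lang")
        value = entry.get("value")
        # Assumption: "lang" must be a non-empty string.
        if not isinstance(lang, str) or not lang:
            return False
        # Assumption: "value" must be a string, but may be empty.
        if not isinstance(value, str):
            return False
    return True
```

The check is structural only. It does not verify that lang is a well-formed BCP 47 tag or that value contains valid HTML.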

Language Code

We use the IETF BCP 47 language tag format to specify languages in a structured and standardized way. This format allows for precise identification of linguistic and regional variations.

A typical language tag consists of the following components:

  • ISO 639-1 Language Code: Represents the base language, e.g., "de" for German.
  • ISO 3166-1 alpha-2 Country Code (Optional): Specifies the country or region variant of the language, e.g., "DE" in "de-DE" for Standard German as spoken in Germany.
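
As an illustration of the two components listed above, the following Python sketch splits a tag into its language and country parts. The names LangTag and parse_basic_tag are hypothetical. The pattern covers only the two-letter language code and optional two-letter country code described here, not the full BCP 47 grammar, and it normalizes casing because BCP 47 tags are case-insensitive.

```python
import re
from typing import NamedTuple, Optional

# Two-letter language code, optionally followed by "-" and a two-letter
# country code. Anything else (scripts, variants, private-use parts) is
# deliberately not handled by this sketch.
_BASIC_TAG = re.compile(r"([A-Za-z]{2})(?:-([A-Za-z]{2}))?")


class LangTag(NamedTuple):
    language: str
    region: Optional[str]


def parse_basic_tag(tag: str) -> Optional[LangTag]:
    """Parse tags such as "de" or "de-DE"; return None for anything else."""
    match = _BASIC_TAG.fullmatch(tag)
    if match is None:
        return None
    language = match.group(1).lower()
    region = match.group(2).upper() if match.group(2) else None
    return LangTag(language=language, region=region)
```

With this sketch, parse_basic_tag("de-DE") yields the language "de" and the region "DE", while parse_basic_tag("en") yields the language "en" and no region.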

AI-Generated Simple Language Extension

For AI-generated simplified language variants, we append an additional private-use extension to the language tag:

  • -x-simple-ai: Indicates that the content is an AI-generated, simplified-language version of the text.

For example:

  • de-DE-x-simple-ai refers to an AI-generated simplified variant of German.
  • en-x-simple-ai refers to an AI-generated simplified variant of English.

This extension ensures that AI-generated simple language content is distinguishable from standard human-authored text while remaining compatible with existing BCP 47 language tag structures.
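
A consumer that needs to tell the two kinds of content apart could separate the suffix from the base tag, roughly as in the Python sketch below. The helper name split_simple_ai is hypothetical, and the case-insensitive comparison is an assumption based on BCP 47 tags being case-insensitive.

```python
from typing import Tuple

SIMPLE_AI_SUFFIX = "-x-simple-ai"


def split_simple_ai(tag: str) -> Tuple[str, bool]:
    """Return (base_tag, is_simple_ai) for a language tag."""
    if tag.lower().endswith(SIMPLE_AI_SUFFIX):
        # Strip the suffix to recover the underlying language tag.
        return tag[: -len(SIMPLE_AI_SUFFIX)], True
    return tag, False
```

For example, split_simple_ai("de-DE-x-simple-ai") returns ("de-DE", True), and split_simple_ai("en") returns ("en", False).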

Example

{
  "types": [
    "schema:Thing"
  ],
  "content": {
    "schema:name": [
      {
        "lang": "de-DE",
        "value": "Beispieltext"
      },
      {
        "lang": "en",
        "value": "Example Text"
      },
      {
        "lang": "fr",
        "value": "Texte d'exemple"
      },
      {
        "lang": "de-DE-x-simple-ai",
        "value": "Einfacher Text auf Deutsch (AI-generiert)"
      }
    ]
  }
}
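
Reading a value from such a field usually means picking one entry by language. The Python sketch below shows one possible approach using the schema:name entries from the example. The function pick_text and its ordered list of preferred tags are assumptions for illustration, not part of the data format.

```python
from typing import Dict, List, Optional


def pick_text(entries: List[Dict[str, str]], preferred: List[str]) -> Optional[str]:
    """Return the value of the first entry matching a preferred tag, in order."""
    # Compare tags case-insensitively. If a tag occurs twice, the last entry wins.
    by_lang = {entry["lang"].lower(): entry["value"] for entry in entries}
    for tag in preferred:
        value = by_lang.get(tag.lower())
        if value is not None:
            return value
    return None


name = [
    {"lang": "de-DE", "value": "Beispieltext"},
    {"lang": "en", "value": "Example Text"},
    {"lang": "fr", "value": "Texte d'exemple"},
    {"lang": "de-DE-x-simple-ai", "value": "Einfacher Text auf Deutsch (AI-generiert)"},
]

print(pick_text(name, ["en-GB", "en"]))  # Example Text
print(pick_text(name, ["it"]))           # None
```

Passing an explicit fallback list keeps the choice with the caller. The sketch matches exact tags only, so a request for "de" alone would not find the "de-DE" entry.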