Language codes - Dialects and interop

lnorth · 2 February 2026 10:03

Good morning,
Having some questions from trusts around the language codes used for language in PDS.

The communication language is indicated in ISO-639-1
Some languages do not have the ISO 639-1 codes because the standard was initially designed to represent major and primary national languages with well-established terminologies and lexicography.
The big example here is Cantonese/Mandarin is not listed, instead we have Chinese macro language only despite these being very different spoken dialects (as i understand). If the primary purpose is to inform translator choice is this correct?
Other examples we’ve had people ask about include flemish and Dari.

This then hits further issues when we start looking at ECDS reporting which uses snomed codes which differentiate between these dialects.

And then further issues again when we start using the reasonable adjustments flags API which uses translator required snomed codes (the flags api does indicate we should start ignoring PDS language when its in use but we then have an issue where all our existing data is in a different standard).

Is there any over arching guidance on how language and translator needs should be record as part of a patient record, how the different requirements work together that we could reference to add clarity here?

Thanks,

Liam

Matthew_Beswick · 10 February 2026 11:27

ISO 639-1 intentionally collapses “Chinese” into a single macrolanguage code zh and does not provide separate codes for Cantonese or Mandarin as you’ve highlighted. That’s by design for high‑level cataloging.

What to do:

For interpreter choice, don’t rely on PDS’s ISO 639-1. Record the specific spoken variety with SNOMED CT (e.g., Cantonese vs Mandarin) and prefer RA flags where present.
If you need language tags outside SNOMED, use ISO 639-3/BCP 47: yue for Cantonese, cmn for Mandarin. When a system only accepts 639-1, downcast both to zh.
Similar pattern for Dari: use SNOMED; BCP 47 prs (or fa-AF); PDS fallback fa (Persian).

Hope that helps.

lnorth · 10 February 2026 15:03

Thank you for your response @Matthew_Beswick
I would say that from the documentation/SCAL process the intent of language data and how it should be used/consumed here isn’t mega clear, particularly as it comes next to an interpreter required flag.
Its not clear to me what the use case/intent of having this high-level cataloging of languages within the PDS record is? Presumably we should be assuming the language here is more relevant for written comms or is it more for census like population reporting?

Thanks,

Liam

Matthew_Beswick · 10 February 2026 16:31

Roger that, i will feed this back with our onboarding team.

Separately you are going to see us as demographics publish a call for participation for a user research activity focused on onboarding. Please consider volunteering/feeding back when we do publish that (likely to include a post here on this forum)

The purpose of language is to support care. (not census like population reporting)

Topic		Replies	Views
Physical Activity (Apple/Google) to IM1 - SNOMED Requests API Platform fhir	2	63	3 February 2025
EMIS Partner API - FileRecord call - creating SNOMED codes API Platform emis	4	88	22 September 2025
PDS or ODS INT environments - How do we raise a Request for Change? Personal Demographics Service FHIR API pds , fhir , spine	9	53	4 March 2026
Replacing SNOMED codes in XML search API Platform emis	1	104	23 April 2025
Reasonable Adjustments - SNOMED Reference Sets Reasonable Adjustment Flag	3	275	6 December 2023

Language codes - Dialects and interop

Related topics