[DDI-users] language tags

Wendy Thomas wlt at pop.umn.edu
Tue Nov 24 10:25:14 EST 2009


I've added this information to the bug contents. Its bug 315 by the way.

Wendy

On Tue, 24 Nov 2009, Joachim Wackerow wrote:

> Alerk,
>
> The intention of ALL language related elements/attributes in DDI3 is to
> use the values allowed as for xml:lang. Maybe this is not clear enough
> in the documentation and in the definition of these elements/attributes
> in the XML Schema. As Wendy said this should be reviewed.
>
> A quite instructive page about language tags in HTML and XML is
> available at W3C. This would also apply to the language values at
> several places in DDI3. The page describes the detailed structure of the
> tags when the ISO code isn't sufficient for the description of a language.
> http://www.w3.org/International/articles/language-tags/
>
> The page mentioned above is basically a description of the content of
> the official RFC 4646.
> http://tools.ietf.org/html/rfc4646
>
> The ISO three-letter codes should be used, when languages should be
> defined which have no two-letter code.
>
> When the ISO three-letter code is not sufficient, the Language Subtag
> Registry at IANA (Internet Assigned Numbers Authority) may help:
> http://www.iana.org/assignments/language-subtag-registry
>
> Search Tool by Richard Ishida:
> http://people.w3.org/rishida/utils/subtags/
>
> Hope this helps
> Achim
>
> Wendy Thomas wrote:
>> Alerk,
>>
>> I did a preliminary scan through the schemas and found multiple means of
>> capturing language at different points. There were related issues raised
>> in the Expert Workshop relating to grouping and language. I am going to
>> file this information under a bug and call for a systematic review of
>> language elements and attributes with the goal of a consistant approach
>> within DDI that will meet the needs you specified below as well as those
>> raised in Expert Workshop.
>>
>> Wendy
>>
>>
>> On Tue, 24 Nov 2009, Alerk Amin wrote:
>>
>>> Hello,
>>>
>>>   In looking at the DDI 3.1 specification, I have a question regarding
>>> language tags.  This issue doesn't cause us problems today, but might in
>>> the future.
>>>   As far as I know, the xml:lang attribute accepts IETF language tags
>>> (BCP 47 standard track).  This consists of a language tag, with
>>> additional optional tags, including region and script.  The script tag
>>> is important for us, because for one project, we have 2 translations
>>> into the same language/region, with 2 different scripts.  As most DDI
>>> elements use the xml:lang attribute to specify the language, this is
>>> fine with us.
>>>   However, the Language element from the ddi:instance:3_1 namespace
>>> does not specify what standard to use for the language.  If I look at
>>> the Language element in the ddi:reusable:3_1 namespace, it specifies
>>> only a pair of codes for language and locale (ISO 3166 for country
>>> codes, and I assume ISO 639-1 for language).  The LanguageOfData element
>>> only specifies "a 2-character ISO language code" (ISO 639-1?).  The
>>> Language element in the ddi:archive:3_1 namespace does not specify which
>>> standard to use, but its type is xs:language, so I assume I can use IETF
>>> language tags, as with xml:lang.
>>>   Is the above interpretation correct?  If not, please let me know
>>> where I am mistaken.
>>>   I think we would prefer to use IETF language tags everywhere.  We
>>> will likely use the TranslationInformation element in the future, and
>>> will will need the ability to specify the script at that time.
>>>   Thank you for your help.
>>>
>>> Best,
>>> Alerk
>>>
>>> --
>>> Alerk Amin
>>> Senior Software Developer
>>> CentERdata
>>> Room K737 (Tilburg University, Koopmans Building)
>>> Postal address   : PO Box 90153, 5000 LE  Tilburg, The Netherlands
>>> Visiting address : Warandelaan 2, 5037 AB  Tilburg, The Netherlands
>>> Telephone        : +31-13-466 2243 / 8325
>>> Fax              : +31-13-466 2764
>>> WWW              : www.centerdata.nl
>>> Disclaimer       : See http://www.centerdata.nl/maildisclaimer.
>>>
>>> _______________________________________________
>>> DDI-users mailing list
>>> DDI-users at icpsr.umich.edu
>>> http://www.icpsr.umich.edu/mailman/listinfo/ddi-users
>>>
>>
>> Wendy L. Thomas                          Phone: +1 612.624.4389
>> Data Access Core Director		 Fax:   +1 612.626.8375
>> Minnesota Population Center              Email: wlt at pop.umn.edu
>> University of Minnesota
>> 50 Willey Hall
>> 225 19th Avenue South
>> Minneapolis, MN 55455
>> _______________________________________________
>> DDI-users mailing list
>> DDI-users at icpsr.umich.edu
>> http://www.icpsr.umich.edu/mailman/listinfo/ddi-users
>
>
> -- 
> GESIS - Leibniz Institute for the Social Sciences
> Postal address: P.O. Box 122155, 68072 Mannheim, Germany
> Visiting address: B2 1, 68159 Mannheim, Germany
> Phone: +49 (0)621 1246 262
> Fax: +49 (0)621 1246 100
> E-mail: joachim.wackerow at gesis.org
> www.gesis.org/en/institute/
> _______________________________________________
> DDI-users mailing list
> DDI-users at icpsr.umich.edu
> http://www.icpsr.umich.edu/mailman/listinfo/ddi-users
>

Wendy L. Thomas                          Phone: +1 612.624.4389
Data Access Core Director		 Fax:   +1 612.626.8375
Minnesota Population Center              Email: wlt at pop.umn.edu
University of Minnesota
50 Willey Hall
225 19th Avenue South
Minneapolis, MN 55455


More information about the DDI-users mailing list