locale.html   [plain text]


<html>
<head>
  <h1>
  Notes on the locale implementation.
  </h1>
</head>
<I>
prepared by Benjamin Kosnik (bkoz@redhat.com) on August 8, 2001
</I>

<p>
<h2>
1. Abstract Describes the basic locale object, including nested
classes id, facet, and the reference-counted implementation object,
class _Impl.
</h2>
<p>
</p>

<p>
<h2>
2. What the standard says
See Chapter 22 of the standard.
</h2>


<p>
<h2>
3. Problems with &quot;C&quot; locales : global locales, termination.
</h2>

<p>
The major problem is fitting an object-orientated and non-global locale
design ontop of POSIX and other relevant stanards, which include the
Single Unix (nee X/Open.)

Because POSIX falls down so completely, portibility is an issue.
<p>

<h2>
4. Design
</h2>
Class locale in non-templatized and has three distinct types nested
inside of it:

class facet
22.1.1.1.2 Class locale::facet

Facets actually implement locale functionality. For instance, a facet
called numpunct is the data objects that can be used to query for the
thousands separator is in the German locale.

Literally, a facet is strictly defined:
 - containing 
public:
  static locale::id id;

- or derived from another facet

The only other thing of interest in this class is the memory
management of facets. Each constructor of a facet class takes a
std::size_t __refs argument: if __refs == 0, the facet is deleted when
no longer used. if __refs == 1, the facet is not destroyed, even when
it is no longer reference.


class id
Provides an index for looking up specific facets.

class _Impl

<p>
<h2>
5.  Examples
</h2>

<pre>
  typedef __locale_t locale;
</pre>

More information can be found in the following testcases:
<ul>
<li> testsuite/22_locale/ctype_char_members.cc 
<li> testsuite/22_locale/ctype_wchar_t_members.cc 
</ul>

<p>
<h2>
6.  Unresolved Issues
</h2>

<ul>
	<li> locale -a displays available locales on linux

	<li> locale initialization: at what point does _S_classic,
	_S_global get initialized? Can named locales assume this
	initialization has already taken place?

	<li> document how named locales error check when filling data
	members. Ie, a fr_FR locale that doesn't have
	numpunct::truename(): does it use "true"? Or is it a blank
	string? What's the convention?

	<li> explain how locale aliasing happens. When does "de_DE"
	use "de" information? What is the rule for locales composed of
	just an ISO language code (say, "de") and locales with both an
	ISO language code and ISO country code (say, "de_DE").

	<li> what should non-required facet instantiations do?  If the
	generic implemenation is provided, then how to end-users
	provide specializations?
</ul>

<p>
<h2>
7. Acknowledgments
</h2>

<p>
<h2>
8. Bibliography / Referenced Documents
</h2>

Drepper, Ulrich, GNU libc (glibc) 2.2 manual. In particular, Chapters &quot;6. Character Set Handling&quot; and &quot;7 Locales and Internationalization&quot;

<p>
Drepper, Ulrich, Numerous, late-night email correspondence

<p>
ISO/IEC 14882:1998 Programming languages - C++

<p>
ISO/IEC 9899:1999 Programming languages - C

<p>
Langer, Angelika and Klaus Kreft, Standard C++ IOStreams and Locales, Advanced Programmer's Guide and Reference, Addison Wesley Longman, Inc. 2000

<p>
Stroustrup, Bjarne, Appendix D, The C++ Programming Language, Special Edition, Addison Wesley, Inc. 2000

<p>
System Interface Definitions, Issue 6 (IEEE Std. 1003.1-200x)
The Open Group/The Institute of Electrical and Electronics Engineers, Inc.
http://www.opennc.org/austin/docreg.html