Configuration with XmlSerializer + codegen (i.e. xsd.exe): how "fix" PascalCase to camelCase for the XML

Monday, May 10, 2004

.NET XML

Note: this entry has moved.

Most people (including myself, as you may have noticed from my previous post on XmlSerializer-based configuration) believe an automated XSD->Classes->XmlSerializer config section handler is the optimum approach for dealing with configuration in .NET. You basically start definining your configuration in an XML Schema, then use xsd.exe or another even more cool approach, and finally get XmlSerializer-ready classes for it. Loading configuration afterwards is just a matter of deserializing it. Cool right?

Well, no matter how hard I try, the following sample config looks just awful to me (and believe me, I've created many like this one):

<SampleConfig>
  <MySetting Type="TheType, TheAssembly">
    <Enabled>True</Enabled>
    <MakeItFast>True</MakeItFast>
  </MySetting>
</SampleConfig>

And you know XAML goes the same route, using .NET naming convensions for elements and attributes...
Well, not only this looks pretty ugly (IMO) to any XML geek, but it also sort of unveils you're using XmlSerializer for loading it, and for some guys, this may seem like a less professional approach to configuration (don't ask me why). Learning yet another codegen tool just to overcome this "minor" casing issue sounded overkill for me. On the other hand, I didn't want to write all my configuration loading by hand anymore, specially once I got used to the incredibly fast refactoring that is enabled by the combination of XSD->CodeGen->XmlSerialier: all refactorings in the schema reflect on the classes, and you get inmediate compile-time errors in the code depending on old element names, or now inexistent or relocated elements. This is crucial for evolution.

Another aspect the XmlSerializer-based approach enables is a full programmatic typed configuration API you can use to write changes back to the file. And you don't have to write almost any code! Summarizing, the benefits are: straightforward loading and storing back. MAIN drawback: you have to use the XML-ugly .NET casing. Don't you?

Well, I decided I wanted the best of both worlds, as usual. I wanted camelCase-style elements and attributes, but XmlSerializer-based configuration loading and saving. And I definitely didn't want to write/use yet another codegen extension/tool to do the dirty job of adding all those XML serialization attributes to account for this "little" issue of casing.

So the answer is pretty simple once you know it: write a custom reader that turns the first letter into uppercase as it reads, and a custom writer that turns it back to lower upon saving it back. What's more, the reader should be based on the XmlTextReader so I can still layer the XmlValidatingReader once the "case normalization" happened, so the on-the-fly-PascalCased document is valid against the schema.

With the two new classes, XmlFirstUpperReader and XmlFirstLowerWriter, you read and write back XML files with the camelCase style, while having the XSD schema and the generated classes believe you have the proper PascalCase source/destination. So you can turn the previous config file into the nicer format:

<sampleConfig>
  <mySetting type="TheType, TheAssembly">
    <enabled>True</enabled>
    <makeItFast>True</makeItFast>
  </mySetting>
</sampleConfig>

Without changing your classes or schema. The trick is done when you pass the source for deserialization:

XmlFirstUpperReader fr = new XmlFirstUpperReader(config); // You should always validate your config at least with XSD XmlValidatingReader vr = new XmlValidatingReader(fr); // Add the PascalCased XSD. vr.Schemas.Add(theSchema); XmlSerializer ser = new XmlSerializer(typeof(MySetting)); MySetting settings = (MySetting) ser.Deserialize(vr);

After modifying the settings class, you can save it back into the file with the proper camelCase by using the custom writer:

MySettings settings = (MySetting) ser.Deserialize(vr); // Modify the settings at will. XmlFirstLowerWriter fw = new XmlFirstLowerWriter( new XmlTextWriter(config)); ser.Serialize(fw, settings);

Pretty straightforward, right?

Implementation

This time, I didn't "cheat" the XmlTextReader, but directly used it by inheriting it like any decent OO citizen, and overriding the members I needed to change. It turned out to be a little more work than I thought because I had to use the base reader XmlNameTable everywhere, or the XSD validation would stop working and values would be lost somehow. The XmlFirstUpperReaderhas a private method that turns a name into uppercase, and adds it to the NameTable before returning. This method uses the optimal "algorithm" discussed in my previous post:

public class XmlFirstUpperReader : XmlTextReader { #region Support all XmlTextReader ctors ...etc... #endregion Support all XmlTextReader ctors #region Private methods private string MakeFirstUpper(string name) { // Don't process empty strings. if (name.Length == 0) return name; // If the first is already upper, don't process. if (Char.IsUpper(name[0])) return name; // If there's just one char, make it lower directly. if (name.Length == 1) return name.ToUpper(); // Finally, modify and create a string. Char[] letters = name.ToCharArray(); letters[0] = Char.ToUpper(letters[0]); return NameTable.Add(new string(letters)); } ...

The XmlFirstLowerWriter implements the opposite operation:

public class XmlFirstLowerWriter : XmlWriter { internal static string MakeFirstLower(string name) { // Don't process empty strings. if (name.Length == 0) return name; // If the first is already lower, don't process. if (Char.IsLower(name[0])) return name; // If there's just one char, make it lower directly. if (name.Length == 1) return name.ToLower(); // Finally, modify and create a string. Char[] letters = name.ToCharArray(); letters[0] = Char.ToLower(letters[0]); return new string(letters); } ...

This last method is made internal and static because it doesn't depend on an XmlNameTable instance, and because it must be used by the reader in some places, as explained next. Note that both only modify the string if it's actually necessary.

The XmlTextReader properties that need to be overriden are:

this (indexer): this property is used to access attributes of the current element by name. Client code as well as the validating reader (based on the schema) will expect to access attributes by their PascalCase name. So, I need to ensure the name is camelCased before the access occurs:
LocalName: this property needs to be properly capitalized only for elements and attributes (you don't want to get the <? xml ?> directive changed!), but only if they are not namespace declarations:
Name: this property returns the qualified name of the element. That is, it can contain a prefix plus the name. The former needs to remain the same, while the later is capitalized. In addition, for namespace declaration, it returns the prefix, which we don't want to capitalize.:

On the methods side, there's only one to override:

MoveToAttribute(): movement must be done using the lower cased name, as the underlying XML uses this "version":

The XmlFirstLowerWriter implementation, in turn, is based on the XmlTextWriter one, overrides just WriteQualifiedName, WriteStartAttribute and WriteStartElement. All three methods a localName parameter which is turned into camelCase by a call to the MakeFirstLower method shown above.

The full Mvp.Xml project source code can be downloaded from SourceForge.

Enjoy and please give us feedback on the project!

+ As usual, the full code to copy-paste if you want to (remember to split in two files). #region using using System; using System.IO; using System.Xml; #endregion using namespace Mvp.Xml { /// <summary> /// Implements an <see cref="XmlTextReader"/> that turns the /// first letter of incoming elements and attributes into uppercase. /// </summary> /// <remarks> /// To be used in conjunction with <see cref="XmlFirstLowerWriter"/> for /// serialization. /// </remarks> public class XmlFirstUpperReader : XmlTextReader { #region Ctors /// <summary> /// See <see cref="XmlTextReader"/> constructor overloads. /// </summary> public XmlFirstUpperReader(Stream input) : base(input) {} /// <summary> /// See <see cref="XmlTextReader"/> constructor overloads. /// </summary> public XmlFirstUpperReader(TextReader input) : base(input) {} /// <summary> /// See <see cref="XmlTextReader"/> constructor overloads. /// </summary> public XmlFirstUpperReader(string url) : base(url) {} /// <summary> /// See <see cref="XmlTextReader"/> constructor overloads. /// </summary> public XmlFirstUpperReader(Stream input, XmlNameTable nt) : base(input, nt) {} /// <summary> /// See <see cref="XmlTextReader"/> constructor overloads. /// </summary> public XmlFirstUpperReader(TextReader input, XmlNameTable nt) : base(input, nt) {} /// <summary> /// See <see cref="XmlTextReader"/> constructor overloads. /// </summary> public XmlFirstUpperReader(string url, Stream input) : base(url, input) {} /// <summary> /// See <see cref="XmlTextReader"/> constructor overloads. /// </summary> public XmlFirstUpperReader(string url, TextReader input) : base(url, input) {} /// <summary> /// See <see cref="XmlTextReader"/> constructor overloads. /// </summary> public XmlFirstUpperReader(string url, XmlNameTable nt) : base(url, nt) {} /// <summary> /// See <see cref="XmlTextReader"/> constructor overloads. /// </summary> public XmlFirstUpperReader(Stream xmlFragment, XmlNodeType fragType, XmlParserContext context) : base(xmlFragment, fragType, context) {} /// <summary> /// See <see cref="XmlTextReader"/> constructor overloads. /// </summary> public XmlFirstUpperReader(string url, Stream input, XmlNameTable nt) : base(url, input, nt) {} /// <summary> /// See <see cref="XmlTextReader"/> constructor overloads. /// </summary> public XmlFirstUpperReader(string url, TextReader input, XmlNameTable nt) : base(url, input, nt) {} /// <summary> /// See <see cref="XmlTextReader"/> constructor overloads. /// </summary> public XmlFirstUpperReader(string xmlFragment, XmlNodeType fragType, XmlParserContext context) : base(xmlFragment, fragType, context) {} #endregion Ctors #region Private methods private string MakeFirstUpper(string name) { // Don't process empty strings. if (name.Length == 0) return name; // If the first is already upper, don't process. if (Char.IsUpper(name[0])) return name; // If there's just one char, make it lower directly. if (name.Length == 1) return name.ToUpper(); // Finally, modify and create a string. Char[] letters = name.ToCharArray(); letters[0] = Char.ToUpper(letters[0]); return NameTable.Add(new string(letters)); } #endregion Private methods #region Properties /// <summary>See <see cref="XmlReader.this"/></summary> public override string this[string name, string namespaceURI] { get { return base[ NameTable.Add(XmlFirstLowerWriter.MakeFirstLower(name)), namespaceURI]; } } /// <summary>See <see cref="XmlReader.this"/></summary> public override string this[string name] { get { return this[name, String.Empty]; } } /// <summary>See <see cref="XmlReader.LocalName"/></summary> public override string LocalName { get { // Capitalize elements and attributes. if ( base.NodeType == XmlNodeType.Element || base.NodeType == XmlNodeType.EndElement || base.NodeType == XmlNodeType.Attribute ) { return base.NamespaceURI == XmlNamespaces.XmlNs ? // Except if the attribute is a namespace declaration base.LocalName : MakeFirstUpper(base.LocalName); } return base.LocalName; } } /// <summary>See <see cref="XmlReader.Name"/></summary> public override string Name { get { // Again, if this is a NS declaration, pass as-is. if (base.NamespaceURI == XmlNamespaces.XmlNs) return base.Name; // If there's no prefix, capitalize it directly. if (base.Name.IndexOf(":") == -1) return MakeFirstUpper(base.Name); else { // Turn local name into upper, not the prefix. string name = base.Name.Substring(0, base.Name.IndexOf(":") + 1); name += MakeFirstUpper(base.Name.Substring(base.Name.IndexOf(":") + 1)); return NameTable.Add(name); } } } #endregion Properties #region Methods /// <summary>See <see cref="XmlReader.MoveToAttribute"/></summary> public override bool MoveToAttribute(string name, string ns) { return base.MoveToAttribute( NameTable.Add(XmlFirstLowerWriter.MakeFirstLower(name)), ns); } #endregion Methods } } /* NEXT FILE!!! */ #region using using System; using System.IO; using System.Text; using System.Xml; #endregion using namespace Mvp.Xml { /// <summary> /// Implements an <see cref="XmlWriter"/> that turns the /// first letter of outgoing elements and attributes into lowercase. /// </summary> /// <remarks> /// To be used in conjunction with <see cref="XmlFirstUpperReader"/>. /// </remarks> public class XmlFirstLowerWriter : XmlTextWriter { #region Fields & Ctor /// <summary> /// See <see cref="XmlTextWriter"/> ctors. /// </summary> public XmlFirstLowerWriter(TextWriter w) : base(w) { } /// <summary> /// See <see cref="XmlTextWriter"/> ctors. /// </summary> public XmlFirstLowerWriter(Stream w, Encoding encoding) : base(w, encoding) { } /// <summary> /// See <see cref="XmlTextWriter"/> ctors. /// </summary> public XmlFirstLowerWriter(string filename, Encoding encoding) : base(filename, encoding) { } #endregion Fields & Ctor #region MakeFirstLower internal static string MakeFirstLower(string name) { // Don't process empty strings. if (name.Length == 0) return name; // If the first is already lower, don't process. if (Char.IsLower(name[0])) return name; // If there's just one char, make it lower directly. if (name.Length == 1) return name.ToLower(); // Finally, modify and create a string. Char[] letters = name.ToCharArray(); letters[0] = Char.ToLower(letters[0]); return new string(letters); } #endregion MakeFirstUpper #region Methods /// <summary> /// See <see cref="XmlWriter.WriteQualifiedName"/>. /// </summary> public override void WriteQualifiedName(string localName, string ns) { base.WriteQualifiedName(MakeFirstLower(localName), ns); } /// <summary> /// See <see cref="XmlWriter.WriteStartAttribute"/>. /// </summary> public override void WriteStartAttribute(string prefix, string localName, string ns) { base.WriteStartAttribute(prefix, MakeFirstLower(localName), ns); } /// <summary> /// See <see cref="XmlWriter.WriteStartElement"/>. /// </summary> public override void WriteStartElement(string prefix, string localName, string ns) { base.WriteStartElement(prefix, MakeFirstLower(localName), ns); } #endregion Methods } }

Implementation

No Comments