Violating the layer separation principle (or some bad ideas in XmlTextReader)

Wednesday, February 25, 2004

.NET XML


Most of the APIs in .NET have a layered design, most notably the IO classes. The System.Xml namespace builds on this layered design but falls short, IMO. Basically, a typical XML parsing activity involves the following layers:

Input: a System.IO.Stream implementation, such as a FileStream, BufferedStream, NetworkStream and so on, or directly a string passed to the next layer.
Basic reader: most probably a System.IO.StreamReader, or a StringReader if the previous layer is skipped.
XmlReader: the actual parser implementation. In .NET, the XML parser is the XmlTextReader class.
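To make the three layers concrete, here is a minimal sketch that stacks them by hand. The class and method names (LayerDemo, FirstElementName) are made up for illustration, and a MemoryStream stands in for whatever input you actually have.

```csharp
using System;
using System.IO;
using System.Text;
using System.Xml;

public static class LayerDemo
{
    // Returns the name of the first element found in the given XML text,
    // or null if there is none.
    public static string FirstElementName(string xml)
    {
        // Layer 1 (input): any Stream; a MemoryStream stands in for a file or socket.
        using (Stream input = new MemoryStream(Encoding.UTF8.GetBytes(xml)))
        // Layer 2 (basic reader): turns bytes into characters.
        using (TextReader text = new StreamReader(input, Encoding.UTF8))
        {
            // Layer 3 (XmlReader): the parser itself.
            XmlTextReader parser = new XmlTextReader(text);
            while (parser.Read())
            {
                if (parser.NodeType == XmlNodeType.Element)
                    return parser.Name;
            }
            return null;
        }
    }

    public static void Main()
    {
        Console.WriteLine(FirstElementName("<order><item/></order>"));
    }
}
```

Each layer only knows about the one directly below it, which is the property the rest of this entry is about.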

Maybe it's just me, but isn't there a layer missing there? The "Lexical Analyzer" or "Scanner"? Well, it turns out that it's missing only from the public API; the XmlTextReader of course uses one, its XmlScanner. Wouldn't it be cool if this layer were exposed explicitly, so that you could tell the parser which scanner to use? Imagine a scanner that could present as XML tokens some binary stuff coming from the basic reader layer... I know all the discussions about binary XML; I'm just thinking about the clever solution for SVG: SVGZ, or "zipped SVG". I don't have to tell you how good the zip algorithm is in general, but with highly redundant data such as XML (i.e. all the repeated tag names) the size reduction is really awesome.
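For the SVGZ case in particular, which is plain gzip-compressed text, the decompression can be slotted in at the input layer without touching the parser. The sketch below assumes System.IO.Compression.GZipStream is available (it is not part of .NET 1.x); SvgzDemo and its methods are hypothetical names. It does not give you a pluggable scanner, which would still be needed for input that is not simply compressed text.

```csharp
using System;
using System.IO;
using System.IO.Compression;
using System.Text;
using System.Xml;

public static class SvgzDemo
{
    // Gzip-compresses XML text, roughly what an .svgz file contains.
    public static byte[] Compress(string xml)
    {
        MemoryStream output = new MemoryStream();
        // leaveOpen = true so that 'output' survives disposing the GZipStream.
        using (GZipStream gzip = new GZipStream(output, CompressionMode.Compress, true))
        {
            byte[] raw = Encoding.UTF8.GetBytes(xml);
            gzip.Write(raw, 0, raw.Length);
        }
        return output.ToArray();
    }

    // Parses compressed XML by inserting a decompressing stream below the parser.
    public static string RootNameOfCompressed(byte[] compressed)
    {
        using (Stream input = new GZipStream(new MemoryStream(compressed), CompressionMode.Decompress))
        {
            XmlTextReader parser = new XmlTextReader(input);
            parser.MoveToContent(); // skip to the root element
            return parser.LocalName;
        }
    }

    public static void Main()
    {
        byte[] svgz = Compress("<svg xmlns=\"http://www.w3.org/2000/svg\"><rect/></svg>");
        Console.WriteLine(RootNameOfCompressed(svgz));
    }
}
```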

Back to the topic, however: the XmlTextReader violates this separation with its internal XmlScanner class. Namely, the scanner BUFFERS its reads, instead of delegating this responsibility to the appropriate layer, which already implements such functionality in the BufferedStream class. One consequence of this violation is that the stream position is no longer relevant, as you will never know how far the internal scanner has gone. Have you ever dreamed of a "ResetableReader"? You can kiss that dream goodbye for now.
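The effect is easy to observe. In this small sketch (BufferingDemo is a made-up name), the parser has only reported the root start tag, yet the underlying stream has already been read well past it. How far exactly depends on the internal buffer size, so the code only compares the two numbers.

```csharp
using System;
using System.IO;
using System.Text;
using System.Xml;

public static class BufferingDemo
{
    // Returns { stream position after the first Read(),
    //           byte offset just past the first start tag }.
    public static long[] PositionAfterFirstRead()
    {
        string xml = "<root><a>1</a><b>2</b></root>";
        MemoryStream stream = new MemoryStream(Encoding.UTF8.GetBytes(xml));

        XmlTextReader reader = new XmlTextReader(stream);
        reader.Read(); // the parser is now positioned on <root>

        // The text is pure ASCII, so character offsets equal byte offsets.
        long endOfRootTag = xml.IndexOf('>') + 1;
        return new long[] { stream.Position, endOfRootTag };
    }

    public static void Main()
    {
        long[] result = PositionAfterFirstRead();
        Console.WriteLine("Parser is just past byte " + result[1]);
        Console.WriteLine("Stream position is " + result[0]);
    }
}
```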

If the scanner didn't violate the separation, we could implement such a reader as follows:

Read until some arbitrary point.
Store current stream position.
Create a new reader to read starting from current position (one that stops reading when it finds elements "outside" its scope), and use it internally instead of advancing the "real" one.
Upon a call to a Reset() method, discard the "inner" reader and reposition the stream.

So, we could confidently hand such a reader to some arbitrary component to do whatever it has to do with the data, without risking our own positioning in the reader. This is typical in XML processing pipelines. You don't want an earlier pipeline stage to mess with the "real" reader and break processing in later ones. Similarly, if you configure components to handle processing of certain elements (for example, with the handler registration mechanism allowed by Xml Streaming Events), you don't want one handler to screw up the reader and prevent other handlers from doing their work. You could have the following syntactic sugar also:

    ResetableReader rr; // Initialize somehow
    // Do some reading
    // We're about to hand the reader to some other component
    using (rr.CreateResetPoint())
    {
        Process(rr);
    }
    // Now we're exactly where we were before entering the "using"
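The reset-point idea itself is simple when nothing above the stream reads ahead. Here is a minimal sketch at the stream layer only; StreamResetPoint and ResetDemo are hypothetical names, not existing .NET types, and this is not the ResetableReader itself. It assumes a seekable stream and would only be usable under a parser that doesn't buffer on its own.

```csharp
using System;
using System.IO;
using System.Text;

// Remembers a stream position and restores it when disposed.
public sealed class StreamResetPoint : IDisposable
{
    private readonly Stream stream;
    private readonly long savedPosition;

    public StreamResetPoint(Stream stream)
    {
        if (!stream.CanSeek)
            throw new NotSupportedException("Stream must be seekable.");
        this.stream = stream;
        this.savedPosition = stream.Position; // "store current stream position"
    }

    public void Dispose()
    {
        stream.Position = savedPosition; // "reposition the stream"
    }
}

public static class ResetDemo
{
    // Returns true if the stream is back where it was after the using block.
    public static bool Run()
    {
        MemoryStream stream = new MemoryStream(Encoding.ASCII.GetBytes("abcdef"));
        stream.ReadByte(); // consumes 'a'
        stream.ReadByte(); // consumes 'b'
        long before = stream.Position;

        using (new StreamResetPoint(stream))
        {
            // Some other component reads as much as it likes.
            stream.ReadByte();
            stream.ReadByte();
        }

        // Back at the saved position, so the next byte is 'c' again.
        return stream.Position == before && stream.ReadByte() == 'c';
    }

    public static void Main()
    {
        Console.WriteLine(Run());
    }
}
```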

But as the scanner is buffering (something that should be left to the lower layer, as stated), the only way to get "what's left" in the stream without losing what has already been buffered is to use the XmlTextReader.GetRemainder() method. Guess what: after calling that method, you have effectively screwed your "main" reader. And as the XmlTextReader doesn't support ICloneable either, you can't even store/clone/keep its internal state before screwing it. I heard someone suggest that one *hack* would be to store the element qname and depth, construct a new reader and read again until it's met again. This is clearly an unacceptable hack: we would be parsing the same thing multiple times, wasting processing time by reading useless nodes, etc.
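A small sketch of that behavior, using a StringReader as input and a made-up RemainderDemo class: after GetRemainder() hands back the unparsed text, the original reader reports EOF and has nothing more to give.

```csharp
using System;
using System.IO;
using System.Xml;

public static class RemainderDemo
{
    // Reads two nodes, takes the remainder, and reports whether the
    // original reader is at EOF afterwards.
    public static string TakeRemainder(out bool mainReaderAtEof)
    {
        XmlTextReader reader = new XmlTextReader(
            new StringReader("<root><a>1</a><b>2</b></root>"));
        reader.Read(); // <root>
        reader.Read(); // <a>

        // Everything the reader has not parsed yet, buffered or not.
        TextReader rest = reader.GetRemainder();

        mainReaderAtEof = reader.EOF;
        return rest.ReadToEnd();
    }

    public static void Main()
    {
        bool atEof;
        string remainder = TakeRemainder(out atEof);
        Console.WriteLine("Remainder: " + remainder);
        Console.WriteLine("Main reader at EOF: " + atEof);
    }
}
```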

The moral of the story: cleaner separation allows for novel uses not foreseen originally. Violations lead in the best case to ugly hacks, in the worst case (as in the XmlTextReader) to plain impossibility. Let's keep dreaming about the ResetableReader (or thinking about alternative XML parsers for .NET...).

Well, XmlReader is an abstract class. We can develop a resetable implementation. BTW, what about the Mono impl? AFAIK (there is a bit of my code in Mono's System.Xml.Xsl :) they are yacc/lex based. Could it be resetable?

oleg@tkachenko.com (Oleg Tkachenko) - Wednesday, February 25, 2004 4:52:00 PM

Yup, it is abstract. But then you have to go and implement a full-blown custom parser :S

Daniel Cazzulino - Wednesday, February 25, 2004 5:17:00 PM

Kirk,

There are no plans I am aware of to provide a Clone() method on XmlReader. You must have misheard Mark talking about the XPathNavigator.

Dare Obasanjo - Saturday, March 6, 2004 12:28:00 AM

3 Comments