Full duplex: How I seem to have tricked Clemens Vasters into mentioning me several times in his blog - but in fact was probably just misunderstood

Well, well, fellow German RD Clemens Vasters obviously found some time to read my blog between his job as a (probably full-time) "architecture consultant" and writing "more code on a monthly basis than some full-time application developers"; so in his blog entry he honors me with some mentions of my recent "discoveries".

Unfortunately I see some misunderstandings between us. So let me reply to a couple of aspects of his posting:

:::"The 'onion' [...] resembles the notation that Steve Swartz and I introduced": Well, I'd say I did not claim I invented "the onion". And neither did Steve and Clemens invent it. It is a common design pattern.

Also, two things that look the same do not necessarily mean the same. Clemens' picture of an onion in his post is a picture of a service. My intention, though, was not to describe a service. Quite the contrary: so far I haven't made any explicit statement about how I'd say an "application" should be structured. Rather, I left this to SOA. I'm (currently) explicitly not concerned with inter-service communication.

:::"What Ralf describes [...] is in fact a 'data service' as per our definition.": Well, Clemens seems to want to map any circle to a service. But he overlooked that I explicitly said that my (!) resource components (usually) live in the application process. And I mean it. There is no necessity to put data access into a service and deploy it far away from its clients. Sometimes it might make sense, sometimes it doesn't.

:::"Delegating resource access to autonomous services instead of 'just' encapsulating it [...]": As I already said, sometimes it is good to set up a service for accessing a resource. But I don't want to belittle the option of "just" encapsulating a resource in an in-proc layered component. If you have a hammer, everything looks like a nail. So if you're working on service infrastructure, as Clemens does, probably almost everything looks like a candidate to be turned into a service. But as much as I like simplification and unification, I don't think we should see everything through service glasses.
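To make the "just encapsulating" option a bit more tangible, here is a minimal sketch. All names in it - ICustomerResource, SqlCustomerResource, the Customers table - are invented for illustration; the point is only that a resource component can sit on top of ADO.NET and be called in-process by the business logic, with no service boundary anywhere in sight.

```csharp
using System.Collections.Generic;
using System.Data.SqlClient;

// Illustrative contract of an in-proc resource component.
public interface ICustomerResource
{
    IEnumerable<string> GetCustomerNames(string country);
}

// In-proc implementation on top of ADO.NET; the business logic calls it
// like any other class - no remoting, no message exchange involved.
public class SqlCustomerResource : ICustomerResource
{
    private readonly string connectionString;

    public SqlCustomerResource(string connectionString)
    {
        this.connectionString = connectionString;
    }

    public IEnumerable<string> GetCustomerNames(string country)
    {
        using (SqlConnection con = new SqlConnection(connectionString))
        using (SqlCommand cmd = new SqlCommand(
            "SELECT Name FROM Customers WHERE Country = @country", con))
        {
            cmd.Parameters.AddWithValue("@country", country);
            con.Open();
            using (SqlDataReader reader = cmd.ExecuteReader())
            {
                // Rows are handed to the caller one by one as it enumerates.
                while (reader.Read())
                    yield return reader.GetString(0);
            }
        }
    }
}
```

And if it turns out tomorrow that a "data service" makes sense after all, the interface stays; only the implementation becomes a proxy.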

:::"Now... Ralf [is] very concerned about streaming data": No, I'm not very concerned about it, but I saw a deficiency in Microsoft's message to developers. Microsoft's (current) view on data access is expressed in its implementation of ADO.NET. And this view leaves out streaming data access (to databases). (No, DataReaders are not enough. And pointing to ADO Classic is not enough either.) I just said that I understood the point many developers wanted to make when they asked what to do with huge amounts of data.

My intention was to give two clear rules: 1. Never use streaming access between frontend and application (service). 2. Streaming access within (!) an application (service) is ok.
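To show how I read my own two rules, here is an illustrative sketch; the type names are made up for this purpose. Inside the application the business logic pulls rows in a streaming fashion from a resource component (which might well wrap a DataReader), but towards the frontend it only ever hands out a finished, finite result.

```csharp
using System.Collections.Generic;

// Invented types, just to illustrate the two rules.
public class Order
{
    public int Month;          // 1..12
    public decimal Total;
}

public interface IOrderResource
{
    // Rule 2: streaming pull access within the application is ok,
    // e.g. implemented on top of a DataReader.
    IEnumerable<Order> StreamOrders(int year);
}

public class SalesFacade      // what the frontend talks to
{
    private readonly IOrderResource orders;

    public SalesFacade(IOrderResource orders)
    {
        this.orders = orders;
    }

    // Rule 1: the frontend never sees a stream, only a small, finite result.
    public decimal[] GetMonthlyTotals(int year)
    {
        decimal[] totals = new decimal[12];

        // Orders are pulled one by one and consolidated on the fly;
        // the stream never leaves the application.
        foreach (Order o in orders.StreamOrders(year))
            totals[o.Month - 1] += o.Total;

        return totals;
    }
}
```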

:::"Now, I consider data consolidation [...]": I don't see why data consolidation (here: reporting) should be inherent to a data store. As the term says, a data store stores data. Why should it print a report on it? Although traditionally it has been this way, it does not need to stay this way. Consider, for example, "data consolidation" across several data stores. Which data store should be responsible for it?

Sure, a data store like a database can and should provide aggregation functions, cross-tabs and the like. But it should be devoid of a presentation layer like a report engine.

:::"A reporting rendering tool shall get pre-consolidated data": I totally agree. A reporting tool should only be concerned with formatting consolidated data. No further calculations are necessary.

:::"Also, Ralf's proposed delivery of data to a reporting engine in chunks": I'm not proposing delivery in chunks. My point was just to push data to a reporting engine, rather than let the reporting engine pull it from a resource directly (and even do consolidation work on it). Reporting engines must not circumvent application logic!

:::"[...] doesn't avoid [...] co-locate all received data": I don't make any statement as to how and where a reporting engine should store data or how it should handle it. If data is consolidated within the application and pushed to the reporting engine in the right order, the reporting engine probably does not need to store much data during report production.
(This may of course be different if Clemens means that a reporting engine stores all data in an intermediate format first.)
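Here is a rough sketch of the push idea in code. The IReportSink interface is invented for illustration - a real reporting engine would of course expose its own API - but it shows the direction of the data flow I have in mind:

```csharp
using System.Collections.Generic;

// Hypothetical sink interface the reporting engine would implement.
public interface IReportSink
{
    void BeginReport(string reportName);
    void PushRow(string label, decimal value);   // rows arrive pre-consolidated
    void EndReport();
}

public class YearEndReporting
{
    private readonly IReportSink sink;

    public YearEndReporting(IReportSink sink)
    {
        this.sink = sink;
    }

    // The application walks its already consolidated data and pushes it
    // row by row; the engine can render as rows arrive and never pulls
    // anything from a resource itself.
    public void Run(IDictionary<string, decimal> totalsByRegion)
    {
        sink.BeginReport("Year-end totals by region");

        foreach (KeyValuePair<string, decimal> entry in totalsByRegion)
            sink.PushRow(entry.Key, entry.Value);

        sink.EndReport();
    }
}
```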

By the way, if a reporting engine is designed as an application itself, it could host application-specific resource components and get streaming pull access to them. That's fine with me.

:::"What Ralf leaves a bit in the fog is really how the reporting engine learns of a new reporting job, [...]": Yes, sure - why should I clear the fog? My intention was not to talk much about the details of reporting.

How should a reporting engine learn about a new job? Maybe the application establishes a TCP/IP connection with it and sends some appropriate information? Remember, from the point of view of an application, a reporting engine is a data sink. So streaming push access is allowed.

Where should results be delivered to a reporting engine? Maybe through a TCP/IP connection? Maybe through a service call? That is the responsibility of the reporting engine's resource data sink component.
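Just to make this a little less foggy without claiming it is the way to do it, here is one purely illustrative possibility: the application opens a TCP/IP connection to the reporting engine and announces the job; the results could then be pushed through the same kind of channel or through a sink component like the one sketched above. Host, port, and the little one-line "protocol" are all invented for this sketch.

```csharp
using System.IO;
using System.Net.Sockets;

// Illustrative only: announcing a new reporting job over plain TCP/IP.
// A service call would do just as well.
public class ReportJobNotifier
{
    private readonly string host;
    private readonly int port;

    public ReportJobNotifier(string host, int port)
    {
        this.host = host;
        this.port = port;
    }

    public void AnnounceJob(string reportName, int expectedRows)
    {
        using (TcpClient client = new TcpClient(host, port))
        using (StreamWriter writer = new StreamWriter(client.GetStream()))
        {
            // The application pushes; from its point of view the reporting
            // engine is just a data sink listening on the other end.
            writer.WriteLine("JOB " + reportName + " ROWS " + expectedRows);
            writer.Flush();
        }
    }
}
```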

How to deal with concurrency? What does Clemens mean? Concurrent data access? Or several reporting tasks at the same time?

:::"Ralf doesn't mention context and how it is established and also doesn't loop his solution back to the layering model he found.": Hm... which context does Clemens mean? In his posting he has a tendency to become a bit obscure when things get interesting (e.g. "context", "but I don't want to get carried away here"). And what does he mean by "loop his solution back"? I embedded my view on reporting/stream data access in my application model. And I did not intend to be more specific with regard to reporting.

:::"The reporting service shall be autonomous": Maybe to Clemens' surprise, this is ok with me. If so, though, it is important that the reporting service cannot circumvent my application logic. It would need to go through either the application or one of its data source resource components.

:::"What's needed is simply a message/call exchange pattern [...]": Sounds good. Sure, there should be duplex communication between services. But this should not be limited to message exchange. Message exchange is good for spanning larger distances between components. But not all components need to be set far apart. This is why, in my application architecture picture, I distinguish between applications and resources. Applications are removed from frontends and from other applications. But applications (or business logic) and resources can reside in close vicinity.

Again, even though we need to move to more service-oriented thinking, not everything is a service.

:::"Bottom line [...]": I agree with Clemens here: sometimes pull is good, sometimes push, and having duplex communication is excellent.

To wrap it up: I don't want to replace Clemens' and Steve's view, but rather complement or refine it here and there. As much as I like their approach, I still think it provides too few concrete rules. Plus, it starts at a point where many developers are not yet. Without first changing the developers' view of application architecture slightly by bringing application logic into the center of their thinking, they will be neither able nor willing to jump on the service bandwagon. That's what I'm concerned with.

First shift application logic into the center of the app architecture view, then explain what happens with frontends and data access. Then wait for this change to settle in. And only after several months (even years?) have passed should we expect the majority of developers to breathe SOA.

We've seen the MTS/COM+ disaster: barely anybody knowing or using Microsoft's Application Server. This must not happen to the next technology. So, rather than wading knee-deep in theoretical discussions, I prefer to pick up developers where they are right now and provide them with a "migration path" for their thinking.

Well, what is left now is to say: Hey, Clemens, let's have a beer and push the knives back into our boots, ok? ;-) (Oops, look at that, mine was just a rubber knife anyway :-)

2 Comments

  • Hi Ralf,



    I wonder how your "never expose data streams to the frontend" rule applies to data sources that are inherently streaming and whose contents must surface on the frontend in that form, because they might be time-critical or because that is simply what they are.

    Examples: Real-time push news feeds, real-time push market data, audio/video broadcast streams, continuous technical/manufacturing data streams (temperature, pressure, volumes, etc. in near-real-time process control environments) are all examples of where you get huge amounts of transient data that might possibly be filtered and preprocessed by some server/application logic, but must be continuously streamed up to the frontend to display dynamic text, graphs, video, etc.

    I consider a database cursor that has a huge result set hanging off of it as just one rather special case of such "streaming" data that usually has the added luxury of not being as time-critical as said stream data sources. So it's absolutely reasonable (as you say) to pump them into a consolidator (report engine) and then have the presentation layer pull the "customer-ready" report out.

    However, near-real-time, time-critical reporting as needed on a trading floor or in a manufacturing control or lab environment might not let you do this kind of indirection and, yet, that's reporting too. I think that "never use streaming between frontend and application" is a bit of a strong statement when we generalize it and look at more than just DB access.



    ... that's unless I am really misunderstanding something here.



    Clemens



    PS: I came unarmed. It's just that the "onion" caught my eye in your recent post.

  • Again, this data should be pushed to the frontend. I'd recommend setting up a data sink to which the application pushes the data. What the data sink then does with the data is its business only. Maybe it is connected via TCP/IP with a control located on some GUI frontend.

    I just would not see this as being part of the (primary) interface between frontends and application.
