Re: XML databases

From: JOG <jog_at_cs.nott.ac.uk>
Date: 10 May 2006 10:03:01 -0700
Message-ID: <1147280581.037755.85940_at_j73g2000cwa.googlegroups.com>


dawn wrote:
> JOG wrote:
> > This is essentially a request for help - I am (relatively urgently)
> > searching out references that discuss XML databases with the view that
> > they may in some cases be deleterious for information storage.
> >
> > I am looking for research work that indicates that, while XML can
> > obviously handle irregular data, the artificial hierarchical
> > constraints it introduces can produce undesirable query bias and hides
> > many of the information's internal relationships.
>
> I would think most of the same arguments against MV would hold for XML
> data stores. If one makes use of the nested lists within XML (which
> would be typical of an XML schema), then you have "arbitrarily" (or,
> perhaps, with good sound analysis) decided to model a hat size as a
> property of a person rather than seeing all data as equal ;-)

Yes, a good point, it is an old debate. However the powers that be in the opaque european beauracracies that I'm going to be dealing with are going to need something requiring minimal extrapolation on there part - i.e. references will unfortunately have to have XML emblazoned across them. Perhaps in flashing neon.

>
> > I'm not intending to start a debate on XML - that's being had in many
> > other threads - just looking for formal citeable references.
>
> Everything that Codd or Date wrote prior to the 90's that includes any
> discussion of normalization and anything in the industry in support of
> what-was-once-termed-1NF would do, along with anything in support of a
> 3VL. It is wrong IMO, of course, but it would back up a position of
> query bias, for example.

Well I think it is a big step forward for those on either side of the MV arguments to realise that they are actually debating a premise - the merits or lack thereof of query bias (as highlighted by Jon Heggland). There is far too much fruitless argument on the consequences of this decision as opposed to the merits of it in the first place imo.

In the domain that I require these references for (neurogenesis research), a lack of query bias is essential. The area of knowledge is blurry to say the least and its future use is undetermined and so it must be store impartially.

>
> > Anyhow
> > anything of that nature, indicating potential deficiences (or benfits
> > in fact) in the XML 'data model'.
>
> If you would really like benefits, you have 2VL, NF2, and all
> (non-binary, non-mime/dime) types being subtypes of strings (which gets
> you some interesting benefits with similarities between handling and
> these documents, e.g. related rules, as handling user input, for
> example). The XML vs RM argument is very similar to the MV vs RM
> argument, with additions of a) more nested levels in XML, so it is more
> like Cache' MUMPS than Cache' PICK b) a distinction between attributes
> and entities in XML (that I don't find particularly helpful) and c) XML
> schema that may be used for enforcement or not, as desired.

Yes, if you have any formal references they are all worthy of inclusion - but again, they have to have XML databases swathed across them. I was hoping Jan might have some pointers here as it is his area of expertise (most works i've found seem to take the merits of XML databases as fait accompli).

>
> My reason for bringing up the MV data model at all in this forum is
> related to the future, not the past. The fact that there are highly
> successful systems employed by highly successful companies that use
> something similar to an XML data model already, is evidence that at
> least it can be successful. I will grant that there are pros and cons
> to the use of a di-graph model with trees on the nodes, but any blanket
> dismissal of persisted XML due to the data model is likely based more
> on data model religion than experience.

RDF has similar issues.

>
> > Anything you can share is much appreciated, Jim.
>
> I did not intend for this to be an MV discussion and do not want to
> make it that, but in case you were not aware of the similarlies, I
> thought I'd pipe up before I head out. Cheers! --dawn

Thanks for your input. J. Received on Wed May 10 2006 - 19:03:01 CEST

Original text of this message