Re: no names allowed, we serve types only

From: David BL <davidbl_at_iinet.net.au>
Date: Tue, 23 Feb 2010 09:08:24 -0800 (PST)
Message-ID: <59fe40c1-22b7-47f3-867b-bc06ec3a98e8_at_o16g2000prh.googlegroups.com>


On Feb 23, 5:28 pm, Jan Hidders <hidd..._at_gmail.com> wrote:
> On 23 feb, 01:33, David BL <davi..._at_iinet.net.au> wrote:
> > On Feb 23, 12:49 am, Jan Hidders <hidd..._at_gmail.com> wrote:
>
> > > On 22 feb, 15:39, Jan Hidders <hidd..._at_gmail.com> wrote:
> > > > There is indeed a problem with the subtype = subset principle, or at
> > > > least the one that I had in mind:
>
> > > > - if t1 <= t2 then [[t1]] is a subset of [[t2]]
>
> > > > where [[t]] denotes the set of all values that are of type t. Sorry
> > > > for being unclear about that.
>
> > > > The restriction that f in the definition should be uniquely defined is
> > > > implied by this principle, which IMNSHO makes it not ad-hoc or
> > > > gratuitous but rather well-founded. The proof is left to the reader as
> > > > an exercise. ;-)
>
> > My problem is that there may be existing types t1,t2,t3,t4 and an
> > existing declared relation-type with H2 = {t3,t4}. Someone wants to
> > create a new relation-type with H1 = {t1,t2} that subtypes H2, perhaps
> > where it is required that t1,t3 represent one role and t2,t4 represent
> > another. However it happens that both t1 and t2 subtype both t3 and
> > t4 so therefore it cannot be done because of ambiguity. That seems
> > unreasonable.
>
> Mathematics can sometimes be very unreasonable. :-) But I think this
> makes sense. If you give me a tuple (row/record) of H2 and would ask
> for the t1-value in that tuple, then this is ambiguous, since both the
> t3 and t4 value are also t1 values. It seems reasonable to assume that
> for tuples of type H1 we require that they contain a single value of
> type t1 and a single value of type t2. But the tuple of type H2 would
> actually in some sense contain two values of type t1 (and two values
> of type t2). So semantically speaking it is indeed not really a
> subtype.
>
> You could try to fix this by taking another semantics of the types,
> but I don't see a reasonable alternative at the moment. In fact, my
> current intuition is that this is a symptom of the fact that we are
> trying to shoehorn the atrribute-type relationship into the subtype
> relationship, where it doesn't really fit comfortably.

I agree with all that, and add that it is noteworthy that the ambiguity problem disappears with named attributes.

Anyway, I don't believe our analysis is complete, because our definitions don't specify what happens exactly with Keith's "copy" types (which must be used where there is a subtype relationship between two of the attributes in a header).

> > > PS. Note that the corrected definition can be compactly formulated:
>
> > > Definition: H1 <= H2 if
> > > for each type t2 in H2
> > > there is exactly one type t1 in H1
> > > such that t1 <= t2.
>
> > That doesn't seem right to me - by that definition H1 might have more
> > attributes than H2 and yet be considered a subtype.
>
> Of course. For the usual record types it holds that <a:t1, b:t2> is a
> supertype of <a:t1, b:t2, c:t3>. You can use values of the latter
> where values of the first are expected, not the other way around.

That view of subtyping concerns a possible interpretation of substitutability (of values), but I believe it is better to adhere to the subtype = subset principle for data types. It cannot be said that tuples of the form <a:t1, b:t2, c:t3> represent a subset of tuples of the form <a:t1, b:t2>.

C.Date presents this argument very well in section 20.9 of an Introduction to Database Systems where he claims that a coloured circle is not a subtype of circle (or vice versa).

I suggest implicit conversions must only be an artefact of the type system (i.e. implicit conversions cannot actually change a value by adding or removing information), and in fact no concept of implicit conversions is required in a typeless formalism. Received on Tue Feb 23 2010 - 18:08:24 CET

Original text of this message