Re: Word-Level Inverted File Structure
Date: Wed, 11 Oct 2000 17:27:26 +0800
Message-ID: <39e4319a$0$11625$7f31c96c_at_news01.syd.optusnet.com.au>
"Jan Hidders" <hidders_at_REMOVE.THIS.win.tue.nl> wrote in message
news:8s19pp$knf$1_at_news.tue.nl...
> Pete Nayler wrote:
> > The structure I'm referring to is explained in Witten et al "Managing
> > Gigabytes", where each word in an inverted file is referenced using:
> >
> > <2;(1;6,9),(4;8)>
> >
> > where the (bracketed) terms can be expressed as
> >
> > (x ; y1, y2, y3, ...)
> >
> > where x represents the document in which the word exists, and y
represents
> > the word position in the document.
> >
> > The question is, what does the first term in the full structure
represent?
>
> I'm totally guessing here, but could it be the word for which the
positions
> are indicated?
Thanks for the reply, but in the book, it gives an example of indexing using a series of documents, giving the word listing as follows:
cold - <2;(1;6),(4;8)>
hot - <2;(3;2),(6;2)>
warm - <2;(1;3),(4;4)>
etc...
As you can see, the first term is always "2", which preceeds the document and then the position. Puzzling...
Pete Received on Wed Oct 11 2000 - 11:27:26 CEST