| Oracle FAQ | Your Portal to the Oracle Knowledge Grid | |
Home -> Community -> Usenet -> comp.databases.theory -> Re: Searching Google n-gram corpus
Shield wrote:
> compressing the data would take allot of time. time taken away from
> the actual experiment.
>
> what would be the fastest way to using the dataset, using the same
> conditions of searching for occurances?
>
>
>
The payoff on the compression & reindexing is less than 100 straight
searches on the original corpus. If your experiment is that small (or
even 1000 searches) just do the in the brute force way.
Received on Sun Sep 16 2007 - 00:45:10 CDT
![]() |
![]() |