Problems with doc index
Document index approach good for static set of documents
But for SIFT:
- large profile set
- document set not static
- New documents arrive regularly - document set to be matched changes regularly
- Profiles relatively static
Hence build the index on profiles instead