Duplicate removal
Assume exixtence of copy detection blackbox
duplicate restraint per profile per document
- < profileid P, docid E, definition L, expiration T >
- user definition of duplicates using duplication threshold (degree of overlap)
- different for different documents. eg:
- high: 100% for bug fixes
- low: 60% for “call for participation” notices
- restraint is valid in a certain time window based on document type