And the survey says . . .
Duplication is good at high skews, but bad at low skews
At low skews (uniform access frequency):
- Duplicate elimination has good global memroy utilization.
At high skews (zipf over 1)
- Client-Server keeps hot pages at all nodes in memory
- duplication is good
Hybrid nears the performance of the better choice in both scenarios.
Varying database size, using diverse file sizes, and more nodes gave similar results.