Skip to content

Latest commit

 

History

History
5 lines (4 loc) · 312 Bytes

File metadata and controls

5 lines (4 loc) · 312 Bytes

MinHash

MinHash (also known as min-wise independent permutations locality sensitivity hashing scheme), used to efficiently estimate set similarity. It was invented by Andrei Border in 1997.

Application include detecting duplicate web pages, used by AltaVista search engine. For finding similar webpages.