Embed Notice
HTML Code
Corresponding Notice
- Embed this notice
:blank: (i@declin.eu)'s status on Wednesday, 09-Oct-2024 02:11:20 JST:blank: @realcaseyrollins locality sensitive hashing tries to create a number, that you can use in a distance calculation, to see how similar two documents are, even if some words or letters differ
most spam is low effort, so a low fidelity lsh will result in the same number, and you can skip hashes you've seen already
did the same thing on ditto too