Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make image similarity search significantly faster #72

Merged
merged 1 commit into from
Oct 15, 2020
Merged

Make image similarity search significantly faster #72

merged 1 commit into from
Oct 15, 2020

Conversation

Syfaro
Copy link
Contributor

@Syfaro Syfaro commented Oct 15, 2020

This PR makes the image similarity search significantly faster.

It does this by comparing the hamming distance of 8-byte arrays instead of the Levenshtein distance of base64 encoded data.

I ran master and this revision (both in release mode) against 10,312 files in 9,619 folders. The current master took around 410 seconds to complete, this version takes around 70 seconds.

@qarmin qarmin merged commit a55fe3d into qarmin:master Oct 15, 2020
@qarmin
Copy link
Owner

qarmin commented Oct 15, 2020

Thanks!
With my test collection time dropped from 121 seconds to 55 seconds.

@qarmin qarmin added the enhancement New feature or request label Oct 15, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants