immich's uploaded files are incorrectly identified as "already existing" due to hash duplication detection
Is my usage wrong? I upload a large number of pictures in subfolders through immich-go and upload pictures through Android App. The number of pictures on the server is always less than I expected, and by searching the file name, I can be very sure that the pictures do not exist on the server.
I want to know what algorithm immich uses to determine that the photos already exist on the server?
#immich's uploaded files are incorrectly identified as "already existing" due to hash duplication det
1 messages · Page 1 of 1 (latest)
:wave: Hey @novel sentinel,
Thanks for reaching out to us. Please follow the recommended actions below; this will help us be more effective in our support effort and leave more time for building Immich
.
References
- Container Logs:
docker compose logsdocs - Container Status:
docker compose psdocs - Reverse Proxy: https://immich.app/docs/administration/reverse-proxy
Checklist
- :blue_square: I have verified I'm on the latest release(note that mobile app releases may take some time).
- :blue_square: I have read applicable release notes.
- :blue_square: I have reviewed the FAQs for known issues.
- :blue_square: I have reviewed Github for known issues.
- :blue_square: I have tried accessing Immich via local ip (without a custom reverse proxy).
- :blue_square: I have uploaded the relevant logs, docker compose, and .env files using the buttons below or the
/uploadcommand. - :blue_square: I have tried an incognito window, disabled extensions, cleared mobile app cache, logged out and back in, different browsers, etc. as applicable
(an item can be marked as "complete" by reacting with the appropriate number)
If this ticket can be closed you can use the /close command, and re-open it later if needed.
The hash detection prevents uploading files with identical contents
I think it blocks a lot of unique images, which causes me to be missing a lot of images. Is this possible?
In other words, many photos with duplicate content need to be identified by the image content to determine their similarity. What does this quick "duplicate detection" during upload rely on? Is it reliable?
Similarity isn't relevant here, it's literally whether the files are the exact same
I see, it's just a hash check to make sure duplicate files aren't uploaded multiple times.