immich's uploaded files are incorrectly identified as "already existing" due to hash duplication det | Immich | Page 1

novel sentinel Jun 13, 2024, 7:23 AM

#

immich's uploaded files are incorrectly identified as "already existing" due to hash duplication detection
Is my usage wrong? I upload a large number of pictures in subfolders through immich-go and upload pictures through Android App. The number of pictures on the server is always less than I expected, and by searching the file name, I can be very sure that the pictures do not exist on the server.
I want to know what algorithm immich uses to determine that the photos already exist on the server?

zinc kelpBOT Jun 13, 2024, 7:23 AM

#

:wave: Hey @novel sentinel,

Thanks for reaching out to us. Please follow the recommended actions below; this will help us be more effective in our support effort and leave more time for building Immich immich .

References

Container Logs: docker compose logs docs
Container Status: docker compose ps docs
Reverse Proxy: https://immich.app/docs/administration/reverse-proxy

Checklist

:blue_square: I have verified I'm on the latest release(note that mobile app releases may take some time).
:blue_square: I have read applicable release notes.
:blue_square: I have reviewed the FAQs for known issues.
:blue_square: I have reviewed Github for known issues.
:blue_square: I have tried accessing Immich via local ip (without a custom reverse proxy).
:blue_square: I have uploaded the relevant logs, docker compose, and .env files using the buttons below or the /upload command.
:blue_square: I have tried an incognito window, disabled extensions, cleared mobile app cache, logged out and back in, different browsers, etc. as applicable

(an item can be marked as "complete" by reacting with the appropriate number)

If this ticket can be closed you can use the /close command, and re-open it later if needed.

fading sail Jun 13, 2024, 7:24 AM

#

The hash detection prevents uploading files with identical contents

novel sentinel Jun 13, 2024, 7:25 AM

#

I think it blocks a lot of unique images, which causes me to be missing a lot of images. Is this possible?

#

In other words, many photos with duplicate content need to be identified by the image content to determine their similarity. What does this quick "duplicate detection" during upload rely on? Is it reliable?

fading sail Jun 13, 2024, 7:37 AM

#

Similarity isn't relevant here, it's literally whether the files are the exact same

novel sentinel Jun 13, 2024, 7:39 AM

#

I see, it's just a hash check to make sure duplicate files aren't uploaded multiple times.

#immich's uploaded files are incorrectly identified as "already existing" due to hash duplication det

References

Checklist