#immich's uploaded files are incorrectly identified as "already existing" due to hash duplication det


novel sentinel
#

immich's uploaded files are incorrectly identified as "already existing" due to hash duplication detection
Am I using it wrong? I upload a large number of pictures in subfolders through immich-go, and I also upload pictures through the Android app. The number of pictures on the server is always lower than I expect, and by searching for the file names I can be sure that the missing pictures do not exist on the server.
What algorithm does Immich use to determine that a photo already exists on the server?

zinc kelpBOT
#

:wave: Hey @novel sentinel,

Thanks for reaching out to us. Please follow the recommended actions below; this will help us be more effective in our support effort and leave more time for building Immich.

Checklist

  1. :blue_square: I have verified I'm on the latest release (note that mobile app releases may take some time).
  2. :blue_square: I have read applicable release notes.
  3. :blue_square: I have reviewed the FAQs for known issues.
  4. :blue_square: I have reviewed Github for known issues.
  5. :blue_square: I have tried accessing Immich via local ip (without a custom reverse proxy).
  6. :blue_square: I have uploaded the relevant logs, docker compose, and .env files using the buttons below or the /upload command.
  7. :blue_square: I have tried an incognito window, disabled extensions, cleared mobile app cache, logged out and back in, different browsers, etc. as applicable

(an item can be marked as "complete" by reacting with the appropriate number)

If this ticket can be closed you can use the /close command, and re-open it later if needed.

fading sail
#

The hash detection prevents uploading files with identical contents

novel sentinel
#

I think it blocks a lot of unique images, which causes me to be missing a lot of images. Is this possible?

#

In other words, I would expect photos with duplicate content to be identified by comparing the image content itself to determine their similarity. What does this quick "duplicate detection" during upload rely on? Is it reliable?

fading sail
#

Similarity isn't relevant here; it's literally whether the files are exactly the same
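
To illustrate what "the exact same" means for a hash check, here is a minimal sketch. It is not Immich's actual code: the choice of SHA-1, the file names, and the `content_digest` helper are all assumptions made for the example. The point is only that the digest depends on the file's bytes, not on its name or folder, and that changing a single byte gives a different digest.

```python
import hashlib


def content_digest(data: bytes) -> str:
    """Hex digest of the raw bytes (SHA-1 is an assumption for this sketch)."""
    return hashlib.sha1(data).hexdigest()


original = b"\xff\xd8\xff\xe0 example image bytes"
same_bytes_other_name = bytes(original)       # identical contents
one_byte_changed = original[:-1] + b"!"       # differs in the last byte

# Hypothetical uploads: file name -> file contents.
uploads = {
    "IMG_0001.jpg": original,
    "holiday/copy.jpg": same_bytes_other_name,
    "IMG_0001_edited.jpg": one_byte_changed,
}

seen = {}      # digest -> first file name seen with that digest
results = {}   # file name -> outcome of the check
for name, data in uploads.items():
    digest = content_digest(data)
    if digest in seen:
        # Same bytes as an earlier file, regardless of name or folder.
        results[name] = f"duplicate of {seen[digest]}"
    else:
        seen[digest] = name
        results[name] = "new"

for name, outcome in results.items():
    print(f"{name}: {outcome}")
```

In this sketch only `holiday/copy.jpg` is flagged, because its bytes match `IMG_0001.jpg`; the edited file counts as new even though it looks almost identical.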

novel sentinel
#

I see, it's just a hash check to make sure duplicate files aren't uploaded multiple times.
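
If the server count is lower than expected, one way to check the local side is to group the files to be uploaded by content hash and see which ones share identical bytes under different names. The following is a rough sketch under assumptions: SHA-1 is assumed as the hash, and `file_digest` and `find_identical_files` are hypothetical helper names, not part of Immich or immich-go.

```python
import hashlib
from collections import defaultdict
from pathlib import Path


def file_digest(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file's contents in chunks (SHA-1 is an assumption here)."""
    h = hashlib.sha1()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def find_identical_files(root: Path) -> list[list[Path]]:
    """Return groups of files under root whose contents are byte-identical."""
    by_digest = defaultdict(list)
    for path in sorted(root.rglob("*")):
        if path.is_file():
            by_digest[file_digest(path)].append(path)
    # Keep only digests shared by more than one file.
    return [paths for paths in by_digest.values() if len(paths) > 1]
```

Calling `find_identical_files(Path("/path/to/photos"))` (the path is a placeholder) returns one list per set of identical files. Any file in such a group beyond the first would be expected to be reported as already existing by an exact-match hash check, even if its file name is unique.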