#image outputs from llm

3 messages · Page 1 of 1 (latest)

rugged vortex
#

Which model helps in image outputs from text inputs

weak ploverBOT
#

Hi @rugged vortex. I'm kapa.ai, your friendly support bot. I will try to answer your question by searching through the knowledge base. :wave:

#

Nomic provides a feature called Nomic Multimodality which allows you to run text-to-image and image-to-text vector searches on your data. This means you can find articles or images that match a certain query. For example, you can find cat articles by providing a picture of a cat or find cat images that match the query "What animals are cute to cuddle with?".

This is made possible by the Nomic Text and Vision embedding models which provide compatible, aligned embeddings. The models used are nomic-embed-vision-v1 and nomic-embed-vision-v1.5 which are vision embedding models aligned to Nomic Embed Text v1 and v1.5 respectively.

You can find more details about this in the Nomic Documentation (https://docs.nomic.ai/atlas/capabilities/vectors#nomic-multimodality).

Note: you can ask me a follow up question by @ mentioning me again :speech_balloon: