Search/
Skip to content
/

NVIDIA: Llama Nemotron Embed VL 1B V2 (free)Free variant

nvidia/llama-nemotron-embed-vl-1b-v2:free

Released Feb 25, 2026131,072 context
$0/M input tokens$0/M output tokens

The Llama Nemotron Embed VL 1B V2 embedding model is optimized for multimodal question-answering retrieval. The model can embed 'documents' in the form of image, text, or image and text combined. Documents can be retrieved given a user query in text form. The model supports images containing text, tables, charts, and infographics.

OpenRouter
© 2026 OpenRouter, Inc

Product

  • Chat
  • Rankings
  • Models
  • Providers
  • Pricing
  • Enterprise

Company

  • About
  • Announcements
  • CareersHiring
  • Privacy
  • Terms of Service
  • Support
  • State of AI
  • Works With OR

Developer

  • Documentation
  • API Reference
  • SDK
  • Status

Connect

  • Discord
  • GitHub
  • LinkedIn
  • X
  • YouTube

Sample code and API for Llama Nemotron Embed VL 1B V2 (free)

OpenRouter normalizes requests and responses across providers for you.

OpenRouter supports image input embeddings for models that can generate embeddings from both text and images. Pass multimodal content using the content array format with text and image_url components. Learn more about image embeddings.

In the examples below, the OpenRouter-specific headers are optional. Setting them allows your app to appear on the OpenRouter leaderboards.

Using third-party SDKs

For information about using third-party SDKs and frameworks with OpenRouter, please see our frameworks documentation.

See the Request docs for all possible fields, and Parameters for explanations of specific sampling parameters.