Llama 3.2 Vision is a collection of instruction-tuned generative models for image reasoning, available in 11B and 90B parameter sizes.

Tags: vision, 11b, 90b

137K pulls · updated 2 weeks ago

fefc914e46e6 · 32B
{
"temperature": 0.6,
"top_p": 0.9
}
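
The parameter file above only lists two sampling values. As a rough illustration of what such settings usually mean, the following Python sketch applies temperature scaling followed by top-p (nucleus) filtering to a toy list of logits. The function names `nucleus_probs` and `sample` and the example logits are hypothetical, and the order of operations is an assumption; the runtime that serves this model may implement sampling differently.

```python
import json
import math
import random

# Values copied from the parameter file shown above.
PARAMS = json.loads('{"temperature": 0.6, "top_p": 0.9}')


def nucleus_probs(logits, temperature, top_p):
    """Return {token_index: probability} after temperature scaling and top-p filtering.

    Hypothetical helper for illustration only, not the runtime's actual code.
    """
    # Temperature below 1.0 sharpens the distribution; above 1.0 flattens it.
    scaled = [x / temperature for x in logits]

    # Numerically stable softmax.
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Keep the most likely tokens until their cumulative mass reaches top_p.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, cumulative = [], 0.0
    for i in order:
        kept.append(i)
        cumulative += probs[i]
        if cumulative >= top_p:
            break

    # Renormalize the surviving tokens so they sum to 1.
    mass = sum(probs[i] for i in kept)
    return {i: probs[i] / mass for i in kept}


def sample(logits, temperature, top_p, rng=random):
    """Draw one token index from the filtered distribution."""
    dist = nucleus_probs(logits, temperature, top_p)
    tokens = list(dist)
    return rng.choices(tokens, weights=[dist[t] for t in tokens], k=1)[0]


if __name__ == "__main__":
    toy_logits = [2.0, 1.0, 0.0, -3.0]  # made-up scores for four tokens
    print(nucleus_probs(toy_logits, **PARAMS))
    print(sample(toy_logits, **PARAMS))
```

Lowering `temperature` concentrates probability on the top-scoring tokens, and lowering `top_p` discards more of the low-probability tail before sampling.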