llava:13b
8.9M downloads · Updated 1 year ago
LLaVA is a novel end-to-end trained large multimodal model that combines a vision encoder and Vicuna for general-purpose visual and language understanding. Updated to version 1.6.
vision
7b
13b
34b
llava:13b / params
7215dae26124 · 33B
{
"stop": [
"USER:",
"ASSSISTANT:"
]
}
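The "stop" list above names the strings at which generation should end. As a rough illustration of that idea, here is a minimal Python sketch that cuts text at the earliest stop sequence. It is not the runtime's actual implementation: `truncate_at_stop` is a hypothetical helper, the sample text is made up, and the stop strings are copied verbatim from the params (spelling included), since matching is assumed to be exact.

```python
import json

# Parameter blob copied verbatim from the params listing (spelling included).
PARAMS_JSON = '{"stop": ["USER:", "ASSSISTANT:"]}'


def truncate_at_stop(text: str, stops: list[str]) -> str:
    """Hypothetical helper: cut text at the earliest occurrence of any stop string."""
    cut = len(text)
    for stop in stops:
        idx = text.find(stop)  # -1 when the stop string is absent
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut]


stops = json.loads(PARAMS_JSON)["stop"]

# Made-up model output that runs on into the next conversational turn.
raw = "The image shows a cat.\nUSER: and what else?"
print(truncate_at_stop(raw, stops))
```

Because the comparison is an exact substring match in this sketch, a differently spelled marker in the output would not trigger the cut.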