firefunction-v2:70b-q3_K_S

500.3K Downloads · Updated 1 year ago

An open weights function calling model based on Llama 3, competitive with GPT-4o function calling capabilities.

tools 70b

ollama run firefunction-v2:70b-q3_K_S

curl http://localhost:11434/api/chat \
  -d '{
    "model": "firefunction-v2:70b-q3_K_S",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='firefunction-v2:70b-q3_K_S',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'firefunction-v2:70b-q3_K_S',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Details

474005ee9e26 · 31GB

model

arch llama · parameters 70.6B · quantization Q3_K_S

31GB

license

META LLAMA 3 COMMUNITY LICENSE AGREEMENT Meta Llama 3 Version Release Date: April 18, 2024 “Agreem

7.8kB

params

{ "stop": [ "<|start_header_id|>", "<|end_header_id|>", "<|eot_id|>"

96B

template

{{- if .Messages }} {{- if or .System .Tools }}<|start_header_id|>system<|end_header_id|> {{ if .Sys

2.0kB

Readme

Firefunction-v2 is competitive with GPT-4o function calling capabilities, scoring 0.81 on a medley of public benchmarks vs 0.80 for GPT-4o.

Firefunction-v2 is optimized for real-world scenarios including multi-turn conversation, instruction following, and parallel function calling. It retains Llama 3’s multi-turn instruction capability (0.84 vs 0.89 on MT bench) while consistently outscoring Llama 3 on function calling tasks (0.51 vs 0.30 on Nexus parallel multi-function eval).
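Since the model is tagged for tools, a function-calling round trip may help make the description above concrete. The following is a minimal sketch, not an official example: `get_weather`, `TOOLS`, `AVAILABLE`, `run_tool_calls`, and `ask` are hypothetical names introduced for illustration, and the weather result is hard-coded. It assumes a recent `ollama` Python package whose `chat` accepts a `tools` list and returns `response.message.tool_calls`, plus a local Ollama server with this model pulled. Because the model may request several calls in one turn (parallel function calling), the dispatcher loops over every requested call.

```python
import json


def get_weather(city: str) -> str:
    """Hypothetical tool for illustration; returns a canned answer."""
    return f"Sunny in {city}"


# JSON-schema style tool description passed to the model.
TOOLS = [{
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Get the current weather for a city",
        "parameters": {
            "type": "object",
            "properties": {
                "city": {"type": "string", "description": "City name"},
            },
            "required": ["city"],
        },
    },
}]

# Maps tool names the model may emit to local Python callables.
AVAILABLE = {"get_weather": get_weather}


def run_tool_calls(tool_calls):
    """Run every requested call and return one tool-role message per call."""
    results = []
    for call in tool_calls:
        name = call["function"]["name"]
        args = call["function"]["arguments"]
        if isinstance(args, str):  # tolerate JSON-encoded arguments
            args = json.loads(args)
        handler = AVAILABLE.get(name)
        output = handler(**args) if handler else f"unknown tool: {name}"
        results.append({"role": "tool", "content": str(output)})
    return results


def ask(prompt, model="firefunction-v2:70b-q3_K_S"):
    """Send a prompt, execute any tool calls, and return the final answer."""
    from ollama import chat  # imported here: needs the package and a running server

    messages = [{"role": "user", "content": prompt}]
    response = chat(model=model, messages=messages, tools=TOOLS)
    # Normalize the library's tool-call objects into plain dicts.
    calls = [
        {"function": {"name": c.function.name, "arguments": c.function.arguments}}
        for c in (response.message.tool_calls or [])
    ]
    if not calls:
        return response.message.content
    messages.append(response.message)        # keep the assistant's tool request
    messages.extend(run_tool_calls(calls))   # add one result per call
    return chat(model=model, messages=messages).message.content
```

Calling `ask("What is the weather in Paris and in Oslo?")` would, if the model chooses to emit two `get_weather` calls, run both locally and send the results back for a final answer. Whether it does so depends on the model's output for that prompt.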

References

Blog Post

Hugging Face