nemotron:70b-instruct-q4_K

nemotron:70b-instruct-q4_K_S

587.4K Downloads Updated 1 year ago

Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA to improve the helpfulness of LLM generated responses to user queries.

tools 70b

ollama run nemotron:70b-instruct-q4_K_S

curl http://localhost:11434/api/chat \
  -d '{
    "model": "nemotron:70b-instruct-q4_K_S",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='nemotron:70b-instruct-q4_K_S',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'nemotron:70b-instruct-q4_K_S',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Details

Updated 1 year ago

1 year ago

0e32f30ad742 · 40GB ·

model

archllama

parameters70.6B

quantizationQ4_K_S

40GB

template

<|start_header_id|>system<|end_header_id|> {{ if .Tools }}You have access to the following functions

1.2kB

license

LLAMA 3.1 COMMUNITY LICENSE AGREEMENT Llama 3.1 Version Release Date: July 23, 2024 “Agreement”

7.6kB

license

# Llama 3.1 Acceptable Use Policy Meta is committed to promoting safe and fair use of its tools and

4.7kB

params

{ "stop": [ "<|start_header_id|>", "<|end_header_id|>", "<|eot_id|>"

96B

Readme

Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA to improve the helpfulness of LLM generated responses to user queries.

This model was trained using RLHF (specifically, REINFORCE), Llama-3.1-Nemotron-70B-Reward and HelpSteer2-Preference prompts on a Llama-3.1-70B-Instruct model as the initial policy.

$ ollama run nemotron
>>> How many r in strawberry?
A sweet question!

Let's count the "R"s in "strawberry":

1. S
2. T
3. R
4. A
5. W
6. B
7. E
8. R
9. R
10. Y

There are **3** "R"s in the word "strawberry".

Reference

Hugging Face