deepseek-llm

1.1M Downloads Updated 2 years ago

An advanced language model crafted with 2 trillion bilingual tokens.

7b 67b

ollama run deepseek-llm

curl http://localhost:11434/api/chat \
  -d '{
    "model": "deepseek-llm",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='deepseek-llm',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'deepseek-llm',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Models

View all →

Name

64 models

Size / Usage

Context

Input

deepseek-llm:latest

4.0GB · 4K context window · Text · 2 years ago

deepseek-llm:latest

4.0GB

Text

deepseek-llm:7b

latest

4.0GB · 4K context window · Text · 2 years ago

deepseek-llm:7b latest

4.0GB

Text

deepseek-llm:67b

38GB · 4K context window · Text · 2 years ago

deepseek-llm:67b

38GB

Text

Readme

DeepSeek LLM is an advanced language model available in both 7 billion and 67 billion parameters. Both a chat and base variation are available.

Superior General Capabilities: DeepSeek LLM 67B Base outperforms Llama2 70B Base in areas such as reasoning, coding, math, and Chinese comprehension.
Proficient in Coding and Math: DeepSeek LLM 67B Chat exhibits outstanding performance in coding (using the HumanEval benchmark) and mathematics (using the GSM8K benchmark).

References

GitHub

HuggingFace