An advanced language model crafted with 2 trillion bilingual tokens.
7b
67b
91.7K Pulls Updated 11 months ago
Updated 11 months ago
11 months ago
04a0bfbcef69 · 3.5GB
model
archllama
·
parameters6.91B
·
quantizationQ3_K_M
3.5GB
params
{"num_ctx":4096}
17B
template
{{ .System }}
User: {{ .Prompt }}
Assistant:
45B
Readme
DeepSeek LLM is an advanced language model available in both 7 billion and 67 billion parameters. Both a chat
and base
variation are available.
Superior General Capabilities: DeepSeek LLM 67B Base outperforms Llama2 70B Base in areas such as reasoning, coding, math, and Chinese comprehension.
Proficient in Coding and Math: DeepSeek LLM 67B Chat exhibits outstanding performance in coding (using the HumanEval benchmark) and mathematics (using the GSM8K benchmark).