deepseek-llm-67b-base

Anonymous user, July 31, 2024

Technical Information

Official website
https://www.deepseek.com/
Open-source repository
https://modelscope.cn/models/deepseek-ai/deepseek-llm-67b-base
License
other

Project Details

DeepSeek Chat

[Homepage] | [Chat with DeepSeek LLM] | [Discord] | [Wechat(微信)]


1. Introduction of DeepSeek LLM

Introducing DeepSeek LLM, an advanced language model comprising 67 billion parameters. It has been trained from scratch on a vast dataset of 2 trillion tokens in both English and Chinese. In order to foster research, we have made DeepSeek LLM 7B/67B Base and DeepSeek LLM 7B/67B Chat open source for the research community.

2. Model Summary

deepseek-llm-67b-base is a 67B parameter model with Grouped-Query Attention trained on 2 trillion tokens from scratch.
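
For readers unfamiliar with Grouped-Query Attention, the following is a minimal pure-Python sketch of the general idea: several query heads share one key/value head, so fewer keys and values need to be stored. The function names, head counts, and dimensions here are illustrative assumptions and do not reflect the actual configuration or implementation of deepseek-llm-67b-base.

```python
import math

def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def attend(q, keys, values):
    # Scaled dot-product attention for a single query vector.
    scale = 1.0 / math.sqrt(len(q))
    scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in keys]
    weights = softmax(scores)
    dim = len(values[0])
    return [sum(w * v[d] for w, v in zip(weights, values)) for d in range(dim)]

def grouped_query_attention(queries, keys, values):
    # queries: [n_q_heads][head_dim]
    # keys, values: [n_kv_heads][seq_len][head_dim]
    n_q_heads, n_kv_heads = len(queries), len(keys)
    assert n_q_heads % n_kv_heads == 0, "query heads must divide evenly into groups"
    group_size = n_q_heads // n_kv_heads
    # Query head h reads the key/value head of its group: h // group_size.
    return [
        attend(q, keys[h // group_size], values[h // group_size])
        for h, q in enumerate(queries)
    ]

# Toy sizes (assumed): 4 query heads, 2 key/value heads, 3 cached positions, head_dim 2.
queries = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
keys = [
    [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]],
    [[0.5, 0.5], [1.0, 0.0], [0.0, 2.0]],
]
values = [
    [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]],
    [[0.0, 1.0], [1.0, 0.0], [2.0, 2.0]],
]
outputs = grouped_query_attention(queries, keys, values)
print(len(outputs), "output heads computed from", len(keys), "key/value heads")
```

In this toy setup, query heads 0 and 1 form one group and heads 2 and 3 form another, so only two sets of keys and values are kept instead of four.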

3. How to Use

Here are some examples of how to use our model.

Text Completion

import torch
from modelscope import AutoTokenizer, AutoModelForCausalLM, GenerationConfig

model_name = "deepseek-ai/deepseek-llm-67b-base"
tokenizer = AutoTokenizer.from_pretrained(model_name)
# Load the weights in bfloat16 and let device_map="auto" place them on the available devices.
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16, device_map="auto")
model.generation_config = GenerationConfig.from_pretrained(model_name)
# Use the end-of-sequence token as the padding token during generation.
model.generation_config.pad_token_id = model.generation_config.eos_token_id

# The prompt is left unfinished so the base model continues it.
text = "An attention function can be described as mapping a query and a set of key-value pairs to an output, where the query, keys, values, and output are all vectors. The output is"
inputs = tokenizer(text, return_tensors="pt")
outputs = model.generate(**inputs.to(model.device), max_new_tokens=100)

# Decode the generated token IDs back into text, dropping special tokens.
result = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(result)
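
Before running the example above, it may help to estimate how much memory the weights alone need. The sketch below is back-of-the-envelope arithmetic, assuming the nominal figure of 67 billion parameters and 2 bytes per parameter for bfloat16. The helper name is hypothetical, and the estimate ignores activations, the key/value cache, and framework overhead.

```python
def weight_memory_gib(num_params, bytes_per_param):
    # Memory for the raw weights only, in GiB (2**30 bytes).
    return num_params * bytes_per_param / 2**30

NUM_PARAMS = 67e9   # nominal parameter count from the model summary
BF16_BYTES = 2      # bfloat16 stores each parameter in 2 bytes
FP32_BYTES = 4      # float32, shown for comparison

bf16_gib = weight_memory_gib(NUM_PARAMS, BF16_BYTES)
fp32_gib = weight_memory_gib(NUM_PARAMS, FP32_BYTES)
print(f"bfloat16 weights: about {bf16_gib:.1f} GiB")
print(f"float32 weights:  about {fp32_gib:.1f} GiB")
```

A footprint of this size generally will not fit on a single accelerator, which is one reason the example passes device_map="auto" to spread the weights across the available devices.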

4. License

This code repository is licensed under the MIT License. The use of DeepSeek LLM models is subject to the Model License. DeepSeek LLM supports commercial use.

See the LICENSE-MODEL for more details.

5. Contact

If you have any questions, please raise an issue or contact us at service@deepseek.com.
