MiniCPM-MoE-8x2B

Repository: https://modelscope.cn/models/OpenBMB/MiniCPM-MoE-8x2B
License: other

Introduction

MiniCPM-MoE-8x2B is a decoder-only, transformer-based generative language model.

It adopts a Mixture-of-Experts (MoE) architecture with 8 experts per layer, of which 2 are activated for each token.
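The routing can be pictured with a minimal top-2 gating sketch. This is an illustrative simplification, not the model's actual implementation: the dimensions are placeholders and the expert feed-forward blocks are reduced to plain two-layer MLPs.

import torch
import torch.nn as nn
import torch.nn.functional as F

class Top2MoELayer(nn.Module):
    """Illustrative MoE feed-forward layer: 8 experts, 2 active per token."""
    def __init__(self, d_model=1024, d_ff=4096, num_experts=8, top_k=2):
        super().__init__()
        self.gate = nn.Linear(d_model, num_experts, bias=False)   # router
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.SiLU(), nn.Linear(d_ff, d_model))
            for _ in range(num_experts)
        )
        self.top_k = top_k

    def forward(self, x):                                     # x: (num_tokens, d_model)
        scores = F.softmax(self.gate(x), dim=-1)               # (num_tokens, num_experts)
        weights, idx = scores.topk(self.top_k, dim=-1)         # keep the 2 best experts per token
        weights = weights / weights.sum(dim=-1, keepdim=True)  # renormalize the 2 gate weights
        out = torch.zeros_like(x)
        for k in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = idx[:, k] == e                          # tokens whose k-th choice is expert e
                if mask.any():
                    out[mask] += weights[mask, k].unsqueeze(-1) * expert(x[mask])
        return out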

Usage

This model version has been instruction-tuned but has not undergone RLHF. The chat template is applied automatically.

from modelscope import AutoModelForCausalLM, AutoTokenizer
import torch

torch.manual_seed(0)  # fix the random seed for reproducible sampling

path = 'openbmb/MiniCPM-MoE-8x2B'
tokenizer = AutoTokenizer.from_pretrained(path)
# Load the weights in bfloat16 on the GPU; trust_remote_code is required
# because the model ships custom modeling code.
model = AutoModelForCausalLM.from_pretrained(path, torch_dtype=torch.bfloat16, device_map='cuda', trust_remote_code=True)

# model.chat applies the chat template and returns the reply plus the updated history.
responds, history = model.chat(tokenizer, "山东省最高的山是哪座山, 它比黄山高还是矮?差距多少?", temperature=0.8, top_p=0.8)
print(responds)

Note

  1. You can also run inference with vLLM, which is compatible with this repo and offers much higher inference throughput (see the sketch after this list).
  2. The model weights in this repo are stored in bfloat16. Manual conversion is needed for other dtypes.
  3. For more details, please refer to our GitHub repo.
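A minimal vLLM sketch is shown below. It assumes a vLLM build that supports this architecture; the prompt, sampling settings, and model path are taken from the example above, and vLLM's plain generate call may require applying the chat template to the prompt manually.

from vllm import LLM, SamplingParams

# Assumes a vLLM version with support for this MoE architecture.
llm = LLM(model='openbmb/MiniCPM-MoE-8x2B',
          trust_remote_code=True,
          dtype='bfloat16')                      # weights are shipped in bfloat16
params = SamplingParams(temperature=0.8, top_p=0.8, max_tokens=512)
# generate() does not apply the chat template automatically; wrap the prompt
# with the template first if the raw-prompt output looks off.
outputs = llm.generate(["山东省最高的山是哪座山, 它比黄山高还是矮?差距多少?"], params)
print(outputs[0].outputs[0].text)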

Statement

  1. As a language model, MiniCPM-MoE-8x2B generates content by learning from a vast amount of text.
  2. However, it does not possess the ability to comprehend or express personal opinions or value judgments.
  3. Any content generated by MiniCPM-MoE-8x2B does not represent the viewpoints or positions of the model developers.
  4. Therefore, when using content generated by MiniCPM-MoE-8x2B, users should take full responsibility for evaluating and verifying it on their own.