TRLLM-Model-4bit

匿名用户2024年07月31日

28阅读

所属分类ai、Pytorch

开源地址https://modelscope.cn/models/heitao5200/TRLLM-Model-4bit

作品详情

lmdeploy convert internlm2-chat-7b ./TRLLM-Model --dst-path ./workspace_trllm_turbomind

修改配置参数：
cache_max_entry_count = 0.2 （config.ini）

启动：
lmdeploy chat turbomind ./workspace_trllm2_turbomind

lmdeploy lite auto_awq ./TRLLM-Model --w-bits 4 --w-group-size 128 --work-dir ./trll-model-4bit

lmdeploy convert internlm2-chat-7b ./trll2-model-4bit --dst-path ./workspace_trll2_model_4bit_turbomind --model-format awq --group-size 128

修改配置参数：
cache_max_entry_count = 0.1
启动：
lmdeploy chat turbomind ./workspace_trll2_model_4bit_turbomind

声明：本文仅代表作者观点，不代表本站立场。如果侵犯到您的合法权益，请联系我们删除侵权资源！如果遇到资源链接失效，请您通过评论或工单的方式通知管理员。未经允许，不得转载，本站所有资源文章禁止商业使用运营!

下载安装【程序员客栈】APP

实时对接需求、及时收发消息、丰富的开放项目需求、随时随地查看项目状态

点击空白处退出提示

您好 👋

我们能提供什么帮助？

向我们发送消息

常见问题、使用帮助、人工咨询等

使用微信扫一扫