Yi-1.5-34B-32K

Posted by an anonymous user on July 31, 2024
Categories: ai, llama, Pytorch
Open-source address: https://modelscope.cn/models/01ai/Yi-1.5-34B-32K
License: Apache License 2.0

GitHub • Discord • Twitter • WeChat
Paper • Tech Blog • FAQ • Learning Hub

Intro

Yi-1.5 is an upgraded version of Yi. It is continually pre-trained from Yi on a high-quality corpus of 500B tokens and then fine-tuned on 3M diverse samples.

Compared with Yi, Yi-1.5 delivers stronger performance in coding, math, reasoning, and instruction-following capability, while still maintaining excellent capabilities in language understanding, commonsense reasoning, and reading comprehension.

| Model | Context Length | Pre-trained Tokens |
| :------------: | :------------: | :------------: |
| Yi-1.5 | 4K, 16K, 32K | 3.6T |
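
As a rough illustration of what the context-length column means in practice, the sketch below checks whether a prompt plus the requested generation length fits a given variant. It assumes that "K" means 1,024 tokens (so 32K is 32,768), which the table does not state. The names `CONTEXT_WINDOWS`, `fits_in_context`, and `smallest_variant` are hypothetical helpers for this example, not part of any Yi tooling.

```python
from typing import Optional

# Assumption: "K" in the table means 1,024 tokens; the source does not say so.
CONTEXT_WINDOWS = {"4K": 4 * 1024, "16K": 16 * 1024, "32K": 32 * 1024}


def fits_in_context(prompt_tokens: int, max_new_tokens: int, variant: str = "32K") -> bool:
    """Return True if prompt plus generated tokens fit the variant's window."""
    return prompt_tokens + max_new_tokens <= CONTEXT_WINDOWS[variant]


def smallest_variant(prompt_tokens: int, max_new_tokens: int) -> Optional[str]:
    """Return the smallest listed context length that fits, or None if none does."""
    for name, window in sorted(CONTEXT_WINDOWS.items(), key=lambda kv: kv[1]):
        if prompt_tokens + max_new_tokens <= window:
            return name
    return None
```

For instance, a 4,000-token prompt with 200 new tokens exceeds 4,096 tokens under this assumption, so the helper would point to the 16K variant instead.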

Models

  • Chat models

| Name | Download |
| --------------- | --------------- |
| Yi-1.5-34B-Chat | [Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) • [ModelScope](https://www.modelscope.cn/organization/01ai) • [wisemodel](https://wisemodel.cn/organization/01.AI) |
| Yi-1.5-34B-Chat-16K | [Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) • [ModelScope](https://www.modelscope.cn/organization/01ai) • [wisemodel](https://wisemodel.cn/organization/01.AI) |
| Yi-1.5-9B-Chat | [Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) • [ModelScope](https://www.modelscope.cn/organization/01ai) • [wisemodel](https://wisemodel.cn/organization/01.AI) |
| Yi-1.5-9B-Chat-16K | [Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) • [ModelScope](https://www.modelscope.cn/organization/01ai) • [wisemodel](https://wisemodel.cn/organization/01.AI) |
| Yi-1.5-6B-Chat | [Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) • [ModelScope](https://www.modelscope.cn/organization/01ai) • [wisemodel](https://wisemodel.cn/organization/01.AI) |

  • Base models

| Name | Download |
| ---------- | ---------- |
| Yi-1.5-34B | [Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) • [ModelScope](https://www.modelscope.cn/organization/01ai) • [wisemodel](https://wisemodel.cn/organization/01.AI) |
| Yi-1.5-34B-32K | [Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) • [ModelScope](https://www.modelscope.cn/organization/01ai) • [wisemodel](https://wisemodel.cn/organization/01.AI) |
| Yi-1.5-9B | [Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) • [ModelScope](https://www.modelscope.cn/organization/01ai) • [wisemodel](https://wisemodel.cn/organization/01.AI) |
| Yi-1.5-9B-32K | [Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) • [ModelScope](https://www.modelscope.cn/organization/01ai) • [wisemodel](https://wisemodel.cn/organization/01.AI) |
| Yi-1.5-6B | [Hugging Face](https://huggingface.co/collections/01-ai/yi-15-2024-05-663f3ecab5f815a3eaca7ca8) • [ModelScope](https://www.modelscope.cn/organization/01ai) • [wisemodel](https://wisemodel.cn/organization/01.AI) |

Benchmarks

  • Chat models

    Yi-1.5-34B-Chat matches or outperforms larger models on most benchmarks.

    Yi-1.5-9B-Chat is the top performer among similarly sized open-source models.

  • Base models

    Yi-1.5-34B matches or outperforms larger models on some benchmarks.

    Yi-1.5-9B is the top performer among similarly sized open-source models.

Quick Start

To get up and running with Yi-1.5 models quickly, see the README.
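
The README remains the authoritative guide. As a minimal sketch of what loading this base model might look like with the Hugging Face `transformers` library, consider the following. The repo id `01-ai/Yi-1.5-34B-32K` is an assumption inferred from the collection links on this page, and `complete` is a hypothetical helper. Because Yi-1.5-34B-32K is a base model rather than a chat model, the sketch feeds plain text instead of a chat template.

```python
# Assumption: Hugging Face repo id inferred from the collection links, not confirmed here.
MODEL_ID = "01-ai/Yi-1.5-34B-32K"


def complete(prompt: str, max_new_tokens: int = 64) -> str:
    """Generate a plain-text continuation of `prompt` with the base model."""
    # Imported lazily so the module can be inspected without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    # device_map="auto" needs the `accelerate` package and spreads the weights
    # across the available devices; torch_dtype="auto" uses the checkpoint's dtype.
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID, device_map="auto", torch_dtype="auto"
    )

    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Keep only the newly generated tokens, dropping the echoed prompt.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `complete("There's a place where time stands still.")` would download the full checkpoint on first use, so a 34B model needs substantial disk space and accelerator memory; the smaller 6B or 9B base models may be a more practical first test.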
