匿名用户2024年07月31日
24阅读
所属分类aiPytorch
开源地址https://modelscope.cn/models/RWKV/v5-Eagle-7B
授权协议apache-2.0

作品详情

An eagle soaring above a transformer robot

Eagle 7B - in short

Eagle 7B is a 7.52B parameter model that:

  • Built on the RWKV-v5 architecture (a linear transformer with 10-100x+ lower inference cost)
  • Ranks as the world’s greenest 7B model (per token)
  • Trained on 1.1 Trillion Tokens across 100+ languages (70% English, 15% multi lang, 15% code)
  • Outperforms all 7B class models in multi-lingual benchmarks
  • Approaches Falcon (1.5T), LLaMA2 (2T), Mistral (>2T?) level of performance in English evals
  • Trade blows with MPT-7B (1T) in English evals
  • All while being an “Attention-Free Transformer”
  • Is a foundation model, with a very small instruct tune - further fine-tuning is required for various use cases!

Find out more at our model announcment: https://blog.rwkv.com/p/eagle-7b-soaring-past-transformers

Or our wiki: https://wiki.rwkv.com

声明:本文仅代表作者观点,不代表本站立场。如果侵犯到您的合法权益,请联系我们删除侵权资源!如果遇到资源链接失效,请您通过评论或工单的方式通知管理员。未经允许,不得转载,本站所有资源文章禁止商业使用运营!
下载安装【程序员客栈】APP
实时对接需求、及时收发消息、丰富的开放项目需求、随时随地查看项目状态

评论