Kolors-IP-Adapter-Plus

我要开发同款
匿名用户2024年07月31日
22阅读
开发技术Pytorch
所属分类ai、Kolors
开源地址https://modelscope.cn/models/Kwai-Kolors/Kolors-IP-Adapter-Plus
授权协议apache-2.0

作品详情

Kolors-IP-Adapter-Plus weights and inference code

? Introduction

We provide IP-Adapter-Plus weights and inference code based on Kolors-Basemodel. Examples of Kolors-IP-Adapter-Plus results are as follows:

Our improvements

  • A stronger image feature extractor. We employ the Openai-CLIP-336 model as the image encoder, which allows us to preserve more details in the reference images
  • More diverse and high-quality training data: We construct a large-scale and high-quality training dataset inspired by the data strategies of other works. We believe that paired training data can effectively improve performance.

? Evaluation

For evaluation, we create a test set consisting of over 200 reference images and text prompts. We invite several image experts to provide fair ratings for the generated results of different models. The experts rate the generated images based on four criteria: visual appeal, text faithfulness, image faithfulness, and overall satisfaction. Image faithfulness measures the semantic preservation ability of IP-Adapter on reference images, while the other criteria follow the evaluation standards of BaseModel. The specific results are summarized in the table below, where Kolors-IP-Adapter-Plus achieves the highest overall satisfaction score.

Model Average Overall Satisfaction Average Image Faithfulness Average Visual Appeal Average Text Faithfulness
SDXL-IP-Adapter-Plus 2.29 2.64 3.22 4.02
Midjourney-v6-CW 2.79 3.0 3.92 4.35
Kolors-IP-Adapter-Plus 3.04 3.25 4.45 4.30

The ip_scale parameter is set to 0.3 in SDXL-IP-Adapter-Plus, while Midjourney-v6-CW utilizes the default cw scale.


?️ Usage

Requirements

The dependencies and installation are basically the same as the Kolors-BaseModel.

  1. Repository Cloning and Dependency Installation
apt-get install git-lfs
git clone https://github.com/Kwai-Kolors/Kolors
cd Kolors
conda create --name kolors python=3.8
conda activate kolors
pip install -r requirements.txt
python3 setup.py install
  1. Weights download link: base model weights download
modelscope download --model=Kwai-Kolors/Kolors --local_dir weights/Kolors
modelscope download ---model=Kwai-Kolors/Kolors-IP-Adapter-Plus --local_dir weights/Kolors-IP-Adapter-Plus

or

git lfs clone https://modelscope.cn/models/Kwai-Kolors/Kolors-IP-Adapter-Plus
  1. Inference:
python ipadapter/sample_ipadapter_plus.py ./ipadapter/https://raw.githubusercontent.com/junqiangwu/Kolors/master/ipadapter/assert/test_ip.jpg "穿着黑色T恤衫,上面中文绿色大字写着“可图”"

python ipadapter/sample_ipadapter_plus.py ./ipadapter/https://raw.githubusercontent.com/junqiangwu/Kolors/master/ipadapter/assert/test_ip2.png "一只可爱的小狗在奔跑"

# The image will be saved to "scripts/outputs/"

Note

The IP-Adapter-FaceID model based on Kolors will also be released soon!

Acknowledgments

  • Thanks to IP-Adapter for providing the codebase.
声明:本文仅代表作者观点,不代表本站立场。如果侵犯到您的合法权益,请联系我们删除侵权资源!如果遇到资源链接失效,请您通过评论或工单的方式通知管理员。未经允许,不得转载,本站所有资源文章禁止商业使用运营!
下载安装【程序员客栈】APP
实时对接需求、及时收发消息、丰富的开放项目需求、随时随地查看项目状态

评论