当前位置：首页 > news >正文

Diffusers 使用 LoRA

news 2025/6/30 10:56:24

使用diffusers 加载 LoRA，实现文生图功能。摘自 diffusers文档。
模型可以根据名称去modelscope找对应资源下载。使用的时候需要替换成具体路径。虽然modelscope和diffusers都使用了模型id，但是并不能通用。
不同的LoRA对应了不同的“trigger” words，在prompt中加入这个“trigger” words才能生成正确的结果。
比如使用了toy-face的LoRA模型，那么就要在prompt加入“toy face”。

from diffusers import DiffusionPipeline
import torchpipe_id = "stabilityai/stable-diffusion-xl-base-1.0"
pipe = DiffusionPipeline.from_pretrained(pipe_id, torch_dtype=torch.float16).to("cuda")#模型下载
#from modelscope import snapshot_download
#model_dir = snapshot_download('SDXL-LoRA/CiroN2022-toy-face')
#加载lora
pipe.load_lora_weights("CiroN2022/toy-face", weight_name="toy_face_sdxl.safetensors", adapter_name="toy")
# 加入 “toy face”
prompt = "toy_face of a hacker with a hoodie"
lora_scale = 0.9
image = pipe(prompt, num_inference_steps=30, cross_attention_kwargs={"scale": lora_scale}, generator=torch.manual_seed(0)
).images[0]
image

也可以再加载另一个模型，pipe.set_adapters() 函数确定使用哪个模型。不同的模型根据adapter_name来区分。

pipe.load_lora_weights("nerijs/pixel-art-xl", weight_name="pixel-art-xl.safetensors", adapter_name="pixel")
pipe.set_adapters("pixel")prompt = "a hacker with a hoodie, pixel art"
image = pipe(prompt, num_inference_steps=30, cross_attention_kwargs={"scale": lora_scale}, generator=torch.manual_seed(0)
).images[0]
image

也可以同时使用，adapter_weights设置不同的权重，prompt也应该包括全部的“trigger” words。

pipe.set_adapters(["pixel", "toy"], adapter_weights=[0.5, 1.0])
prompt = "toy_face of a hacker with a hoodie, pixel art"
image = pipe(prompt, num_inference_steps=30, cross_attention_kwargs={"scale": 1.0}, generator=torch.manual_seed(0)
).images[0]
image

在这里插入图片描述

更准确的控制LoRA的影响强度。unet一般包括down、mid、up，不同的部分对图片的细节、纹理、风格等影响不同。

pipe.enable_lora()  # enable lora again, after we disabled it above
prompt = "toy_face of a hacker with a hoodie, pixel art"
adapter_weight_scales = { "unet": { "down": 1, "mid": 0, "up": 0} }
pipe.set_adapters("pixel", adapter_weight_scales)
image = pipe(prompt, num_inference_steps=30, generator=torch.manual_seed(0)).images[0]
image

从左到右分别对应{ “down”: 1, “mid”: 0, “up”: 0}，{ “down”: 0, “mid”: 1, “up”: 0}，{ “down”: 0, “mid”: 0, “up”: 1}
在这里插入图片描述

更细粒度的控制。要根据具体的模型结构，不同的模型结构不同。

adapter_weight_scales_toy = 0.5
adapter_weight_scales_pixel = {"unet": {"down": 0.9,  # all transformers in the down-part will use scale 0.9# "mid"  # because, in this example, "mid" is not given, all transformers in the mid part will use the default scale 1.0"up": {"block_0": 0.6,  # all 3 transformers in the 0th block in the up-part will use scale 0.6"block_1": [0.4, 0.8, 1.0],  # the 3 transformers in the 1st block in the up-part will use scales 0.4, 0.8 and 1.0 respectively}}
}
pipe.set_adapters(["toy", "pixel"], [adapter_weight_scales_toy, adapter_weight_scales_pixel])
image = pipe(prompt, num_inference_steps=30, generator=torch.manual_seed(0)).images[0]
image

在这里插入图片描述

常用函数

#不使用
pipe.disable_lora()
#使用
pipe.enable_lora()
#正在使用的
pipe.get_active_adapters()
#列表
pipe.get_list_adapters()
#删除
pipe.delete_adapters("toy")

查看全文

http://www.lryc.cn/news/517117.html

云安全博客阅读（二）

SpringCloud系列教程：微服务的未来（六）docker教程快速入门、常用命令

Vue 快速入门：开启前端新征程

UVM:uvm_component methods configure

LLM 训练中存储哪些矩阵：权重矩阵，梯度矩阵，优化器状态

大模型思维链推理的进展、前沿和未来分析

NLP 技术的突破与未来：从词嵌入到 Transformer

嵌入式中QT实现文本与线程控制方法

云备份项目--服务端编写

Node.js——fs（文件系统）模块

SAP BC 同服务器不同client之间的传输SCC1

CentOS: RPM安装、YUM安装、编译安装（详细解释+实例分析！！！）

Three.js 基础概念：构建3D世界的核心要素

如何用代码提交spark任务并且获取任务权柄

关于Mac中的shell

【npm依赖包介绍】借助rimraf依赖包，在用npm run build构建项目时，清空dist目录，避免新旧混合

爬虫学习记录

Java Spring Boot实现基于URL + IP访问频率限制

C4D2025 win版本安装完无法打开，提示请将你的maxon App更新至最新版本，如何解决

微信小程序实现登录注册

SpringBoot环境和Maven配置

大语言模型训练所需的最低显存，联邦大语言模型训练的传输优化技术

1.07 标准IO

恒压恒流原边反馈控制芯片 CRE6289F

Java中线程中断的几种方式，你了解吗？

Tesseract5.4.0自定义LSTM训练

相关文章：