Repost: Modifying attn_processor in the UNet (for designing new attention modules), 优快云 blog
[Diffusion Models (4)] IP-Adapter source code walkthrough 2: the training core (cross-attention), 优快云 blog
Diagram of IP-Adapter:
IP-Adapter ImageEncoder code:
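# Requires: import torch; from PIL import Image
@torch.inference_mode()
def get_image_embeds(self, pil_image=None, clip_image_embeds=None):
    if pil_image is not None:
        # Wrap a single PIL image in a list so the processor receives a batch.
        if isinstance(pil_image, Image.Image):
            pil_image = [pil_image]
        # Preprocess with the CLIP image processor and take the pixel tensor.
        clip_image = self.clip_image_processor(images=pil_image, return_tensors="pt").pixel_values
        clip_ima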