
AI Image Generation

Create AI images with GPT Image, Gemini Nano Banana, FLUX, Imagen, and top providers using prompt engineering, style control, and smart editing.
Author: ivangdavila
Category: AI Intelligence · Source: clawhub · Version: v1.0.3 (latest, 2 versions) · API key: required
Stars: 11 · Downloads: 18,101 · Installs: 2,909

Overview

Setup

On first use, read setup.md.

When to Use

User needs AI-generated visuals, edits, or consistent image sets.

Use this skill to pick the right model, write stronger prompts, and avoid outdated model choices.

Architecture

User preferences persist in ~/image-generation/. See memory-template.md for setup.

~/image-generation/
├── memory.md      # Preferred providers, project context, winning recipes
└── history.md     # Optional generation log
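
As a minimal sketch of working with this layout, the check below reports which of the two files exist without creating or modifying anything. The function name `memory_status` is hypothetical; the actual contents of memory.md are defined by memory-template.md and are not assumed here.

```python
from pathlib import Path

# File names come from the tree above; history.md is optional.
EXPECTED_FILES = ("memory.md", "history.md")


def memory_status(base: Path) -> dict:
    """Report which expected files exist under `base` (read-only check)."""
    return {name: (base / name).is_file() for name in EXPECTED_FILES}
```

Calling `memory_status(Path.home() / "image-generation")` inspects the default location; a missing memory.md suggests setup has not been run yet.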

Quick Reference

| Topic | File |
| --- | --- |
| Initial setup | setup.md |
| Memory template | memory-template.md |
| Migration guide | migration.md |
| Benchmark snapshots | benchmarks-2026.md |
| Prompt techniques | prompting.md |
| API handling | api-patterns.md |
| GPT Image (OpenAI) | gpt-image.md |
| Gemini and Imagen (Google) | gemini.md |
| FLUX (Black Forest Labs) | flux.md |
| Midjourney | midjourney.md |
| Leonardo | leonardo.md |
| Ideogram | ideogram.md |
| Replicate | replicate.md |
| Stable Diffusion | stable-diffusion.md |

Core Rules

1. Resolve aliases to official model IDs first

Community names shift quickly. Before calling an API, map the nickname to the provider model ID.

| Community label | Official model ID to try first | Notes |
| --- | --- | --- |
| Nano Banana | gemini-2.5-flash-image-preview | Common nickname, not an official Google model ID |
| Nano Banana 2 / Pro | Verify provider docs | Usually a provider preset over Gemini image models |
| GPT Image 1.5 | gpt-image-1.5 | Current OpenAI high-tier image model |
| GPT Image mini / iMini | gpt-image-1-mini | Budget/faster OpenAI variant |
| FLUX 2 Pro / Max | flux-pro / flux-ultra | Many platforms rename these SKUs |
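
The alias table above could be encoded as a small lookup, sketched here under stated assumptions. The names `ALIASES`, `NEEDS_VERIFICATION`, and `resolve_model_id` are hypothetical, and pairing "FLUX 2 Pro" with flux-pro and "FLUX 2 Max" with flux-ultra is one reading of the last table row, not something the table states outright. IDs should still be checked against current provider docs.

```python
# Hypothetical alias map built from the table above (keys are lowercased labels).
ALIASES = {
    "nano banana": "gemini-2.5-flash-image-preview",
    "gpt image 1.5": "gpt-image-1.5",
    "gpt image mini": "gpt-image-1-mini",
    "imini": "gpt-image-1-mini",
    # Assumption: Pro -> flux-pro, Max -> flux-ultra; platforms often rename these.
    "flux 2 pro": "flux-pro",
    "flux 2 max": "flux-ultra",
}

# Labels the table says have no single ID: the caller must check provider docs.
NEEDS_VERIFICATION = {"nano banana 2", "nano banana pro"}


def resolve_model_id(label: str) -> str:
    """Map a community nickname to a model ID; pass unknown labels through."""
    key = " ".join(label.lower().split())  # normalize case and whitespace
    if key in NEEDS_VERIFICATION:
        raise LookupError(f"{label!r} is provider-specific; verify provider docs")
    # Unknown labels are returned unchanged, since they may already be official IDs.
    return ALIASES.get(key, label)
```

Raising on the "verify provider docs" labels, rather than guessing, keeps a wrong ID from reaching the API and burning retries.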

2. Pick models by task, not by hype

| Task | First choice | Backup |
| --- | --- | --- |
| Exact text in image | gpt-image-1.5 | Ideogram |
| Multi-turn edits | gemini-2.5-flash-image-preview | flux-kontext-pro |
| Photoreal hero shots | imagen-4.0-ultra-generate-001 | flux-ultra |
| Fast low-cost drafts | gpt-image-1-mini | imagen-4.0-fast-generate-001 |
| Character/product consistency | flux-kontext-max | gpt-image-1.5 with references |
| Local no-API workflows | flux-schnell | SDXL |
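
One way to apply the task table is a plain routing lookup, sketched below. The task keys (`exact_text`, `multi_turn_edits`, and so on) and the `pick_model` function are hypothetical names introduced for illustration; the values are copied from the table, so entries such as "Ideogram" and "SDXL" are product names rather than API model IDs.

```python
from typing import Iterable, NamedTuple


class Choice(NamedTuple):
    first: str
    backup: str


# Values copied from the table above; keys are hypothetical task names.
ROUTES = {
    "exact_text": Choice("gpt-image-1.5", "Ideogram"),
    "multi_turn_edits": Choice("gemini-2.5-flash-image-preview", "flux-kontext-pro"),
    "photoreal_hero": Choice("imagen-4.0-ultra-generate-001", "flux-ultra"),
    "fast_drafts": Choice("gpt-image-1-mini", "imagen-4.0-fast-generate-001"),
    "consistency": Choice("flux-kontext-max", "gpt-image-1.5 with references"),
    "local_no_api": Choice("flux-schnell", "SDXL"),
}


def pick_model(task: str, unavailable: Iterable[str] = ()) -> str:
    """Return the first choice for a task, or the backup if it is unavailable."""
    blocked = set(unavailable)
    for candidate in ROUTES[task]:  # iterates first, then backup
        if candidate not in blocked:
            return candidate
    raise RuntimeError(f"no available model for task {task!r}")
```

For example, `pick_model("exact_text", unavailable=["gpt-image-1.5"])` returns the backup for that row.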

3. Use benchmark tables as dated snapshots

Benchmarks drift weekly. Use benchmarks-2026.md as a starting point, then recheck current rankings when quality is critical.

4. Draft cheap, finish expensive

Start with 1-4 low-cost drafts, pick one, then upscale or rerender only the winner.
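
The draft-then-finish flow can be sketched as a small function. Everything here is an assumption made for illustration: `generate` is a caller-supplied callable standing in for whatever provider client is in use, `choose` returns the index of the winning draft, and the default model IDs are simply the budget and high-tier entries from the tables above.

```python
from typing import Any, Callable, List, Optional


def draft_then_finish(
    prompt: str,
    generate: Callable[[str, str, Optional[Any]], Any],  # (model, prompt, reference)
    choose: Callable[[List[Any]], int],                   # returns winning draft index
    draft_model: str = "gpt-image-1-mini",
    final_model: str = "gpt-image-1.5",
    n_drafts: int = 4,
) -> Any:
    """Render 1-4 cheap drafts, then rerender only the chosen one at high quality."""
    if not 1 <= n_drafts <= 4:
        raise ValueError("n_drafts must be between 1 and 4")
    drafts = [generate(draft_model, prompt, None) for _ in range(n_drafts)]
    winner = drafts[choose(drafts)]
    # Only the winner is passed to the expensive model, as a reference image.
    return generate(final_model, prompt, winner)
```

The expensive model is called exactly once per prompt, which is the point of the rule: cost scales with the number of cheap drafts, not with the number of final renders.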

5. Keep a fallback chain

If the preferred model is unavailable, fall back by tier:

1) same provider, lower tier; 2) cross-provider equivalent; 3) local/open model.
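
A fallback chain of this shape might look like the sketch below. The `ModelUnavailable` exception, the `generate` callable, and the specific models in `EXAMPLE_CHAIN` are hypothetical; the chain is just one ordering that follows the three tiers, and a real client would raise its own error types.

```python
from typing import Any, Callable, Sequence, Tuple


class ModelUnavailable(Exception):
    """Hypothetical error a provider wrapper raises when a model cannot be used."""


# Illustrative chain: preferred, same-provider lower tier, cross-provider, local/open.
EXAMPLE_CHAIN = [
    "gpt-image-1.5",
    "gpt-image-1-mini",
    "imagen-4.0-fast-generate-001",
    "flux-schnell",
]


def generate_with_fallback(
    prompt: str,
    chain: Sequence[str],
    generate: Callable[[str, str], Any],  # (model, prompt) -> image
) -> Tuple[str, Any]:
    """Try each model in order; return (model_used, image) from the first success."""
    errors = []
    for model in chain:
        try:
            return model, generate(model, prompt)
        except ModelUnavailable as exc:
            errors.append(f"{model}: {exc}")
    raise ModelUnavailable("all models failed: " + "; ".join(errors))
```

Returning the model that actually produced the image makes it easy to log when a fallback was used.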

6. Treat DALL-E as legacy

OpenAI lists DALL-E 2/3 as legacy. Do not use them as the default for new projects.

Common Traps

  • Using vendor nicknames as model IDs -> API errors and wasted retries
  • Assuming "Nano Banana Pro" or "FLUX 2" are universal IDs -> provider mismatch
  • Copying old DALL-E prompt habits -> weaker output vs modern GPT/Gemini image models
  • Comparing text-to-image and image-editing scores as if they were the same benchmark
  • Optimizing every draft at max quality -> cost spikes without quality gain

Security & Privacy

Data that leaves your machine:

  • Prompt text
  • Reference images when editing or style matching

Data that stays local:

  • Provider preferences in ~/image-generation/memory.md
  • Optional local history file

This skill does NOT:

  • Store API keys
  • Upload files outside chosen provider requests
  • Persist generated images unless the user asks to save them

External Endpoints

| Provider | Endpoint | Data Sent | Purpose |
| --- | --- | --- | --- |
| OpenAI | api.openai.com | Prompt text, optional input images | GPT Image generation/editing |
| Google Gemini API | generativelanguage.googleapis.com | Prompt text, optional input images | Gemini image generation/editing |
| Google Vertex AI | aiplatform.googleapis.com | Prompt text, optional input images | Imagen 4 generation |
| Black Forest Labs | api.bfl.ai | Prompt text, optional input images | FLUX generation/editing |
| Replicate | api.replicate.com | Prompt text, optional input images | Hosted third-party image models |
| Midjourney | discord.com | Prompt text | Midjourney generation via Discord workflows |
| Leonardo | cloud.leonardo.ai | Prompt text, optional input images | Leonardo generation/editing |
| Ideogram | api.ideogram.ai | Prompt text | Typography-focused image generation |

No other data is sent externally.
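
To enforce this list in a client wrapper, a host allowlist check is one option. This is a sketch, not part of the skill: `ALLOWED_HOSTS` and `is_allowed_endpoint` are hypothetical names, and the match is exact, so any regional or alternate hostnames a provider uses would have to be added explicitly.

```python
from urllib.parse import urlsplit

# Hosts copied from the External Endpoints table; exact matches only.
ALLOWED_HOSTS = {
    "api.openai.com",
    "generativelanguage.googleapis.com",
    "aiplatform.googleapis.com",
    "api.bfl.ai",
    "api.replicate.com",
    "discord.com",
    "cloud.leonardo.ai",
    "api.ideogram.ai",
}


def is_allowed_endpoint(url: str) -> bool:
    """Return True only for HTTPS URLs whose host is on the allowlist."""
    parts = urlsplit(url)
    return parts.scheme == "https" and parts.hostname in ALLOWED_HOSTS
```

Comparing the parsed hostname rather than doing a substring match avoids accepting look-alike hosts such as `api.openai.com.evil.example`.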

Migration

If upgrading from a previous version, read migration.md before updating local memory structure.

Trust

This skill may send prompts and reference images to third-party AI providers.

Only install if you trust those providers with your content.

Related Skills

Install with clawhub install if the user confirms:

  • image-edit - Specialized inpainting, outpainting, and mask workflows
  • video-generation - Convert image concepts into video pipelines
  • colors - Build palettes for visual consistency across assets
  • ffmpeg - Post-process image sequences and exports

Feedback

  • If useful: clawhub star image-generation
  • Stay updated: clawhub sync

Version History

2 versions in total

  • v1.0.3 (current)
    2026-03-28 00:05 · Safe
  • v1.0.2
    2026-03-07 11:32

Security Scans

Tencent Cloud Security (Keen)

Safe, no risk

Tencent Cloud Security (Sanbu)

Safe, no risk
