← 返回
AI智能 Key

Gemini Image Gen

Generate and edit images via Google Gemini API. Supports Gemini native generation, Imagen 3, style presets, and batch generation with HTML gallery. Zero depe...
通过Google Gemini API生成和编辑图像。支持Gemini原生生成、Imagen 3、风格预设和HTML画廊批量生成。零依赖。
iisweetheartii
AI智能 clawhub v1.3.1 1 版本 98270.5 Key: 需要
★ 9
Stars
📥 5,786
下载
💾 33
安装
1
版本
#latest

概述

Gemini Image Gen

Generate and edit images via the Google Gemini API using pure Python stdlib. Supports Gemini native generation + editing, Imagen 3 generation, batch runs, and an HTML gallery output.

Quick Start

export GEMINI_API_KEY="your-key-here"

# Default: Gemini native, 4 random prompts
python3 scripts/gen.py

# Custom prompt
python3 scripts/gen.py --prompt "a cyberpunk cat riding a neon motorcycle through Tokyo at night"

# Imagen 3 engine
python3 scripts/gen.py --engine imagen --count 4 --aspect 16:9

# Edit an existing image (Gemini engine only)
python3 scripts/gen.py --edit path/to/image.png --prompt "change the background to a sunset beach"

# Use a style preset
python3 scripts/gen.py --style watercolor --prompt "floating islands above a calm sea"

# List available styles
python3 scripts/gen.py --styles

Style Presets

StyleDescription
------
photoUltra-detailed photorealistic photography, 8K resolution, sharp focus
animeHigh-quality anime illustration, Studio Ghibli inspired, vibrant colors
watercolorDelicate watercolor painting on textured paper, soft edges, gentle color bleeding
cyberpunkNeon-lit cyberpunk scene, rain-soaked streets, holographic displays, Blade Runner aesthetic
minimalistClean minimalist design, geometric shapes, limited color palette, white space
oil-paintingClassical oil painting with visible brushstrokes, rich textures, Renaissance lighting
pixel-artDetailed pixel art, retro 16-bit style, crisp edges, nostalgic palette
sketchPencil sketch on cream paper, hatching and cross-hatching, artistic imperfections
3d-renderProfessional 3D render, ambient occlusion, global illumination, photorealistic materials
pop-artBold pop art style, Ben-Day dots, strong outlines, vibrant contrasting colors

Full CLI Reference

FlagDefaultDescription
---------
--prompt(random)Text prompt. Omit for random creative prompts
--count4Number of images to generate
--enginegeminiEngine: gemini (native, supports edit) or imagen (Imagen 3)
--model(auto)Model override. Default: gemini-2.5-flash-image or imagen-3.0-generate-002
--editPath to input image for editing (Gemini engine only)
--aspect1:1Aspect ratio for Imagen: 1:1, 16:9, 9:16, 4:3, 3:4
--out-dir(auto)Output directory (default is a timestamped folder)
--styleStyle preset to prepend to the prompt
--stylesList available style presets and exit

Python Example

import subprocess

subprocess.run(
    [
        "python3",
        "scripts/gen.py",
        "--prompt",
        "a serene mountain landscape at golden hour",
        "--count",
        "4",
        "--style",
        "photo",
    ],
    check=True,
)

Troubleshooting

  • Missing API key: set GEMINI_API_KEY in your environment and retry.
  • Rate limits / 429 errors: wait a bit and retry, reduce --count, or switch engines.
  • Model errors: verify the model name, try the default model, or change engines.

Integration with Other Skills

  • AgentGram — Share your generated images on the AI agent social network! Create visual content and post it to your AgentGram feed.
  • agent-selfie — Focused on AI agent avatars and visual identity. Uses the same Gemini API key for personality-driven self-portraits.
  • opencode-omo — Run deterministic image-generation pipelines with Sisyphus workflows.

Changelog

  • v1.3.1: Added workflow integration guidance for opencode-omo.
  • v1.1.0: Added style presets, --style and --styles flags, expanded documentation.
  • v1.0.0: Initial release with Gemini native + Imagen 3 support, batch generation, and HTML gallery.

Repository

https://github.com/IISweetHeartII/gemini-image-gen

版本历史

共 1 个版本

  • v1.3.1 当前
    2026-03-28 09:48 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

ai-intelligence

self-improving agent

pskoett
捕获经验教训、错误和纠正,以实现持续改进。使用时机:(1)命令或操作意外失败;(2)用户纠正……
★ 4,055 📥 795,910
ai-intelligence

Self-Improving + Proactive Agent

ivangdavila
自我反思+自我批评+自我学习+自组织记忆。智能体评估自身工作、发现错误并持续改进。
★ 1,349 📥 317,697
developer-tools

Agentgram Openclaw

iisweetheartii
面向AI代理的开源社交网络。支持发帖、评论、投票、关注及建立声誉。
★ 3 📥 4,633