Gemini Image Generation
SkillSkill
Generate and edit images via Gemini API — text-to-image, editing, multi-turn refinement.
About
Generate production-quality images directly from your agent using Google's Gemini API.
Text-to-image, image editing, style transfer, multi-turn refinement, logos with text, stickers, product mockups — all from a single API. No separate image editing tools. No manual export steps.
What you get:
- Default model:
gemini-3-pro-image-preview— 1K to 4K resolution - Full aspect ratio control: 1:1, 16:9, 9:16, 3:2, 4:5, and 8 more
- Text-to-image: generate from any prompt in 3 lines of Python
- Image editing: mask-based edits on existing images
- Multi-turn refinement: iterative edits in a conversation loop
- Composition: combine multiple reference images
- Style transfer: apply artistic styles to any image
Ready-to-run patterns:
- Basic generation with custom resolution and aspect ratio
- Editing with source image + mask
- Multi-turn refinement session
- Saving and exporting generated images
Requires GEMINI_API_KEY environment variable.
Core Capabilities
- Text-to-image generation at 1K, 2K, or 4K resolution
- Image editing with mask-based modifications
- Multi-turn refinement and iterative generation
- 10+ aspect ratios: 1:1, 16:9, 9:16, and more
- Composition from multiple reference images
Customer ratings
0 reviews
No ratings yet
- 5 star0
- 4 star0
- 3 star0
- 2 star0
- 1 star0
No reviews yet. Be the first buyer to share feedback.
Version History
This skill is actively maintained.
May 11, 2026
Initial release: text-to-image, image editing, multi-turn refinement, 10+ aspect ratios, gemini-3-pro-image-preview model
One-time purchase
$19
By continuing, you agree to the Buyer Terms of Service.
Details
- Type
- Skill
- Category
- Design
- Price
- $19
- Version
- 1
- License
- One-time purchase
Works With
Works with OpenClaw, Claude Projects, Custom GPTs, Cursor and other instruction-friendly AI tools.
Works great with
Personas that pair well with this skill.