Anthropic ClaudeLLM・AI開発⭐ リポ 0品質スコア 50/100

image-edit

Name: image-edit
Author: agentspace-so

RunComfy上で画像を編集するスマートルーターで、ユーザーの意図に応じてNano Banana Edit（最大20枚バッチ・同一性保持）、OpenAI GPT Image 2 Edit（多言語テキスト書き換え・複数参照合成）、Flux Kontext Pro（高精度ローカル編集）、Z-Image Turbo Inpaint（マスク指定領域編集）を自動で選択します。「画像編集」「背景差し替え」「オブジェクト削除」「i2i」などのリクエストをトリガーに、ローカルのRunComfy CLIを通じて最適なモデルを呼び出します。

description の原文を見る

> Edit images on RunComfy — this skill is a smart router that matches the user's intent to the right edit model in the RunComfy catalog. Picks Nano Banana Edit (batch up to 20, identity-preserving default), OpenAI GPT Image 2 Edit (multilingual in-image text rewrite, multi-ref composition, layout precision), Flux Kontext Pro (single-ref high-fidelity local edit), or Z-Image Turbo Inpaint (mask-driven precise region edit). Bundles each model's documented prompting patterns so the skill gets sharper edits without burning iterations on the wrong model. Calls `runcomfy run <vendor>/<model>/edit` through the local RunComfy CLI. Triggers on "image edit", "edit image", "image-to-image", "i2i", "swap background", "remove object", "rewrite headline", or any explicit ask to edit a single or batch of images.

SKILL.md 本文

Image Edit — Pro Pack on RunComfy

runcomfy.com · Nano Banana Edit · GPT Image 2 Edit · Flux Kontext · Z-Image Inpaint · GitHub

画像編集、意図に応じてルーティング。 このスキルは1つのモデルに限定されません。ユーザーが実際に何を望んでいるかに基づいて、RunComfyカタログから正しい編集モデルを選択します：バッチ身元保持、多言語テキスト書き換え、単一ショット正確編集、またはマスク駆動領域置換。

npx skills add agentspace-so/runcomfy-skills --skill image-edit -g

ユーザーの意図に合わせた正しいモデルを選択

ユーザー意図	モデル	理由
1～20枚の画像を一貫して編集（SKUギャラリー、A/Bバリエーション）	Nano Banana Edit	コールごとに最大20枚の入力画像；シリーズの縦横比/解像度がロック
背景を交換、被写体の身元を保持	Nano Banana Edit	「Xは変わらないままにする」というプロンプトで強い身元保持
ローカライズされたオブジェクト削除/追加と空間言語（「左のオブジェクト」「右上隅」）	Nano Banana Edit	方向的な空間スコープに対応
多言語/非ラテン文字の画像内テキスト書き換え（日本語仮名、キリル文字、アラビア文字）	GPT Image 2 Edit	多言語タイポグラフィで最高クラス
マルチリファレンス合成（img1から被写体、img2からシーン、img3からパレット）	GPT Image 2 Edit	番号付きリファレンスがキューを正しくルーティング
レイアウト精密な位置変更（「見出しを右上から左下中央に移動」）	GPT Image 2 Edit	方向言語がレイアウトレベルで尊重される
翻訳された見出しバリエーション全体で身元を保持	GPT Image 2 Edit	同じソースアセット→多数の言語バリエーション、身元は安定
シングルショット正確ローカル編集（「彼女はオレンジ色の傘を持っている」）	Flux Kontext Pro	シングルリファレンス単一命令、高忠実度保持
マスク駆動オブジェクト削除（ケーブル、ウォーターマーク、邪魔なもの）	Z-Image Turbo Inpaint	マスク必須、強度調整可能、エッジ一貫性あり
マスク駆動領域置換（マスク付き背景全体交換）	Z-Image Turbo Inpaint	高強度+クリーンマスク=クリーンな置換
未指定時のデフォルト	Nano Banana Edit	最も柔軟で、単一とバッチの両方に対応

エージェントはこのテーブルを読み、ユーザーの意図を分類し、以下の対応するサブセクションを選択します。

前提条件

RunComfy CLI — npm i -g @runcomfy/cli
RunComfyアカウント — runcomfy login
CI / コンテナ — RUNCOMFY_TOKEN=<token> を設定

ルート1：Nano Banana Edit — 一般的な編集+バッチのデフォルト

モデル: google/nano-banana-2/edit

スキーマ

フィールド	型	必須	デフォルト	備考
`prompt`	string	yes	—	保持の目標で始まり、変更で終わる。
`image_urls`	array	yes	—	1～20の公開可能なHTTPS URL。
`number_of_images`	int	no	1	コールごと1～4の出力。
`aspect_ratio`	enum	no	`auto`	`auto`は入力に従う；バッチ一貫性のためロック。
`resolution`	enum	no	`1K`	`0.5K` / `1K` / `2K` / `4K`。
`output_format`	enum	no	`png`	`png` / `jpeg` / `webp`。
`seed`	int	no	—	再現性。
`enable_web_search`	bool	no	false	Webベースの編集（追加レイテンシー）。

呼び出し

runcomfy run google/nano-banana-2/edit \
  --input '{
    "prompt": "Keep the subject identity, pose, and clothing unchanged. Convert the background into a rainy neon cyberpunk street.",
    "image_urls": ["https://.../portrait.jpg"]
  }' \
  --output-dir <absolute/path>

バッチ（縦横比+解像度をロック）:

runcomfy run google/nano-banana-2/edit \
  --input '{
    "prompt": "Replace the watermark in the bottom-right with the text \"AURA\" in clean white sans-serif. Keep everything else exactly as in the input.",
    "image_urls": ["https://.../sku-1.jpg", "https://.../sku-2.jpg", "https://.../sku-3.jpg"],
    "aspect_ratio": "1:1",
    "resolution": "1K"
  }' \
  --output-dir <absolute/path>

プロンプティングのコツ

保持を最初に：「[身元/ポーズ/ブランド/フレーミング]は変わらないままにする」 その後に変更を述べる。
空間スコープ：「背景のみ」「左のオブジェクト」「右上の象限」 — 具体的な場所が尊重される。
バッチ一貫性：バッチ全体で aspect_ratio と resolution をロック。
小さく繰り返す：複合編集を複数の短いパスに分割。

ルート2：GPT Image 2 Edit — 多言語テキスト+マルチリファレンス合成

モデル: openai/gpt-image-2/edit

スキーマ

フィールド	型	必須	デフォルト	備考
`prompt`	string	yes	—	編集命令；保持で始まる。
`images`	string[]	yes	—	最大10のHTTPS URL。最初がプライマリ；残りは補助。
`size`	enum	no	`auto`	`auto`, `1024_1024`, `1024_1536`, `1536_1024`。これらのみ。

呼び出し

多言語テキスト書き換え：

runcomfy run openai/gpt-image-2/edit \
  --input '{
    "prompt": "Keep the photograph, layout, and brand mark exactly as in the input. Replace only the in-image headline. The new headline reads \"今日のおすすめ\" in bold Japanese kana, same position and font weight.",
    "images": ["https://.../poster-en.jpg"]
  }' \
  --output-dir <absolute/path>

マルチリファレンス合成：

runcomfy run openai/gpt-image-2/edit \
  --input '{
    "prompt": "Compose subject from image 1 into the room from image 2. Match the lighting and color palette of image 2. Keep image 1 subject identity unchanged.",
    "images": ["https://.../subject.jpg", "https://.../room.jpg"]
  }' \
  --output-dir <absolute/path>

プロンプティングのコツ

画像内テキストを正確に引用。 非ラテン文字のスクリプトを名前で指定：「日本語仮名」、「キリル文字」、「アラビア語右から左」。
マルチリファレンスに番号を付ける：「被写体は画像1から、照明は画像2から」。
方向レイアウト言語：「見出しを右上から左下中央に移動」、「右下のウォーターマークを置き換え」。
size: "auto" は入力比を保持 — フレーミングが変わる編集でない限りお勧め。

ルート3：Flux Kontext Pro — シングルショット正確ローカル編集

モデル: blackforestlabs/flux-1-kontext/pro/edit

スキーマ（最小限）

フィールド	型	必須	備考
`prompt`	string	yes	1つの宣言的な編集命令。
`image`	string	yes	シングルソース画像URL。
`aspect_ratio`	enum	no	サポートされているW:H値から選択。
`seed`	int	no	再現性。

シングル画像のみ — 配列ではありません。マルチ画像フロー向けはルート1（Nano Banana Edit）を使用。

呼び出し

runcomfy run blackforestlabs/flux-1-kontext/pro/edit \
  --input '{
    "prompt": "Keep the person'\''s face, pose, and clothing unchanged. Add an orange umbrella in her left hand and a slight smile.",
    "image": "https://.../portrait.jpg"
  }' \
  --output-dir <absolute/path>

プロンプティングのコツ

1つの宣言的な命令。 「彼女はオレンジ色の傘を持って微笑んでいる」 — 命令形、単一の変更。
保持を最初に。 「[変わらない要素]を保持」 で始めて、その後変更を述べる。
小さく繰り返す。 複合編集は1パスで漂流します；順序付きパスに分割。

ルート4：Z-Image Turbo Inpaint — マスク駆動正確領域編集

モデル: tongyi-mai/z-image/turbo/inpainting

スキーマ

フィールド	型	必須	備考
`prompt`	string	yes	何を埋めるか/置き換えるか；マスクされていない周囲の保持制約。
`image`	string	yes	ソース画像URL。
`mask_image`	string	yes	グレースケールマスクURL（白=inpaint、黒=保持）。
`strength`	float	no	0.3～0.6修正、0.7～1.0完全置換。
`control_scale`	float	no	0.6～0.9が一般的。
`aspect_ratio`	enum	no	W:H出力比。
`seed`	int	no	再現性。

呼び出し

オブジェクト削除（低強度）：

runcomfy run tongyi-mai/z-image/turbo/inpainting \
  --input '{
    "prompt": "Remove overhead cables; preserve rooflines and sky gradient; thin clean sky.",
    "image": "https://.../street.jpg",
    "mask_image": "https://.../cables-mask.png",
    "strength": 0.5,
    "control_scale": 0.8
  }' \
  --output-dir <absolute/path>

領域置換（高強度）：

runcomfy run tongyi-mai/z-image/turbo/inpainting \
  --input '{
    "prompt": "Replace busy backdrop with smooth light gray studio paper; mask background only.",
    "image": "https://.../product.jpg",
    "mask_image": "https://.../bg-mask.png",
    "strength": 0.9
  }' \
  --output-dir <absolute/path>

プロンプティングのコツ

マスクURLが必須 — グレースケール、白=inpaint領域、黒=保持。マスクエッジの軽い曖昧さ（1～3px）は鋭いバイナリより自然にブレンド。
強度は意図による：0.3～0.5修正/クリーンアップ、0.6～0.7スタイルマッチ付きオブジェクト置換、0.8～1.0全領域置換。
マスク外に留まるものを名前で指定：「屋根のラインと空のグラデーションを保持」、「レンガパターンとモルタル色を一致させる」。
空間ラベルは引き続きヘルプ：マスクが領域を定義しても 「左の棚」、「右上の象限」 は有用。

制限事項

各ルートはそのモデルの制限を継承します。 Nano Banana：1～20入力、1～4出力。GPT Image 2 Edit：最大10リファレンス、4固定サイズ。Flux Kontext：シングルリファレンス。Z-Image Inpaint：マスク必須。
マルチルートブレンディングなし。 このスキルはコールごとに1つのモデルを選択します。
ブランド固有のオーバーライド — ユーザーが特定のモデルを指定した場合、対応するブランドスキル（gpt-image-edit、flux-kontext、nano-banana-edit）にルーティングして、より完全な処理を行う。

終了コード

code	意味
0	成功
64	CLI引数が不正
65	JSON入力が不正/スキーマ不一致
69	アップストリーム5xx
75	再試行可能：タイムアウト/429
77	サインインしていないまたはトークンが拒否された

完全なリファレンス：docs.runcomfy.com/cli/troubleshooting。

仕組み

スキルはユーザー意図に基づいてNano Banana Edit / GPT Image 2 Edit / Flux Kontext Pro / Z-Image Turbo Inpaintの1つを選択し、runcomfy run <model_id> を対応するJSON本文で呼び出します。CLIはModel APIにPOST、リクエストをポーリング、結果を取得、任意の .runcomfy.net/.runcomfy.com URLを --output-dir にダウンロードします。Ctrl-C は終了前にリモートリクエストをキャンセル。

セキュリティ＆プライバシー

トークンストレージ：runcomfy login はAPIトークンを ~/.config/runcomfy/token.json に、モード0600（オーナーのみ読み書き）で書き込みます。CI/コンテナではファイルを完全にバイパスするために RUNCOMFY_TOKEN 環境変数を設定。
入力境界：ユーザープロンプトは --input を通じてCLIにJSON文字列として渡されます。CLIはプロンプトをシェルで展開しません；JSON本文をHTTPSでModel APIに直接送信。プロンプト内容からのシェルインジェクション界面はなし。
サードパーティコンテンツ：渡した画像/マスク/ビデオURLはRunComfyモデルサーバーでフェッチ、マシン上のCLIでは不可。外部URLを信頼できないものとして扱う；画像ベースプロンプトインジェクションは任意の画像編集/ビデオ編集モデルの既知リスク。
アウトバウンドエンドポイント：model-api.runcomfy.net のみ（リクエスト送信）と *.runcomfy.net / *.runcomfy.com（生成出力のダウンロードホワイトリスト）。テレメトリなし、コールバックなし。
生成ファイルサイズ上限：CLIは単一ダウンロード>2GiBを中止して、悪意のあるまたは暴走モデル出力からのディスク満杯を防止。

ライセンス: MIT(寛容ライセンスのため全文を引用しています) · 原本リポジトリ

詳細情報

作者: agentspace-so
リポジトリ: agentspace-so/runcomfy-agent-skills
ライセンス: MIT
最終更新: 不明

GitHubで原本を見る →フィードバックを送る

Source: https://github.com/agentspace-so/runcomfy-agent-skills / ライセンス: MIT

image-edit

SKILL.md 本文

Image Edit — Pro Pack on RunComfy

ユーザーの意図に合わせた正しいモデルを選択

前提条件

ルート1：Nano Banana Edit — 一般的な編集+バッチのデフォルト

スキーマ

呼び出し

プロンプティングのコツ

ルート2：GPT Image 2 Edit — 多言語テキスト+マルチリファレンス合成

スキーマ

呼び出し

プロンプティングのコツ

ルート3：Flux Kontext Pro — シングルショット正確ローカル編集

スキーマ（最小限）

呼び出し

プロンプティングのコツ

ルート4：Z-Image Turbo Inpaint — マスク駆動正確領域編集

スキーマ

呼び出し

プロンプティングのコツ

制限事項

終了コード

仕組み

セキュリティ＆プライバシー

詳細情報

関連スキル

agent-browser

anyskill

engram

skyvern

pinchbench

openui