wip: use [Image #X] placeholder for clipboard paste and drag and drop by jackwotherspoon · Pull Request #14706 · google-gemini/gemini-cli

jackwotherspoon · 2025-12-08T13:56:19Z

Summary

This branch improves how pasted and drag-and-dropped images are handled in the
Gemini CLI input.

drag.and.drop.images.mp4

Details

Before: Images were inserted as @path/to/image.png file references,
requiring the model to resolve them.

After: Images are displayed as [Image #1], [Image #2], etc. placeholders
in the input, then injected directly as base64-encoded inline data when
submitting to the Gemini API.
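The submit step described above can be sketched roughly as follows. This is a minimal illustration, not the code from this branch: `RegisteredImage`, `buildParts`, and the local `Part` type are hypothetical names, and the sketch assumes the placeholder text stays in the prompt while each image is appended as an `inlineData` part. Images whose tag is no longer in the prompt are skipped.

```typescript
// Hypothetical shapes for illustration; the types in the PR may differ.
interface RegisteredImage {
  id: number; // the N in [Image #N]
  mimeType: string; // e.g. "image/png"
  bytes: Uint8Array; // raw file contents
}

// Local stand-in for a request part: plain text or base64 inline data.
type Part =
  | { text: string }
  | { inlineData: { mimeType: string; data: string } };

function buildParts(prompt: string, images: RegisteredImage[]): Part[] {
  const byId = new Map<number, RegisteredImage>();
  for (const img of images) byId.set(img.id, img);

  const parts: Part[] = [{ text: prompt }];
  const seen = new Set<number>();
  const tag = /\[Image #(\d+)\]/g;
  let match: RegExpExecArray | null;

  // Walk every placeholder still present in the prompt.
  while ((match = tag.exec(prompt)) !== null) {
    const id = Number(match[1]);
    const img = byId.get(id);
    if (!img || seen.has(id)) continue; // unknown or repeated tag
    seen.add(id);
    parts.push({
      inlineData: {
        mimeType: img.mimeType,
        data: Buffer.from(img.bytes).toString("base64"),
      },
    });
  }
  return parts;
}
```

With two registered images and a prompt that only mentions `[Image #2]`, this sketch returns the text part plus a single inline-data part for image 2.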

Key Features

- Visual placeholders: Users see [Image #N] tags that are syntax-highlighted and editable
- Deletable references: Users can remove image tags before submitting; only images with remaining tags are sent
- Multi-file drag-and-drop: Supports dropping multiple images at once, with proper handling of escaped spaces in filenames
- Mixed content: Non-image files in a multi-drop fall back to @path syntax
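To make the multi-file and mixed-content behavior concrete, here is a rough sketch of how a dropped string could be split and turned into input text. All names (`splitDroppedPaths`, `isImagePath`, `toInputText`) are hypothetical, and it assumes the terminal delivers paths separated by spaces, with spaces inside a filename escaped as `\ `. Re-escaping spaces in the `@path` fallback is also an assumption.

```typescript
// Extensions matching the formats listed under "Supported Formats".
const IMAGE_EXTENSIONS = new Set([".png", ".jpg", ".jpeg", ".webp", ".heic", ".heif"]);

// Split on unescaped spaces; "\ " is treated as a literal space in a filename.
function splitDroppedPaths(input: string): string[] {
  const paths: string[] = [];
  let current = "";
  for (let i = 0; i < input.length; i++) {
    const ch = input[i];
    if (ch === "\\" && input[i + 1] === " ") {
      current += " ";
      i++; // skip the escaped space
    } else if (ch === " ") {
      if (current) paths.push(current);
      current = "";
    } else {
      current += ch;
    }
  }
  if (current) paths.push(current);
  return paths;
}

function isImagePath(filePath: string): boolean {
  const dot = filePath.lastIndexOf(".");
  return dot !== -1 && IMAGE_EXTENSIONS.has(filePath.slice(dot).toLowerCase());
}

// Images become [Image #N] placeholders; everything else falls back to @path.
function toInputText(dropped: string, firstId = 1): string {
  let nextId = firstId;
  return splitDroppedPaths(dropped)
    .map((p) =>
      isImagePath(p) ? `[Image #${nextId++}]` : `@${p.replace(/ /g, "\\ ")}`,
    )
    .join(" ");
}
```

A real implementation would also need to validate that each path exists and handle quoting styles that other terminals use; this sketch covers only the backslash-escaped case.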

Files Changed

File	Purpose
`useClipboardImages.ts`	New hook managing image registry and base64 conversion
`clipboardUtils.ts`	Path parsing, validation, multi-file splitting
`highlight.ts`	Syntax highlighting for `[Image #N]` tokens
`InputPrompt.tsx`	Integration with paste/drop handling
`useGeminiStream.ts`	Injects image parts into API requests
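As an illustration of the highlighting step, the sketch below splits input text into plain and `[Image #N]` segments that a renderer could then style differently. The `Segment` type and `segmentImageTags` function are hypothetical and are not taken from `highlight.ts`.

```typescript
type Segment = { text: string; kind: "default" | "image" };

// Split the input into alternating plain-text and image-tag segments.
function segmentImageTags(input: string): Segment[] {
  const tag = /\[Image #\d+\]/g;
  const segments: Segment[] = [];
  let last = 0;
  let match: RegExpExecArray | null;

  while ((match = tag.exec(input)) !== null) {
    if (match.index > last) {
      segments.push({ text: input.slice(last, match.index), kind: "default" });
    }
    segments.push({ text: match[0], kind: "image" });
    last = match.index + match[0].length;
  }
  if (last < input.length) {
    segments.push({ text: input.slice(last), kind: "default" });
  }
  return segments;
}
```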

Supported Formats

PNG, JPEG, WEBP, HEIC, HEIF (per
Gemini API spec)
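One way to express this format list in code is an extension-to-MIME-type lookup. This is a sketch under assumptions: `MIME_BY_EXTENSION` and `mimeTypeForPath` are hypothetical names, and detecting the format from the file extension (rather than file contents) is an assumption, not something the PR description confirms.

```typescript
// Extension to MIME type for the formats listed above.
const MIME_BY_EXTENSION: Record<string, string> = {
  ".png": "image/png",
  ".jpg": "image/jpeg",
  ".jpeg": "image/jpeg",
  ".webp": "image/webp",
  ".heic": "image/heic",
  ".heif": "image/heif",
};

// Returns undefined for anything outside the supported list.
function mimeTypeForPath(filePath: string): string | undefined {
  const dot = filePath.lastIndexOf(".");
  if (dot === -1) return undefined;
  return MIME_BY_EXTENSION[filePath.slice(dot).toLowerCase()];
}
```

Returning `undefined` for unsupported extensions lets the caller decide whether to fall back to the @path syntax.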


gemini-code-assist · 2025-12-08T13:56:45Z

Summary of Changes

Hello @jackwotherspoon, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request significantly enhances the user experience for interacting with image content within the Gemini CLI. Instead of relying on the model to resolve local file paths, images are now displayed as user-friendly [Image #N] placeholders in the input. These placeholders are then converted to base64-encoded inline data and directly injected into the Gemini API request, streamlining the process and improving reliability for image-based prompts.

Highlights

Image Handling Improvement: Pasted and drag-and-dropped images are now represented by [Image #N] placeholders in the input, rather than @path references.
Base64 Injection: Images are base64-encoded and injected directly into the Gemini API request upon submission, improving reliability and streamlining the process.
Interactive Placeholders: [Image #N] tags are syntax-highlighted and editable, allowing users to remove image references before submission, and only images with remaining tags are sent.
Multi-file Drag-and-Drop: The system now supports dropping multiple images at once, correctly handling escaped spaces in filenames.
Mixed Content Fallback: Non-image files included in a multi-drop scenario will gracefully fall back to the traditional @path syntax.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page; here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize the Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counterproductive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

¹ Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

gemini-code-assist

Code Review

This pull request introduces a significant improvement for handling images by using placeholders like [Image #1] instead of file paths. The implementation is well-structured, introducing a new useClipboardImages hook, utility functions for image path parsing, and integrating this new flow into the input prompt and submission stream. The code is generally robust, with good handling of asynchronous operations and edge cases like escaped spaces in file paths. I've identified one high-severity issue where adding the same image multiple times does not behave as a user would expect. Please see the detailed comment.

gemini-code-assist · 2025-12-08T13:59:27Z

+      // Check if this path is already registered to prevent duplicates
+      if (prev.some((img) => img.path === absolutePath)) {
+        return prev;
+      }


The current implementation prevents registering an image if its path already exists in the images array. This can lead to unexpected behavior for the user. For example, if a user pastes the same image twice, they will see two distinct placeholders in the input (e.g., [Image #1] and [Image #2]), but only the first one will be registered and sent to the API. The second placeholder will be ignored, which is not what the user would expect.

To fix this, you should allow registering the same image path multiple times, each with its own unique ID. This ensures that each placeholder in the UI corresponds to an image that will be included in the prompt.
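A minimal sketch of that suggestion is shown below. The `prev` array, `absolutePath`, and the `path` field come from the diff above; the `id` field, the `ClipboardImage` type, and the `registerImage` name are assumptions made for illustration.

```typescript
// Hypothetical entry shape; only `path` is visible in the diff above.
interface ClipboardImage {
  id: number;
  path: string;
}

// Always append a new entry, even when the path is already registered,
// so that every placeholder maps to exactly one image.
function registerImage(prev: ClipboardImage[], absolutePath: string): ClipboardImage[] {
  const nextId = prev.reduce((max, img) => Math.max(max, img.id), 0) + 1;
  return [...prev, { id: nextId, path: absolutePath }];
}
```

With this shape, pasting the same file twice yields two entries with IDs 1 and 2, so both [Image #1] and [Image #2] resolve to an image at submit time.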

github-actions · 2025-12-08T13:59:36Z

Size Change: +8.95 kB (+0.04%)

Total Size: 21.5 MB

Filename	Size	Change
`./bundle/gemini.js`	21.5 MB	+8.95 kB (+0.04%)

ℹ️ View Unchanged

Filename	Size
`./bundle/sandbox-macos-permissive-closed.sb`	1.03 kB
`./bundle/sandbox-macos-permissive-open.sb`	890 B
`./bundle/sandbox-macos-permissive-proxied.sb`	1.31 kB
`./bundle/sandbox-macos-restrictive-closed.sb`	3.29 kB
`./bundle/sandbox-macos-restrictive-open.sb`	3.36 kB
`./bundle/sandbox-macos-restrictive-proxied.sb`	3.56 kB

compressed-size-action

jackwotherspoon added 8 commits December 7, 2025 23:41

feat: improve pasting of images

8dee76d

chore: same for drag and drop images

8f4a381

chore: fix .bmp files and race condition

2183daf

feat: support multi-file drag and drop

43328e6

chore: add check for removed images

ffb6770

chore: update supported types and misc

ae1e44c

refactor: add new util file

7070af3

chore: cleanup

c7a2050

jackwotherspoon requested a review from a team as a code owner December 8, 2025 13:56

gemini-code-assist Bot reviewed Dec 8, 2025

jacob314 self-requested a review December 8, 2025 18:38

This was referenced Dec 9, 2025

fix: use Gemini API supported image formats for clipboard #14762

Merged

feat: support multi-file drag and drop of images #14832

Merged

jackwotherspoon mentioned this pull request Dec 22, 2025

feat: use [Image #X] placeholder for clipboard paste and drag and drop #15432

Closed

26 tasks

jackwotherspoon closed this Dec 22, 2025

jacob314 deleted the image-placeholder branch February 19, 2026 07:14

jackwotherspoon wants to merge 8 commits into main from image-placeholder
