What would you like to be added?
Summaries of Gemini's thinking.
The ability to give Gemini real-time hints is awesome, but almost useless if the user does not understand what Gemini is doing. Gemini doesn't tend to explain itself much, so the only output available with actionable content is Gemini's thoughts.
Unfortunately, Gemini is a few times more verbose than Claude, and highly repetitive in its thinking. No sane SWE would read all of Gemini's thinking. Fortunately, we have Gemini 2.5 Flash Lite, a cheap model that excels at summaries. I've done a PoC in CodeRhapsody, and it is very cool. Compared to Claude's thinking, the quality of the resulting summaries is good. The only downside is that they aren't displayed as quickly: Claude's thinking can stream in real time, while we must wait for thinking streaming to complete before we can generate a summary.
In building my PoC, I found that context from prior thinking summaries is critical. You can't just summarize the thinking attached to a single response; you have to track the thinking blocks and their summaries for every message back to the start of the tool-chain call. Once I added this enhancement, thinking summaries became more concise and actionable.
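To make the idea concrete, here is a minimal sketch of the context tracking described above. It is not the PoC code: the class name `ThinkingSummarizer`, the `summarize` callback, and the prompt wording are all hypothetical, and the callback stands in for whatever call is made to the summarizing model. It assumes each thinking block is passed in only after its stream has completed.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple


@dataclass
class ThinkingSummarizer:
    """Summarizes each thinking block with earlier blocks in the same tool chain as context."""

    # Hypothetical hook: takes a prompt string and returns the summary text
    # (for example, a wrapper around a call to a cheap summarizing model).
    summarize: Callable[[str], str]
    # (raw thinking, summary) pairs collected since the start of the current tool chain.
    history: List[Tuple[str, str]] = field(default_factory=list)

    def start_tool_chain(self) -> None:
        """Reset the context when a new tool chain begins."""
        self.history.clear()

    def build_prompt(self, thinking: str) -> str:
        """Assemble a prompt containing prior thinking, prior summaries, and the new thinking."""
        parts = [
            "Summarize the latest thinking in one or two actionable sentences.",
            "Do not repeat points already covered by earlier summaries.",
        ]
        for i, (prev_thinking, prev_summary) in enumerate(self.history, start=1):
            parts.append(f"--- Earlier thinking {i} ---\n{prev_thinking}")
            parts.append(f"--- Earlier summary {i} ---\n{prev_summary}")
        parts.append(f"--- Latest thinking ---\n{thinking}")
        return "\n\n".join(parts)

    def add_thinking(self, thinking: str) -> str:
        """Summarize a completed thinking block and record it for later calls."""
        summary = self.summarize(self.build_prompt(thinking))
        self.history.append((thinking, summary))
        return summary
```

Because the summarizer sees its own earlier summaries, it can be told to skip points it has already reported, which is one plausible reason the summaries get shorter once this context is included.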
Why is this needed?
Summaries of thinking are required if you want to empower users to guide the AI in real time, collaborating as you would with a human. This results in a ~2X increase in productivity.
Additional context
I've used real-time collaboration with Claude since August, and have written 2,000 lines of production-ready code per day since then. I estimate I am 2X more productive with real-time collaboration with the AI because:
- I rarely have to throw away work, fix my prompt, and start over.
- I am the expert in the code by the time the AI finishes generating it.
Being the expert is required, obviously, and instead of becoming the expert by reverse-engineering the AI's code, I become the expert as a side-benefit of my real-time collaboration with the AI.
I MUST read Gemini's thinking for real-time collaboration to work, but I will go insane if I have to read Gemini's raw thinking output. If we are to use Gemini 3.0 Pro for real-time collaboration, we MUST improve the feedback to the user. This feature works.