fix(resume): use char-aware string truncation for UTF-8 safety #33

echobt · 2026-02-04T14:47:47Z

Summary

Fixes #5288 - Path truncation panics on multi-byte UTF-8.

Problem

String truncation used byte-based slicing that panics on multi-byte characters.

Solution

Use char-based iteration for safe truncation.

greptile-apps · 2026-02-04T14:50:05Z

Greptile Overview

Greptile Summary

Fixed critical UTF-8 safety bugs that caused panics when slicing multi-byte characters. Replaced unsafe byte-based string slicing with char-aware methods across three files:

resume_picker.rs: Replaced .len() with .chars().count() and byte slicing with .chars().take() in truncate_string() function
notification.rs: Used char_indices() with .get() for safe session ID truncation at char boundaries
import_cmd.rs: Wrapped all string slicing operations in .get() to gracefully handle invalid UTF-8 boundaries when extracting base64 data

All changes maintain original functionality while preventing runtime panics on multi-byte UTF-8 input (emoji, CJK characters, etc.). Comprehensive unit tests added for UTF-8 edge cases.

Confidence Score: 5/5

This PR is safe to merge with no risk - it fixes critical safety bugs without changing behavior
All changes follow Rust best practices for UTF-8 handling, comprehensive tests verify correctness, and the fixes only improve safety without altering logic
No files require special attention

Important Files Changed

Filename	Overview
src/cortex-resume/src/resume_picker.rs	Fixed UTF-8 panic in `truncate_string` by using char-based iteration instead of byte slicing, with comprehensive tests
src/cortex-cli/src/utils/notification.rs	Replaced byte-based string slicing with char-aware boundary detection for session ID truncation
src/cortex-cli/src/import_cmd.rs	Added safe slicing with `.get()` to prevent panics when extracting base64 data from message content

Sequence Diagram

sequenceDiagram
    participant User
    participant ResumePicker as Resume Picker UI
    participant Notification as Notification System
    participant ImportCmd as Import Command
    participant TruncateString as truncate_string()
    participant SafeSlicing as Safe UTF-8 Slicing

    Note over ResumePicker,SafeSlicing: String Truncation Flow (resume_picker.rs)
    User->>ResumePicker: Display session with path/title
    ResumePicker->>TruncateString: truncate_string(text, width)
    TruncateString->>TruncateString: Count chars (not bytes)
    alt char_count <= width
        TruncateString-->>ResumePicker: Return original string
    else width > 3
        TruncateString->>TruncateString: Take (width-3) chars safely
        TruncateString-->>ResumePicker: Return truncated + "..."
    else
        TruncateString->>TruncateString: Take width chars
        TruncateString-->>ResumePicker: Return truncated string
    end
    ResumePicker-->>User: Display safe truncated text

    Note over Notification,SafeSlicing: Session ID Truncation (notification.rs)
    User->>Notification: Task completes (session_id)
    Notification->>SafeSlicing: Truncate session_id to 8 chars
    SafeSlicing->>SafeSlicing: char_indices().take_while(idx < 8)
    SafeSlicing->>SafeSlicing: Find last valid char boundary
    SafeSlicing->>SafeSlicing: Use .get(..end) for safe slice
    SafeSlicing-->>Notification: Return safe truncated ID
    Notification-->>User: Show desktop notification

    Note over ImportCmd,SafeSlicing: Base64 Extraction (import_cmd.rs)
    User->>ImportCmd: Import messages with embedded images
    ImportCmd->>ImportCmd: Find "data:image/" pattern
    ImportCmd->>SafeSlicing: Slice at data_uri_start with .get()
    alt Valid byte boundary
        SafeSlicing-->>ImportCmd: Return Some(slice)
        ImportCmd->>SafeSlicing: Slice at base64_start with .get()
        alt Valid byte boundary
            SafeSlicing-->>ImportCmd: Return Some(base64_data)
            ImportCmd->>ImportCmd: Validate base64 encoding
        else Invalid boundary
            SafeSlicing-->>ImportCmd: Return None
            ImportCmd->>ImportCmd: Skip (continue)
        end
    else Invalid boundary
        SafeSlicing-->>ImportCmd: Return None
        ImportCmd->>ImportCmd: Skip message (continue)
    end
    ImportCmd-->>User: Safe import without panics

greptile-apps

_{3 files reviewed, no comments}

_{Edit Code Review Agent Settings | Greptile}

This PR consolidates the following UTF-8 safety fixes: - #31: Use safe UTF-8 slicing in import command base64 extraction - #32: Use safe UTF-8 slicing for session IDs in notifications - #33: Use char-aware string truncation for UTF-8 safety in resume - #35: Use safe UTF-8 slicing for session IDs in lock command - #37: Validate UTF-8 boundaries in mention parsing All changes ensure safe string operations that respect UTF-8 boundaries: - Replaced direct byte slicing with char-aware methods - Added floor_char_boundary checks before slicing - Prevents panics from slicing multi-byte characters

echobt · 2026-02-04T15:41:20Z

Consolidated into #70 - fix: consolidated UTF-8 safety improvements for string slicing

echobt added 3 commits February 4, 2026 14:43

fix(cli): use safe UTF-8 slicing in import command base64 extraction

912acd7

fix(notifications): use safe UTF-8 slicing for session IDs

9b13828

fix(resume): use char-aware string truncation for UTF-8 safety

90ddb66

greptile-apps bot reviewed Feb 4, 2026

View reviewed changes

echobt mentioned this pull request Feb 4, 2026

fix: consolidated UTF-8 safety improvements for string slicing #70

Closed

echobt closed this Feb 4, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(resume): use char-aware string truncation for UTF-8 safety #33

fix(resume): use char-aware string truncation for UTF-8 safety #33

Uh oh!

echobt commented Feb 4, 2026

Uh oh!

greptile-apps bot commented Feb 4, 2026

Important Files Changed

Uh oh!

greptile-apps bot left a comment

Uh oh!

echobt commented Feb 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

fix(resume): use char-aware string truncation for UTF-8 safety #33

fix(resume): use char-aware string truncation for UTF-8 safety #33

Uh oh!

Conversation

echobt commented Feb 4, 2026

Summary

Problem

Solution

Uh oh!

greptile-apps bot commented Feb 4, 2026

Greptile Overview

Greptile Summary

Confidence Score: 5/5

Important Files Changed

Sequence Diagram

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

echobt commented Feb 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant