Fix Segment._split_cells for non-unit-width characters (emoji, CJK) by ShiLiguo123 · Pull Request #4118 · Textualize/rich

ShiLiguo123 · 2026-05-12T12:54:54Z

Summary

Fix Segment._split_cells() to correctly handle strings containing double-cell-width characters (emoji, CJK characters, etc.).

Root Cause

The old algorithm used a heuristic initial position: pos = int((cut / cell_length) * len(text)). This assumes uniform character width, which fails badly for mixed-width strings.

For example:

s = Segment('🦊🦊🦊\n\n\n\n\n\n')
Segment._split_cells(s, 3)
# Wrong: (Segment('🦊🦊🦊\n '), Segment(' \n\n\n\n'))

Fix

Replaced the heuristic + while-loop with a character-by-character walk-through. The new algorithm:

Tracks cumulative cell position as it iterates over characters
When it finds the character that exceeds the cut point, it splits there
Double-width characters are replaced with two spaces as before

Fixes #3299

The old algorithm used a heuristic initial position estimate int((cut / cell_length) * len(text)) and then adjusted in a while loop. This fails for strings with mixed single/double cell-width characters (emoji, CJK). Replace with a simple character-by-character walk-through that correctly tracks cumulative cell positions. Fixes Textualize#3299

willmcgugan · 2026-05-12T13:03:58Z

Your PR has been closed due to a AI policy violation.

Please read the following before submitting further PRs.

https://github.com/Textualize/textual/blob/main/AI_POLICY.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix Segment._split_cells for non-unit-width characters (emoji, CJK)#4118

Fix Segment._split_cells for non-unit-width characters (emoji, CJK)#4118
ShiLiguo123 wants to merge 1 commit into
Textualize:masterfrom
ShiLiguo123:fix-split-cells-emoji

ShiLiguo123 commented May 12, 2026

Uh oh!

willmcgugan commented May 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

ShiLiguo123 commented May 12, 2026

Summary

Root Cause

Fix

Uh oh!

willmcgugan commented May 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants