Add Segment Any Text (SaT) model#7
Merged
BrandonWeng merged 5 commits intoFluidInference:mainfrom Oct 23, 2025
Merged
Conversation
Updated README formatting for clarity.
BrandonWeng
reviewed
Oct 23, 2025
| @@ -0,0 +1,64 @@ | |||
| # Segment Any Text CoreML | |||
| Segment Any Text is state-of-the-art sentence segmentation with 3 Transfomer layers. A pytorch version of the model is used in [wtsplit](https://github.com/segment-any-text/wtpsplit) and additional details can be found in this [paper](https://arxiv.org/abs/2406.16678). | |||
Member
There was a problem hiding this comment.
d you want to link your HF repo as well if folks want to use it directly?
Contributor
Author
There was a problem hiding this comment.
Good point. Just updated the README.
BrandonWeng
reviewed
Oct 23, 2025
Member
BrandonWeng
left a comment
There was a problem hiding this comment.
nit but could you place this under models/segment-text/coreml
Contributor
Author
No problem, I wasn't sure of the structure. It's done. |
BrandonWeng
approved these changes
Oct 23, 2025
Member
BrandonWeng
left a comment
There was a problem hiding this comment.
thank you! once your project is open sourced, do feel free to add it here and to show case as well (same with FluidAudio repo)
Contributor
Author
Will do. |
Alex-Wengg
added a commit
that referenced
this pull request
Jan 31, 2026
* docs: add ANE float16 beeping issue to iOS CoreML issues Add issue #7 documenting the Mimi decoder beeping artifact caused by ANE float16 precision loss in streaming state feedback. Update issue #4 to reflect the per-model compute unit configuration. * docs: add voice duration differences issue, update compute unit docs - Add issue #8: voice-dependent duration differences between MLX and CoreML (azelma -2.00s, javert +1.92s — model-level behavior, not fixable in Swift) - Update issue #4: all models now use .cpuAndGPU (not per-model config) - Update issue #7: reflect all-models .cpuAndGPU fix
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
This model segments sentences and works in situations where text is lacking punctuation.