Remove redundant file dialog string conversions #1665

tobil4sk · 2023-04-18T12:25:47Z

Currently, there are a lot of redundant string conversions when using the file dialog api. This was mentioned briefly in #1622.

Here is a rough overview of the current situation:

Linux/macOS:

C++: hxstring -> std::wstring -> std::string, passed into tinyfiledialogs utf-8 api, then std::string -> std::wstring -> hxstring
HL: hl_vstring -> std::wstring -> std::string, passed into tinyfiledialogs utf-8 api, then std::string -> std::wstring -> utf-8 bytes -> native utf-16 hashlink string (on Haxe side via String.fromUtf8())

Windows:

C++: hxstring -> std::wstring, passed into tinyfiledialogs utf-16 api, then std::wstring -> hxstring
HL: hl_vstring -> std::wstring, passed into tinyfiledialogs utf-16 api, then std::wstring -> utf-8 bytes -> native utf-16 hashlink string (on Haxe side via String.fromUtf8())

This cleans things up to avoid unnecessary conversions, so now it looks like this in most cases:

Linux/macOS:

C++: hxstring -> utf-8[0], passed into tinyfiledialogs utf-8 api, then utf-8 -> hxstring
HL: hl_vstring -> utf-8, passed into tinyfiledialogs utf-8 api, then utf-8 -> utf-16 hashlink string (on Haxe side via String.fromUtf8())

Windows:

C++: hxstring -> utf-16[0], passed into tinyfiledialogs utf-16 api, then utf-16 -> hxstring
HL: hl_vstring, passed into tinyfiledialogs utf-16 api, then utf-16 -> utf8 -> utf-16 hashlink string (on Haxe side via String.fromUtf8())

[0] hxcpp can use ASCII (which is compatible with utf-8) or utf-16 depending on the string, so these conversions are avoided in some cases.

We can improve Hashlink further (for Windows), by returning utf-16 strings from the apis, however, I'm not sure if this would be considered a breaking change (new lime.hdlls would be incompatible with old lime haxe files).

In #1622, we briefly discussed using the utf-8 tinyfiledialog functions on Windows to unify the api, however, I looked into this and on Windows these functions just convert back to utf-16, so there would still be extra conversions. So I think this is the cleanest solution.

I've tested on Linux and Windows with the code from #1622, and everything works well.

nixbody · 2023-04-18T12:36:56Z

Just a little correction here, HXCPP (with smart strings enabled) never uses UTF-8 for strings. It uses ASCII and when any character in a string doesn't fit within ASCII range then the entire string is converted to UTF-16. HXCPP, however, provides a function to convert it's strings to UTF-8.

tobil4sk · 2023-04-18T12:44:19Z

Yes, that's correct. However, an ASCII string is valid utf-8, so when using hxs_utf8 on an hxcpp string which is ASCII encoded, it just returns a pointer to the ASCII string without any conversion being necessary.

nixbody · 2023-04-18T12:57:35Z

I was wondering if you point that out :) I thought you might know, my apologies for stating the obvious.

tobil4sk · 2023-04-18T13:30:11Z

No worries, it's worth making the distinction :)

project/src/ui/FileDialog.cpp

tobil4sk · 2023-04-20T14:24:17Z

There are a few more places where wstrings are converted to utf8 still. These include most of the system functions, though these are windows only, where wchar is equal to char16_t so hl_to_utf8/alloc_wstring can be used there. For System::GetDirectory, it can be modified to avoid wstrings.

The bigger issue is Font::GetFamilyName() in text/Font.cpp. Currently it assumes that wchar is 16 bits, which is only the case on Windows. Even if it used char16_t, the alloc_hxs_utf16() function returns an HxString, and I can't seem to figure out to turn that into a value.

project/include/ui/FileDialog.h

No need to iterate through the whole string to find the length, just to see if it is empty.

tobil4sk · 2025-08-28T00:26:59Z

I fixed conflicts in the PR and updated it to clean things up and reduce duplicate code. I've tested on linux and windows and it works as expected.

Aside from cleaning things up and simplicity, it is also a good idea to make this change because the std::wstring currently being created on non-windows systems are actually utf8 with widened characters, which is a bit confusing and doesn't play nicely with apis (e.g. hxcpp's alloc_wstring). This confusion has caused issues in the past like #1622.

player-03 · 2025-11-21T05:10:43Z

I was going to ask about memory leaks, since some delete statements were removed. But then I tracked everything down in tinyfiledialogs.c and found that it only returns static variables. If it's called again, it reuses the variable, overwriting the old value. And that's fine because we copy the value into a Haxe string before calling again. As far as I can tell, everything works as it should.

Not that I don't trust you to check, I just wanted to understand it myself. Plus, I learned some basics of gdb, which is nice.

tobil4sk · 2025-11-22T08:31:45Z

I was going to ask about memory leaks, since some delete statements were removed. But then I tracked everything down in tinyfiledialogs.c and found that it only returns static variables.

This is something I found confusing too so perhaps a comment might be useful. I can't remember if tfd has some documentation explaining this or if i just had to check the source?

Also, I suppose the static string might also not be thread safe, though that is regardless of this PR. Though, given it is file dialogs it might be unlikely to cause issues in practice.

Not that I don't trust you to check, I just wanted to understand it myself.

I agree, we want code that can be easily verified/understood by anyone reading it, rather than just assuming it works based on trust in one contributor.

player-03 · 2025-11-23T00:28:48Z

I can't remember if tfd has some documentation explaining this or if i just had to check the source?

I checked the source when I wrote that, but it turns out they also mention it in the readme: "String memory is preallocated statically for all the returned values."

Also, I suppose the static string might also not be thread safe, though that is regardless of this PR. Though, given it is file dialogs it might be unlikely to cause issues in practice.

Agreed on all counts. I doubt most OSes allow opening dialogs from non-UI threads, but if they did, both threads would read/write that same memory.

tobil4sk mentioned this pull request Apr 18, 2023

UNICODE fixes (clipboard, window title, file dialogs, paths, font glyphs, ...) #1472

Merged

nixbody approved these changes Apr 18, 2023

View reviewed changes

project/src/ui/FileDialog.cpp Show resolved Hide resolved

nixbody approved these changes Apr 18, 2023

View reviewed changes

nixbody approved these changes Apr 20, 2023

View reviewed changes

player-03 reviewed Feb 2, 2024

View reviewed changes

project/include/ui/FileDialog.h Outdated Show resolved Hide resolved

tobil4sk force-pushed the fix/dialogs branch from 4677f70 to c520eeb Compare August 27, 2025 23:52

Remove redundant file dialog string conversions

56087c6

No need to iterate through the whole string to find the length, just to see if it is empty.

tobil4sk force-pushed the fix/dialogs branch from c520eeb to 56087c6 Compare August 28, 2025 00:06

player-03 approved these changes Nov 21, 2025

View reviewed changes

Add comment about file dialog string ownership

4613c72

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Remove redundant file dialog string conversions #1665

Remove redundant file dialog string conversions #1665

Uh oh!

tobil4sk commented Apr 18, 2023 •

edited

Loading

Uh oh!

nixbody commented Apr 18, 2023 •

edited

Loading

Uh oh!

tobil4sk commented Apr 18, 2023

Uh oh!

nixbody commented Apr 18, 2023 •

edited

Loading

Uh oh!

tobil4sk commented Apr 18, 2023

Uh oh!

Uh oh!

tobil4sk commented Apr 20, 2023

Uh oh!

Uh oh!

tobil4sk commented Aug 28, 2025

Uh oh!

player-03 commented Nov 21, 2025

Uh oh!

tobil4sk commented Nov 22, 2025

Uh oh!

player-03 commented Nov 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Remove redundant file dialog string conversions #1665

Are you sure you want to change the base?

Remove redundant file dialog string conversions #1665

Uh oh!

Conversation

tobil4sk commented Apr 18, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

nixbody commented Apr 18, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tobil4sk commented Apr 18, 2023

Uh oh!

nixbody commented Apr 18, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tobil4sk commented Apr 18, 2023

Uh oh!

Uh oh!

tobil4sk commented Apr 20, 2023

Uh oh!

Uh oh!

tobil4sk commented Aug 28, 2025

Uh oh!

player-03 commented Nov 21, 2025

Uh oh!

tobil4sk commented Nov 22, 2025

Uh oh!

player-03 commented Nov 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

tobil4sk commented Apr 18, 2023 •

edited

Loading

nixbody commented Apr 18, 2023 •

edited

Loading

nixbody commented Apr 18, 2023 •

edited

Loading