Skip to content

Add runtime encoding detection and conversion #46

Merged
Spartan322 merged 1 commit intomasterfrom
fix/char-detection
Jun 18, 2024
Merged

Add runtime encoding detection and conversion #46
Spartan322 merged 1 commit intomasterfrom
fix/char-detection

Conversation

@Spartan322
Copy link
Copy Markdown
Member

@Spartan322 Spartan322 commented Jun 11, 2024

Fixes #40
Fixes #41
Fixes #42
Fixes #43
Related: OpenVicProject/OpenVic#180

Win-1251/1252 detection is a reduced C++ version of https://github.com/hsivonen/chardetng
Add manually-specified encoding fallback
Add default system encoding fallback
Add error recovery to v2script
Add unknown encoding detection warning
Remove csv::Parser templating
Fix lua files dropping data

Update lexy to foonathan/lexy@1e5d99f

Remove exclusive reliance on lexy::default_encoding for v2script

Move internal concepts to src/openvic-detail/InternalConcepts.hpp
Move contents of DetectUtf8.hpp to src/detail/Detect.hpp
Move openvic-dataloader/AbstractSyntaxTree.hpp to src
Move DiagnosticLogger.hpp to src
Move File.hpp to src
Move openvic-dataloader/detail/utlity files to openvic-dataloader/detail

Add ovdl::utility::type_concat
Add ovdl::utility::type_prepend
Add ovdl::utility::is_instance_of
Overhaul parse error messages

@Spartan322 Spartan322 added the enhancement New feature or request label Jun 11, 2024
@Spartan322 Spartan322 force-pushed the fix/char-detection branch 2 times, most recently from 3f5bc43 to d6c50cb Compare June 15, 2024 07:51
@Spartan322 Spartan322 changed the title Implement character detection and conversion Add runtime encoding detection and conversion Jun 15, 2024
@Spartan322 Spartan322 marked this pull request as ready for review June 15, 2024 07:53
@Spartan322 Spartan322 force-pushed the fix/char-detection branch 2 times, most recently from f9002d2 to 1ed6a19 Compare June 15, 2024 07:57
@Spartan322 Spartan322 force-pushed the fix/char-detection branch from 1ed6a19 to 5b47337 Compare June 17, 2024 10:18
Hop311
Hop311 previously approved these changes Jun 17, 2024
Copy link
Copy Markdown
Contributor

@Hop311 Hop311 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Tested on ASCII, Windows-1251, Windows-1252 and UTF-8 files 👍

Comment thread src/headless/main.cpp
	Win-1251/1252 detection is a reduced C++ version of https://github.com/hsivonen/chardetng
Add manually-specified encoding fallback
Add default system encoding fallback
Add error recovery to v2script
Add unknown encoding detection warning
Remove csv::Parser templating
Fix lua files dropping data

Update lexy to foonathan/lexy@1e5d99f

Remove exclusive reliance on lexy::default_encoding for v2script

Move internal concepts to src/openvic-detail/InternalConcepts.hpp
Move contents of DetectUtf8.hpp to src/detail/Detect.hpp
Move openvic-dataloader/AbstractSyntaxTree.hpp to src
Move DiagnosticLogger.hpp to src
Move File.hpp to src
Move openvic-dataloader/detail/utlity files to openvic-dataloader/detail

Add ovdl::utility::type_concat
Add ovdl::utility::type_prepend
Add ovdl::utility::is_instance_of
Overhaul parse error messages
@Spartan322 Spartan322 merged commit 8b623bf into master Jun 18, 2024
@Hop311 Hop311 deleted the fix/char-detection branch July 10, 2024 19:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

enhancement New feature or request

Projects

None yet

3 participants