Fix the handling of supplementary characters (characters > U+FFFF) by mkauf · Pull Request #66 · xerial/sqlite-jdbc

mkauf · 2015-10-26T16:26:58Z

JNI uses a modified UTF-8 encoding. For supplementary characters, an invalid UTF-8 sequence was written to the database, which resulted in interoperability problems. The solution is to avoid UTF-8 in the native code and use the UTF-16 functions of SQLite (where possible). SQLite will then convert the UTF-16 to standards-compliant, unmodified UTF-8.

This also fixes related bugs in JDBC3PreparedStatement and improves the "out of memory" handling in the native code.

Fixed Issues:

JNI uses a modified UTF-8 encoding. For supplementary characters, an invalid UTF-8 sequence was written to the database, which resulted in interoperability problems. The solution is to avoid UTF-8 in the native code and use the UTF-16 functions of SQLite (where possible). SQLite will then convert the UTF-16 to standards-compliant, unmodified UTF-8. This also fixes related bugs in JDBC3PreparedStatement and improves the "out of memory" handling in the native code. Fixed Issues: - https://bitbucket.org/xerial/sqlite-jdbc/issues/200/wrong-utf-8-decoding-of-unicode-code , same as #61 - https://bitbucket.org/xerial/sqlite-jdbc/issues/144/nativedbexec-throws-an-exception-without - https://bitbucket.org/xerial/sqlite-jdbc/issues/84/bug-in-nativedbc-bind_1blob - https://bitbucket.org/xerial/sqlite-jdbc/issues/70/setting-a-blob-in-prepstmt

mkauf · 2015-10-26T16:37:42Z

I think that the continuous integration system has not compiled the native library, but used an old version of the native library instead. A new unit test that I have written fails because the old native library has been used.

xerial · 2015-10-26T18:11:26Z

OK. I will prepare native binaries rebuilt with this fix.

xerial · 2015-10-26T18:28:04Z

Thanks for the proper error handling and the improvement of the query execution by using new API of SQLite.

mkauf · 2015-11-01T10:29:09Z

Thank you for merging!

jberkel · 2015-12-08T17:38:02Z

Just ran into this bug, glad it's already been fixed. However there doesn't seem to be a 3.9.1-SNAPSHOT version on sonatype, I only found 3.9.0.

xerial mentioned this pull request Oct 26, 2015

Mkauf fix utf8 supplementary characters #67

Merged

xerial merged commit a4cf82d into xerial:master Oct 26, 2015

mkauf deleted the fix-utf8-supplementary-characters branch November 1, 2015 10:29

gwenn mentioned this pull request Jun 7, 2016

Supplementary characters (characters > U+FFFF) liteglue/Android-sqlite-native-driver#2

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix the handling of supplementary characters (characters > U+FFFF)#66

Fix the handling of supplementary characters (characters > U+FFFF)#66
xerial merged 1 commit intoxerial:masterfrom
mkauf:fix-utf8-supplementary-characters

mkauf commented Oct 26, 2015

Uh oh!

mkauf commented Oct 26, 2015

Uh oh!

xerial commented Oct 26, 2015

Uh oh!

xerial commented Oct 26, 2015

Uh oh!

mkauf commented Nov 1, 2015

Uh oh!

jberkel commented Dec 8, 2015

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

Conversation

mkauf commented Oct 26, 2015

Uh oh!

mkauf commented Oct 26, 2015

Uh oh!

xerial commented Oct 26, 2015

Uh oh!

xerial commented Oct 26, 2015

Uh oh!

mkauf commented Nov 1, 2015

Uh oh!

jberkel commented Dec 8, 2015

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments