Added check for obsolete keys (no assertions though) and removed thos… by oscargus · Pull Request #869 · JabRef/jabref

oscargus · 2016-02-25T20:06:13Z

There's not a way to automatically remove keys, right?

simonharrer · 2016-02-26T09:22:12Z

Good idea to clean this up. What you still need to do to make this cleanup complete:

change the python script which is called via the localization.gradle so that it not only adds the keys but also deletes unused keys, kind of a sync functionality; maybe we can rename the long method to something short which says syncPropertyFiles
add assertions to your two test methods which make the build fail if there are unused keys in any properties file
take into account that we not only have english but also other languages

oscargus · 2016-03-05T14:25:44Z

Assertions are added. Not sure if my Python skills are good enough to sort out the removal. I assume that one will need to rewrite the complete translation file when removing the entries. Regarding other languages it could work as when adding: English is manual, python-script to remove from other languages.

simonharrer · 2016-03-06T16:11:41Z

Nice. To help you in your python skills, I have implemented the script - you can use this code:

$ git diff
diff --git a/scripts/syncLang.py b/scripts/syncLang.py
index 40fb077..02ce513 100644
--- a/scripts/syncLang.py
+++ b/scripts/syncLang.py
@@ -60,6 +60,19 @@ def append_keys_to_file(filename, keys):
     f.close()


+def remove_keys_from_file(filename, keys):
+    lines = open(filename).readlines()
+    lines_to_write = []
+    for line in lines:
+        add = True
+        for key in keys:
+            if(line.startswith(key+"=")):
+                add = False
+        if add:
+            lines_to_write.append(line)
+    open(filename, 'w').writelines(lines_to_write)
+
+
 def compare_property_files_to_main_property_file(main_properties_file, other_properties_files, append_missing_keys_to_other_properties_files):
     keys_in_properties_file = get_keys_from_lines(read_all_lines(main_properties_file))

@@ -86,6 +99,9 @@ def compare_property_files_to_main_property_file(main_properties_file, other_pro
             print "----> Possible obsolete keys (not in English language file):"
             for key in keys_obsolete:
                 print key
+
+            if append_missing_keys_to_other_properties_files:
+                remove_keys_from_file(other_properties_file, keys_obsolete)
             print ""

Have a look at the allFilesMustHaveSameKeys test as well, as it can be extended to ensure that all language files have only the keys in the english file and only those.

simonharrer · 2016-03-17T12:49:09Z

@oscargus do you need any help with this PR?

oscargus · 2016-03-17T12:59:38Z

I think it should be OK. I had missed that you provided the missing Python functions though. Thanks!

oscargus · 2016-03-17T21:10:40Z

I can't find the problem in the Russian translation file... The Python script doesn't find the problematic line, but apparently the Java parser finds it...

simonharrer · 2016-03-17T21:49:27Z

in http://www.bouncycastle.org/ the : is not escaped.

simonharrer · 2016-03-17T21:50:39Z

in File_rename_failed_for_%0_entries.=\there is an escaped space. Maybe that is the issue?

oscargus · 2016-03-17T21:52:16Z

bouncycastle is not in the file anymore and the space I've tried. However, it seems like #, :, and ! (and =) should be escaped, so I'll try those (just found some information...).

simonharrer · 2016-03-17T21:54:24Z

I am not sure if # has to be escaped. I think, we only escape colon, equals and backslashes.

simonharrer · 2016-03-17T21:59:22Z

Tests are OK on my machine locally. Very strange.

oscargus · 2016-03-17T22:08:00Z

According to Wikipedia:

# You are reading the ".properties" entry.
! The exclamation mark can also mark text as comments.
# The key and element characters #, !, =, and : are written with
# a preceding backslash to ensure that they are properly loaded.

Doesn't work on my local machine, but now I have at least escaped all characters. Some translations were not correctly escaped (including the Russian). Still no success though...

oscargus · 2016-03-17T22:11:54Z

If one removes the ! in the three first comments, the extra string doesn't contain a !. Removing all comments also removes the #. But really out of ideas at the moment...

oscargus · 2016-03-17T22:19:55Z

I have read up a bit more. As I understand it:

= and : should be escaped if they are in the key
and ! should be escaped if they are the first character in the key

Still, these characters can always be escaped.

(Doesn't help though...)

simonharrer · 2016-03-17T22:33:12Z

hm, you could try to debug the test and see why it fails. Or create a small main class which does this only for the russian language. As it works on my machine, I am unable to help here. :-(

oscargus · 2016-03-17T22:54:32Z

I've tried this (by print-out-debugging), but since the whole file is loaded through properties.load(is); I cannot even figure out when the extra entry is inserted (and the Map/keySet is not in any order I can figure out).

But I just made some progress! Using a Reader and setting the encoding to "UTF-8" lead to that the obsolete key is #!...

oscargus · 2016-03-17T23:13:49Z

"The encoding of a .properties file is ISO-8859-1, also known as Latin-1."... Bad idea to encode it in UTF-8 then...

oscargus · 2016-03-17T23:25:34Z

Bah! Almost three hours because someone saved a file in an invalid encoding... Anyway, now I think that it is working and that the translations are slightly easier to maintain.

koppor · 2016-03-18T08:20:51Z

Does ISO-8859-1 really cover Russian characters?

oscargus · 2016-03-18T08:23:37Z

Somehow, yes. By using Unicode escaping it is claimed that it works anyway. What that means in practice I'm not really sure about. I also notes that the Japanese translation used UTF-8 without any problems, so I cannot say that I fully understand it...

matthiasgeiger · 2016-03-18T08:27:07Z

My Notepad++ says that all those files are saved in UTF8 (regardless what the comment says) - but russian was the only one not saved with "UTF8 without BOM".

oscargus · 2016-03-18T08:36:53Z

OK! I changed the encoding properties in Eclipse and then it worked, but clearly the diff was quite small... Maybe my edit in the Wiki was a bit quick...

simonharrer · 2016-03-18T09:12:55Z

Ok, then can this be merged?

Btw. we use a custom written class which enables loading properties files encoded in UTF8 instead of the default ISO....

oscargus · 2016-03-18T14:19:10Z

I added escaping for # and ! as well (a bit annoying is we happens to use the translation string "#mon# undefined" and it ends up to be a comment...).

I can also confirm that with the current format of the ru-files it works fine on my Windows 7 laptop.

Siedlerchr · 2016-03-18T15:51:03Z

I suppose it has sth do to with the Python script not reading the files in UTF8:
http://stackoverflow.com/questions/10971033/backporting-python-3-openencoding-utf-8-to-python-2
And I strongly would advise to let the properties files in UTF8, makes work for the tranlators easiert.

oscargus · 2016-03-18T16:06:30Z

No, nothing to do with Python. I think @matthiasgeiger s comment about "UTF8 without BOM" is the key thing here. (And no, the comment has nothing to do with it, not sure why it is there...)

I quite sure that the Russian files are indeed saved as UTF-8 now as well (based on the small final diff). #994 is a bit more doubtful though... Either way, good editors will handle it transparently, but we should probably wait before merging #994.

koppor · 2016-03-18T16:57:40Z


    public String getPropertiesKeyUnescaped() {
-        // space, = and : are not allowed in properties file keys
+        // space, #, !, = and : are not allowed in properties file keys


Why are'nt they repleaced here? - I don't get how the comment matches with the code.

I agree that the comment is not really clear, still correct, but for consistency it made sense to add the new escaped characters.

…e from the English translation files

…he tests are OK

oscargus · 2016-03-24T14:33:53Z

I've found a good way to actually store the files in UTF-8, even those that are now mixed (like the French translation). See the latest commit. Should I go ahead and convert all files?

koppor · 2016-03-24T16:16:43Z

Maybe @JabRef/translators should state their opinion here. Since popeye works perfectly, I would see no reason for keeping outdated encodings.

mlep · 2016-03-25T07:37:21Z

For me, files can be converted.

domwass · 2016-03-25T07:52:42Z

+1

Siedlerchr · 2016-03-28T16:03:53Z

Please also add a gradle task to call the new functionality

oscargus · 2016-03-28T16:07:51Z

It is already done. The (easiest) solution was to do both adding new and remove obsolete in the same command. I also tried to make the test print similar instructions, but it seems like that doesn't work...

Siedlerchr · 2016-03-28T16:30:44Z

@oscargus Hm the test instructions are coming from the convertPropertiesFile in LocalizationConsistencyTest
There the Sentence with "Execute...." is printed

oscargus · 2016-03-28T16:35:28Z

Maybe I didn't finish it...

matthiasgeiger · 2016-03-30T12:18:35Z

FYI the problem was that the fail(...) terminates the test before the information could be printed.

Just merged this in...

oscargus added [outdated] type: enhancement component: internationalization i18n status: ready-for-review Pull Requests that are ready to be reviewed by the maintainers labels Feb 25, 2016

oscargus force-pushed the obsoletetranslationkeys branch from 31419fe to 8080501 Compare March 5, 2016 14:29

oscargus force-pushed the obsoletetranslationkeys branch 2 times, most recently from b988726 to 6bddafb Compare March 17, 2016 21:09

oscargus mentioned this pull request Mar 17, 2016

Translations - what to translate? #989

Closed

koppor reviewed Mar 18, 2016
View reviewed changes

oscargus force-pushed the obsoletetranslationkeys branch from 0068ef5 to 95d49b8 Compare March 24, 2016 13:50

oscargus added 6 commits March 24, 2016 14:57

Added check for obsolete keys (no assertions though) and removed thos…

af0be0d

…e from the English translation files

Added assertions so tests fail on obsolete keys

aaf10e4

Updated Python scripts (thanks @simonharrer)

0c21286

Escaped all translation strings

22b2723

Changed encoding of Russian translation files to ISO-8859-1 and now t…

90740f8

…he tests are OK

Added escaping for # and !

d68cb85

oscargus force-pushed the obsoletetranslationkeys branch 2 times, most recently from 5c5063e to d78e814 Compare March 24, 2016 14:24

Updated translations

5c11f96

oscargus force-pushed the obsoletetranslationkeys branch from d78e814 to 5c11f96 Compare March 24, 2016 14:27

Used UTF-8 encoding for some of the translations (not escaped)

0febc16

oscargus mentioned this pull request Mar 25, 2016

[WIP] Translating some entries to pt_BR #1033

Merged

4 tasks

matthiasgeiger added this to the v3.3 milestone Mar 30, 2016

matthiasgeiger merged commit 0febc16 into JabRef:master Mar 30, 2016

oscargus deleted the obsoletetranslationkeys branch March 30, 2016 12:43

adaerr mentioned this pull request Feb 27, 2026

PDF-file metadata: Privacy Filtering all metadata #8

Merged

Uh oh!

Conversation

oscargus commented Feb 25, 2016

Uh oh!

simonharrer commented Feb 26, 2016

Uh oh!

oscargus commented Mar 5, 2016

Uh oh!

simonharrer commented Mar 6, 2016

Uh oh!

simonharrer commented Mar 17, 2016

Uh oh!

oscargus commented Mar 17, 2016 via email

Uh oh!

oscargus commented Mar 17, 2016

Uh oh!

simonharrer commented Mar 17, 2016

Uh oh!

simonharrer commented Mar 17, 2016

Uh oh!

oscargus commented Mar 17, 2016

Uh oh!

simonharrer commented Mar 17, 2016

Uh oh!

simonharrer commented Mar 17, 2016

Uh oh!

oscargus commented Mar 17, 2016

Uh oh!

oscargus commented Mar 17, 2016

Uh oh!

oscargus commented Mar 17, 2016

and ! should be escaped if they are the first character in the key

Uh oh!

simonharrer commented Mar 17, 2016

Uh oh!

oscargus commented Mar 17, 2016

Uh oh!

oscargus commented Mar 17, 2016

Uh oh!

oscargus commented Mar 17, 2016

Uh oh!

koppor commented Mar 18, 2016

Uh oh!

oscargus commented Mar 18, 2016 via email

Uh oh!

matthiasgeiger commented Mar 18, 2016

Uh oh!

oscargus commented Mar 18, 2016 via email

Uh oh!

simonharrer commented Mar 18, 2016

Uh oh!

oscargus commented Mar 18, 2016

Uh oh!

Siedlerchr commented Mar 18, 2016

Uh oh!

oscargus commented Mar 18, 2016

Uh oh!

koppor Mar 18, 2016

Choose a reason for hiding this comment

Uh oh!

oscargus Mar 18, 2016 via email

Choose a reason for hiding this comment

Uh oh!

oscargus commented Mar 24, 2016

Uh oh!

koppor commented Mar 24, 2016

Uh oh!

mlep commented Mar 25, 2016

Uh oh!

domwass commented Mar 25, 2016

Uh oh!

Siedlerchr commented Mar 28, 2016

Uh oh!

oscargus commented Mar 28, 2016 via email

Uh oh!

Siedlerchr commented Mar 28, 2016

Uh oh!

oscargus commented Mar 28, 2016 via email

Uh oh!

matthiasgeiger commented Mar 30, 2016