alignUI: Indian Language support with utf-8 encoding #2

indiclinguist · 2017-08-31T04:11:12Z

I find this tool very useful. So I compiled the jar as per the instruction given in manula.html. However, I am unable to work on Bangla (e.g., vrinda in MS) and Oriya (e.g., Kalinga in MS) Scripts. Sample parallel files are attached. It works correctly for test.hn (hindi) and English.
The exact issue is that the alignUI does not display test.mn or test.or sentences. It shows blank boxes as shown in the image file. But, when I link the words blindly it creates desired output file as expected in UTF-8 (aligned.2 file in the sample-utf8-data.zip). Neither the sentence pane nor the Wordbuttons displays the test.mn or test.or text properly.

The other minor issues are:
(1) There is no warning pop-up as described in the manual, which could prevent incomplete alignment of any pair.
(2) The current sentence number is displayed. It could be really useful for the annotator to have the Total number# and the completed sentence number# displayed along side the current Sentence ID (pair_id) at the top frame.

Can anyone help to resolve the above issues? Thanks in Advance.

sample-utf8-data.zip

desilinguist · 2017-08-31T12:45:29Z

Thanks for filing the issue. However, I'm going to be away at a conference starting Saturday for a couple of weeks so I won't have a chance to look at this until then.

…

On Thu, Aug 31, 2017 at 12:11 AM indiclinguist ***@***.***> wrote: I find this tool very useful. So I compiled the jar as per the instruction given in manula.html. However, I am unable to work on Bangla (e.g., vrinda in MS) and Oriya (e.g., Kalinga in MS) Scripts. Sample parallel files are attached. It works correctly for test.hn (hindi) and English. The exact issue is that the alignUI does not display test.mn or test.or sentences. It shows blank boxes as shown in the image file. But, when I link the words blindly it creates desired output file as expected in UTF-8 (aligned.2 file in the sample-utf8-data.zip). Neither the sentence pane nor the Wordbuttons displays the test.mn or test.or text properly. The other minor issues are: (1) There is no warning pop-up as described in the manual, which could prevent incomplete alignment of any pair. (2) The current sentence number is displayed. It could be really useful for the annotator to have the Total number# and the completed sentence number# displayed along side the current Sentence ID (pair_id) at the top frame. Can anyone help to resolve the above issues? Thanks in Advance. sample-utf8-data.zip <https://github.com/desilinguist/wordalignui/files/1265728/sample-utf8-data.zip> — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub <#2>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AAJ3kFF_uU8y_dJPmQGVXVevojAhS7gPks5sdjJggaJpZM4PIR4C> .

indiclinguist · 2017-09-01T07:42:27Z

OK. You can look at it when you are back. To add a few observation, I find that it works fine in Fedora23. In Mac, it works but fonts are not rendered in the panel and button. In windows 7 and 10, except English the fonts are displayed as white boxes. Hope this helps in resolving the issue.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

alignUI: Indian Language support with utf-8 encoding #2

alignUI: Indian Language support with utf-8 encoding #2

indiclinguist commented Aug 31, 2017

desilinguist commented Aug 31, 2017 via email

indiclinguist commented Sep 1, 2017

alignUI: Indian Language support with utf-8 encoding #2

alignUI: Indian Language support with utf-8 encoding #2

Comments

indiclinguist commented Aug 31, 2017

desilinguist commented Aug 31, 2017 via email

indiclinguist commented Sep 1, 2017