tag:blogger.com,1999:blog-18790794.post4573126637555665414..comments2023-10-20T01:23:03.791-07:00Comments on Jigtenmig - Classical Tibetan Language Blog: Issues with ACIP to Unicode Conversion and MacOSXKenthttp://www.blogger.com/profile/16346226940219880822noreply@blogger.comBlogger5125tag:blogger.com,1999:blog-18790794.post-73646461867038582632008-07-16T12:40:00.000-07:002008-07-16T12:40:00.000-07:00Thanks, this will be handy, especially for convert...Thanks, this will be handy, especially for converting a large set of ACIP files using a bash on the command line.<BR/><BR/>Now, for non-programmers, this all could daunting. There's an option with MacOSX to make an icon that accepts files, and underneath it will trigger bash scripts, so if I ever had more spare time something like this would be handy for those who don't dare to open up the terminal app.Kenthttps://www.blogger.com/profile/16346226940219880822noreply@blogger.comtag:blogger.com,1999:blog-18790794.post-22664605500380098412008-07-13T07:49:00.000-07:002008-07-13T07:49:00.000-07:00I've converted several hundred of pages of our pro...I've converted several hundred of pages of our project with JSKAD on Mac OS X <BR/>(http://www.ittm.org/projects/dataInput/<BR>DataInputProject.htm)<BR/><BR/>If you familiar with Terminal on Mac OS X, try the following command:<BR/><BR/>java -Dthdl.acip.to.unicode.conversions.use.0F52.et.cetera = true -cp PATH/lib-vanilla/Jskad.jar org.thdl.tib.input.TibetanConverter --colors no --warning-level None --acip-to-tibetan-warning-and-error-messages long --acip-to-unicode ACIP_file.txt >> UNICODE_file.txt<BR/><BR/>Replace PATH with the JSKAD path and ACIP_file.txt is the input file and UNICODE_file.txt the output file.<BR/><BR/>BTW, JSKAD don't include a UTF-8 BOM at the beginning of the file, which is in hexa: EF BB BF.<BR/><BR/>Hope this helps,<BR/>DanielAnonymousnoreply@blogger.comtag:blogger.com,1999:blog-18790794.post-60241507327848575242008-05-12T20:47:00.000-07:002008-05-12T20:47:00.000-07:00I found the same thing as you when using Jskad.jar...I found the same thing as you when using Jskad.jar, but I did manage to get readable unicode from a ACIP file in a two step process.<BR/><BR/>First go to:<BR/>Tools→Launch Converter...→ACIP to Wylie (Text->Text)→Convert<BR/><BR/>Second:<BR/>Close the converter dialogue and open the text file that was produced by the converter and select all and copy then paste into Jskad.<BR/><BR/>Third:<BR/>Select all in Jskad, then:<BR/>Tools→Convert All→Convert Tibetan Machine Web (non-Unicode) to Unicode.<BR/><BR/>Fourth:<BR/>Select All→Copy and paste into a text file. Save the file (make sure that the encoding is UTF) and you have a Unicode file.<BR/><BR/>It's a little clumsy, but not too bad, I think.<BR/><BR/>*Except that the Unicode stackings are far from perfect - at least in Windows. I still haven't tried with Linux. (I left a message about this at http://jigtenmig.blogspot.com/2008/03/tibetan-unicode-fonts-and-this-blog.html)Anonymousnoreply@blogger.comtag:blogger.com,1999:blog-18790794.post-89323590463840005552008-03-28T14:32:00.000-07:002008-03-28T14:32:00.000-07:00Thanks. I tried both the tested version of Jskad a...Thanks. I tried both the tested version of Jskad as well as last night's build. I suspect that the file header needs some specific information that this is a unicode 16 or UTF-8 file, but I'm no expert on Unicode files.<BR/><BR/>Maybe someone from the Jskad team is reading this...Kenthttps://www.blogger.com/profile/16346226940219880822noreply@blogger.comtag:blogger.com,1999:blog-18790794.post-31257314290043498502008-03-28T13:57:00.000-07:002008-03-28T13:57:00.000-07:00Uh-oh, I just tested to see for myself and got the...Uh-oh, I just tested to see for myself and got the same gibberish. ACIP>Unicode used to work fine. It doesn't seem like Leopard would have messed up a good thing, but I don't know what else has changed since then. I tried numerous different plain text encodings via TextEdit, but the all had different problems when converted.<BR/><BR/><BR/>That's really a shame. I was banking on ACIP>Uni on a Mac. I'll keep searching for a solution.Evan Osherowhttps://www.blogger.com/profile/13339656797303267262noreply@blogger.com