[vox] Tool to identify & convert character sets?

Mark K. Kim vox@lists.lugod.org
Wed, 4 Feb 2004 12:54:19 -0800 (PST)


  This message is in MIME format.  The first part should be readable text,
  while the remaining parts are likely unreadable without MIME-aware tools.
  Send mail to mime@docserver.cac.washington.edu for more info.

---909110542-1934179118-1075928059=:6530
Content-Type: TEXT/PLAIN; charset=US-ASCII

Still don't know any command-line tool to identify character sets, but I
found out you can use iconv to convert character sets.  Woohoo!  Try
converting the attached [very very small] text file (in EUC-KR Korean
text) to UTF-8 like this:

   iconv -f euc-kr -t utf-8 euc-kr.txt

-Mark =)

On Fri, 30 Jan 2004, Mark K. Kim wrote:

> Hi.  Do you guys know of any tools to identify character sets of a text
> string, and/or convert it to any other [valid] character set?  More
> specifically, I want to identify and convert to/from UTF-8 and EUC-KR.
>
> Thanks!
>
> -Mark

-- 
Mark K. Kim
AIM: markus kimius
Homepage: http://www.cbreak.org/
Xanga: http://www.xanga.com/vindaci
Friendster: http://www.friendster.com/user.jsp?id=13046
PGP key fingerprint: 7324 BACA 53AD E504 A76E  5167 6822 94F0 F298 5DCE
PGP key available on the homepage
---909110542-1934179118-1075928059=:6530
Content-Type: TEXT/plain; charset=X-UNKNOWN; name="euc-kr.txt"
Content-Transfer-Encoding: BASE64
Content-ID: <Pine.LNX.4.58.0402041254190.6530@bolt.sonic.net>
Content-Description: 
Content-Disposition: attachment; filename="euc-kr.txt"

vsiz58fPvLy/5D8NCg==

---909110542-1934179118-1075928059=:6530--