Newsgroups: sci.lang.japan,soc.culture.japan,fj.kanji,fj.misc
Path: galaxy.trc.rwcp.or.jp!jaist-news!cs.titech!wnoc-tyo-news!tiger!newsserver.jvnc.net!howland.reston.ans.net!usc!sdd.hp.com!saimiri.primate.wisc.edu!caen!batcomputer!munnari.oz.au!bunyip.cc.uq.oz.au!monu6!capek.rdt.monash.edu.au!jwb
From: jwb@capek.rdt.monash.edu.au (Jim Breen)
Subject: New Versions of EDICT and KANJIDIC released
Message-ID: <jwb.739693938@capek.rdt.monash.edu.au>
Sender: news@monu6.cc.monash.edu.au (Usenet system)
Organization: Monash University, Melb., Australia.
Date: Thu, 10 Jun 1993 06:32:18 GMT
Lines: 57
Xref: galaxy.trc.rwcp.or.jp fj.kanji:1398 fj.misc:3343
X-originally-archived-at: http://galaxy.rwcp.or.jp/text/cgi-bin/newsarticle2?ng=fj.kanji&nb=1398&hd=a
X-reformat-date: Mon, 18 Oct 2004 15:18:22 +0900
X-reformat-comment: Tabs were expanded into 4 column tabstops by the Galaxy's archiver. See http://katsu.watanabe.name/ancientfj/galaxy-format.html for more info.


New versions of the EDICT and KANJIDIC files have been released and are
available on monu6.cc.monash.edu.au in the /pub/nihongo directory.

EDICT (PD Japanese-English Dictionary File): this version has about
2,000 extra lines. Most of the new entries have come from Kurt Stueber,
Jeffrey Friedl and Ken Lunde, as well as from my own background activity
adding translations to the WNN pubdic material. In addition, Jeffrey
Friedl has put a lot of work into tracking down many okurigana
mismatches and regularizing the romaji on person and place names. The
remaining "qqq" entries have all been corrected, confirmed or deleted.
I think this release is a significant improvement in the quality of the
file.

EDICT is available in plain text, .Z and .zoo forms.

KANJIDIC (PD Kanji information file) has had some general tidying, and the
missing "Nelson" numbers, readings and meanings have been assigned to
about 100 extra kanji. Nearly 5000 kanji now have these numbers. The file
is available as text and .Z, but NOT in the kinfo*.* form yet.

SOME OTHER NEWS

V2.3 of JDIC and JREADER are about to finish beta testing. The main features
of this version are:

JDIC:
- ability to select kanji by Halpern no. and kuten code.
- ability to suppress (selectively) the proper name entries from
  displaying.
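
For readers unfamiliar with kuten codes, the sketch below shows the
standard relation between a kuten code and the JIS X 0208 code point. It
is not JDIC's code; the function names and the four-digit string input
(two digits of ku, two of ten) are assumptions made for illustration.

```python
def kuten_to_jis(kuten):
    """Convert a 4-digit kuten string (assumed format) to a JIS X 0208 code."""
    ku, ten = divmod(int(kuten), 100)
    if not (1 <= ku <= 94 and 1 <= ten <= 94):
        raise ValueError("ku and ten must each be in the range 1..94")
    # Each JIS byte is the row (ku) or cell (ten) number plus 0x20.
    return ((ku + 0x20) << 8) | (ten + 0x20)


def kuten_to_char(kuten):
    """Return the character for a kuten code, decoding via EUC-JP."""
    jis = kuten_to_jis(kuten)
    # EUC-JP sets the high bit on both JIS bytes.
    euc = bytes([(jis >> 8) | 0x80, (jis & 0xFF) | 0x80])
    return euc.decode("euc_jp")


print(hex(kuten_to_jis("1601")))
print(kuten_to_char("1601"))
```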

JREADER:
- a verb "deinflecter": if the text string being sought is a common
  verb (or adjective) inflection, e.g. TOnde, the dictionary is
  searched for the plain form first, e.g. TObu.
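
The idea of trying the plain form first can be sketched roughly as
follows. This is not JREADER's implementation: the rule table, the
function names and the toy dictionary are all hypothetical, and plain
romaji is used instead of kanji/kana text to keep the example short.

```python
# Hypothetical rule table: (inflected ending, plain-form ending).
RULES = [
    ("nde", "bu"), ("nde", "mu"), ("nde", "nu"),
    ("tte", "u"), ("tte", "tsu"), ("tte", "ru"),
    ("ite", "ku"), ("ide", "gu"), ("shite", "su"),
    ("katta", "i"),
]


def candidates(word):
    """Return the possible plain forms of an inflected romaji word."""
    out = []
    for ending, plain in RULES:
        if word.endswith(ending) and len(word) > len(ending):
            out.append(word[: -len(ending)] + plain)
    return out


def lookup(word, dictionary):
    """Try the plain-form candidates first, then the word as given."""
    for form in candidates(word) + [word]:
        if form in dictionary:
            return form, dictionary[form]
    return None


# Toy dictionary, for illustration only.
TOY_DICT = {
    "tobu": "to fly; to jump",
    "yomu": "to read",
    "takai": "high; expensive",
}

print(lookup("tonde", TOY_DICT))
print(lookup("takakatta", TOY_DICT))
```

Several rules can share one ending (e.g. "nde" may come from -bu, -mu or
-nu verbs), so each candidate is simply checked against the dictionary
in turn.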

In addition, the search has been sped up, and the word-wrap in the display
improved.

I hope to turn these loose next week, as well as new versions of the 
kinfo*.* files, which are needed because of the new Halpern indices.

V1.1 of XJDIC is in early testing. The main changes are:

- a built-in romaji-kana FEP. You prefix the text with @ or #, and
  it goes into kana as you type. Both Hepburn & kunrei work. The
  mappings are in a user-modifiable file.
- a more flexible kanji index selection method, so that any key
  can be used.
- the verb deinflection and the proper name, etc. filters that are
  going into JDIC and JREADER.
- ability to specify stroke count when selecting a kanji by bushu no.
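
A table-driven romaji-to-kana conversion of the kind described in the
first item above can be sketched as below. This is not XJDIC's code. The
mapping table is a small inline sample rather than a user-modifiable
file, the function names are invented, and the choice of @ for hiragana
and # for katakana is an assumption made for this example.

```python
# Sample table only; Hepburn and kunrei spellings map to the same kana.
ROMAJI_TO_HIRAGANA = {
    "a": "あ", "i": "い", "u": "う", "e": "え", "o": "お",
    "ka": "か", "ki": "き", "ku": "く", "ke": "け", "ko": "こ",
    "sa": "さ", "shi": "し", "si": "し", "su": "す", "se": "せ", "so": "そ",
    "ta": "た", "chi": "ち", "ti": "ち", "tsu": "つ", "tu": "つ",
    "te": "て", "to": "と",
    "na": "な", "ni": "に", "nu": "ぬ", "ne": "ね", "no": "の",
    "ha": "は", "hi": "ひ", "fu": "ふ", "hu": "ふ", "he": "へ", "ho": "ほ",
    "n": "ん",
}
MAX_KEY = max(len(key) for key in ROMAJI_TO_HIRAGANA)


def to_hiragana(romaji):
    """Greedy longest-match conversion using the table above."""
    out = []
    i = 0
    while i < len(romaji):
        for size in range(MAX_KEY, 0, -1):
            chunk = romaji[i:i + size]
            if chunk in ROMAJI_TO_HIRAGANA:
                out.append(ROMAJI_TO_HIRAGANA[chunk])
                i += len(chunk)
                break
        else:
            out.append(romaji[i])  # pass unknown characters through
            i += 1
    return "".join(out)


def to_katakana(hiragana):
    """Shift hiragana to katakana (the Unicode blocks are 0x60 apart)."""
    return "".join(
        chr(ord(c) + 0x60) if "ぁ" <= c <= "ゖ" else c for c in hiragana
    )


def convert(text):
    """Assumed prefixes: @ selects hiragana, # selects katakana."""
    if text.startswith("@"):
        return to_hiragana(text[1:])
    if text.startswith("#"):
        return to_katakana(to_hiragana(text[1:]))
    return text


print(convert("@tukusi"))
print(convert("#tsukushi"))
```

Keeping both spellings as separate keys in one table is what lets Hepburn
and kunrei input coexist without any special-case logic.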

I hope to have it ready for beta testing by next month.
-- 
Jim Breen          [$B%8%`(B.$B%V%j!<%s(B@$B%b%J%7%eBg3X(B]
Department of Robotics & Digital Technology. Monash University. 
Wellington Rd, Clayton VIC 3168 Australia (ph) +61 3 565 3298 
(fax) +61 3 565 3574  AARNet/Internet:j.breen@rdt.monash.edu.au  
